Senior Data Scientist II

closed
MetroStar Logo

MetroStar

πŸ“Remote - Worldwide

Summary

Join MetroStar as a Sr. Data Scientist II and empower the design, development, and optimization of AI/ML models. You will drive progress on AI/ML components for DoD Search Portfolio products. Collaborate with cross-functional teams, optimize models for performance and scalability, and stay updated on AI advancements. Manage the lifecycle of AI/ML components, apply analytical methodologies, and document findings. This role requires a Bachelor's degree and 7-10 years of experience or a Master's degree and 5 years of experience, along with an active Top Secret clearance. MetroStar offers a generous benefits package, professional growth opportunities, and a commitment to a diverse and inclusive culture.

Requirements

  • Bachelor’s degree plus 7-10 years of data science experience, or a Masters Degree plus 5 years of data science experience
  • Active Top Secret clearance with the ability to obtain a SCI
  • Experience with ML fields, e.g., natural language processing, computer vision, statistical learning theory
  • Hands-on experience with Natural Language Processing (NLP), Large Language Models, text embedding, semantic query, use of generative AI for text, and retrieval augmented generation (RAG)
  • Familiarity with data preprocessing, feature engineering, and model evaluation techniques essential for machine learning projects
  • Strong understanding of various machine learning algorithms, including supervised and unsupervised learning, reinforcement learning, and neural networks
  • Experience with version control systems like Git, enabling effective collaboration and code management
  • Experience in an ML engineer or data scientist role building ML models
  • Experience writing code in Python, R, Scala, Java, C++ with documentation for reproducibility
  • Experience using Apache Spark/Databricks distributed compute environments for AI/ML workloads
  • Experience handling petabyte size datasets, diving into data to discover hidden patterns, using data visualization tools, writing SQL, and working with GPUs to develop models
  • Experience with cloud-based data persistence products, especially RDS PostgreSQL and PostgreSQL extensions such as pgvector
  • Experience writing and speaking about technical concepts to business, technical, and lay audiences and giving data-driven presentations

Responsibilities

  • Design, configure, develop, test, and support informatics and data science solutions for a wide array of technical use cases
  • Collaborate with cross-functional teams, including data scientists and software engineers to integrate AI solutions developed by other elements of CDAO or the DoD community into Search Portfolio products when appropriate
  • Optimize AI models for performance, scalability, and efficiency, leveraging cloud-based resources and distributed computing frameworks, specifically Apache Spark/Databricks. Ability to adapt code base to also run using GPU enabled Kubernetes clusters
  • Stay updated on and contribute to the latest advancements in AI research, applying new findings to improve Search Portfolio products
  • Manage the lifecycle of AI/ML components used in Search Portfolio products from research and development to deployment and optimization
  • Apply analytical methodologies to diagnose data-related challenges, implement solutions, and evaluate performance
  • Document and present requirements, design alternatives, and findings to team members and clients
  • Develop strategic, baselined, data modeling processes; accurately determine cause-and-effect relationships
  • Maintain and guide the development of common libraries and tools used by multiple teams
  • Aid in formulating a strategy on how to achieve rapid prototyping

Benefits

  • Generous benefits package
  • Professional growth
  • Valuable time to recharge
This job is filled or no longer available

Similar Remote Jobs