Principal MLOps Engineer

Raft

💵 $140k-$225k
📍 Remote - United States

Job highlights

Summary

Join Raft, a customer-obsessed company specializing in distributed data systems, as a Principal MLOps Engineer. You will contribute to a real-time data platform for the Department of Defense, handling over a billion events daily with millisecond latency. Key responsibilities include deploying ML infrastructure, building MLOps pipelines, and developing a full-lifecycle ML platform. This role requires extensive experience with Kubernetes, Docker, cloud applications, and machine learning. The position is remote, with a preference for candidates local to Tampa, FL, and may involve up to 30% travel. Raft offers a highly competitive salary and benefits package.

Requirements

  • 7+ years of relevant hands-on experience
  • 5+ years of experience with Docker and Kubernetes, provisioning production clusters and maintaining their compliance
  • 5+ years of experience supporting enterprise cloud applications or infrastructure (AWS, Azure, etc.)
  • Solid understanding of Helm Charts
  • Practical experience with Machine Learning on Kubernetes
  • Experience managing clusters with GPU machines
  • Experience building and maintaining machine learning platforms and pipelines
  • Practical programming and scripting skills (Python preferred)
  • Fast learner, analytical thinker, creative, hands-on, strong communication skills
  • Able to work both independently and as part of a team
  • Excellent problem-solving skills and attention to detail
  • Proven experience with modern software development and engineering practices, including Scrum/Agile, Git, and DevOps
  • Ability to obtain a Security+ certification within the first 90 days of employment with Raft
  • Ability to obtain and maintain a Top Secret clearance

Responsibilities

  • Deploy ML infrastructure
  • Build MLOps pipelines
  • Contribute to the development of a full-lifecycle ML platform

Preferred Qualifications

  • Currently cleared, or held a clearance in the past
  • Experience using or managing Kubeflow deployments
  • Knowledge of Istio
  • Comfortable provisioning and debugging complex CI/CD pipelines
  • Prior experience with Terraform
  • Remote (Local to Tampa, FL is highly preferred)

Benefits

  • Highly competitive salary
  • Fully covered healthcare, dental, and vision
  • 401(k) and company match
  • Take-as-you-need PTO + 11 paid holidays
  • Education & training benefits
  • Annual budget for your tech/gadget needs
  • Monthly box of yummy snacks to eat while doing meaningful work
  • Remote, hybrid, and flexible work options
  • Team off-sites in fun places!
  • Generous referral bonuses

Please let Raft know you found this job on JobsCollider. Thanks! 🙏