Summary
Join Raft, a customer-obsessed company specializing in distributed data systems, as a Principal MLOps Engineer. You will contribute to a real-time data platform for the Department of Defense, handling over a billion events daily with millisecond latency. Key responsibilities include deploying ML infrastructure, building MLOps pipelines, and developing a full-lifecycle ML platform. This role requires extensive experience with Kubernetes, Docker, cloud applications, and machine learning. The position is remote, with a preference for candidates local to Tampa, FL, and may involve up to 30% travel. Raft offers a highly competitive salary and benefits package.
Requirements
- 7+ years of relevant hands-on experience
- 5+ years of experience with Docker and Kubernetes, including provisioning production clusters and maintaining their compliance
- 5+ years of experience supporting enterprise cloud applications or infrastructure (AWS, Azure, etc.)
- Solid understanding of Helm charts
- Practical experience with Machine Learning on Kubernetes
- Experience managing clusters with GPU machines
- Experience building and maintaining machine learning platforms and pipelines
- Practical programming and scripting skills (Python preferred)
- Fast learner and analytical thinker who is creative, hands-on, and a strong communicator
- Able to work both independently and as part of a team
- Excellent problem-solving skills and attention to detail
- Proven experience with modern software development and engineering practices, including Scrum/Agile, Git, and DevOps
- Ability to obtain a Security+ certification within the first 90 days of employment with Raft
- Ability to obtain and maintain a Top Secret clearance
Responsibilities
- Deploy ML infrastructure
- Build MLOps pipelines
- Contribute to the development of a full-lifecycle ML platform
Preferred Qualifications
- Active security clearance, or a clearance held in the past
- Experience using or managing Kubeflow deployments
- Knowledge of Istio
- Comfortable provisioning and debugging complex CI/CD pipelines
- Prior experience with Terraform
- Local to Tampa, FL (the role is remote, but local candidates are highly preferred)
Benefits
- Highly competitive salary
- Fully covered healthcare, dental, and vision
- 401(k) and company match
- Take-as-you-need PTO plus 11 paid holidays
- Education & training benefits
- Annual budget for your tech and gadget needs
- Monthly box of yummy snacks to eat while doing meaningful work
- Remote, hybrid, and flexible work options
- Team off-sites in fun places
- Generous referral bonuses