Senior MLOps Engineer

Guild Logo

Guild

πŸ’΅ $144k-$211k
πŸ“Remote - United States

Summary

Join Guild as a Senior MLOps Engineer and play a key role in designing and implementing the infrastructure and tooling for efficient machine learning model and AI agent development, deployment, and iteration. You will collaborate with data scientists and engineers to streamline ML model production and ensure scalability, security, and reliability of the AI platform. This pivotal role involves providing technical leadership in adopting best practices for model governance and mentoring team members. Guild offers a competitive compensation package, including a base salary of $144,000-$211,000 and stock options. The company is committed to equal pay and embraces a distributed work model, with opportunities across 32 states.

Requirements

  • 5–7 years of experience in MLOps, DevOps, software engineering, or related fields
  • Strong experience in building and maintaining scalable machine learning infrastructure and pipelines
  • Expertise with cloud platforms (AWS, Azure, or GCP), particularly in managed AI/ML services
  • Proficiency with containerization (Docker, Kubernetes) and orchestration tools
  • Experience in model deployment frameworks and serving infrastructure (TensorFlow Serving, TorchServe, FastAPI, etc.)
  • Skilled in infrastructure-as-code tools like Terraform and familiarity with CI/CD automation (GitHub Actions, Jenkins)
  • Deep understanding of ML lifecycle management, monitoring, version control, and experiment tracking tools (e.g., MLflow, Kubeflow, Weights & Biases)
  • Strong coding skills, especially in Python, and familiarity with software engineering best practices
  • Knowledge of monitoring, logging, and alerting systems for ML models in production

Responsibilities

  • Design, implement, and maintain platforms for seamless deployment, management, and monitoring of ML models and AI agents
  • Develop and optimize CI/CD pipelines tailored specifically for AI and machine learning workflows
  • Collaborate closely with data scientists, software engineers, and product teams to streamline ML model productionization
  • Ensure infrastructure is scalable, secure, and adheres to best practices in reliability and observability
  • Provide technical leadership in adopting best practices for model governance, versioning, testing, and validation
  • Continuously improve platform performance, efficiency, and ease-of-use to accelerate development cycles
  • Mentor team members on MLOps standards, practices, and emerging technologies

Preferred Qualifications

Experience in MCP (model context protocol); any specific experience with Databricks MCP or AWS MCP is a plus

Benefits

  • Access to low-cost, high-quality health care options through Cigna and Kaiser (due to coverage limitations, Kaiser is currently only available in CA & CO)
  • Access to a 401k to help save for the future
  • Open vacation policy for employees to rest and recharge
  • 8 days of fully-paid sick leave, to take the time to heal and or recover
  • Family-friendly benefits, including 12 weeks of parental leave for non-birthing parents and 18-20 weeks for birthing parents; 4-week ramp-up period for when employees return from a leave of 6 weeks or more; as well as employer-paid short-term and long-term disability, employer-sponsored life insurance, fertility and caregiving benefits
  • Well-rounded wellness benefits including free and low cost mental health resources and financial wellbeing support services
  • Education benefits and tuition assistance to help your future development and growth

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Remote Jobs