Senior MLOps Engineer

Guild
Summary
Join Guild as a Senior MLOps Engineer and play a key role in designing and implementing the infrastructure and tooling for efficient machine learning model and AI agent development, deployment, and iteration. You will collaborate with data scientists and engineers to streamline ML model production and ensure scalability, security, and reliability of the AI platform. This pivotal role involves providing technical leadership in adopting best practices for model governance and mentoring team members. Guild offers a competitive compensation package, including a base salary of $144,000-$211,000 and stock options. The company is committed to equal pay and embraces a distributed work model, with opportunities across 32 states.
Requirements
- 5β7 years of experience in MLOps, DevOps, software engineering, or related fields
- Strong experience in building and maintaining scalable machine learning infrastructure and pipelines
- Expertise with cloud platforms (AWS, Azure, or GCP), particularly in managed AI/ML services
- Proficiency with containerization (Docker, Kubernetes) and orchestration tools
- Experience in model deployment frameworks and serving infrastructure (TensorFlow Serving, TorchServe, FastAPI, etc.)
- Skilled in infrastructure-as-code tools like Terraform and familiarity with CI/CD automation (GitHub Actions, Jenkins)
- Deep understanding of ML lifecycle management, monitoring, version control, and experiment tracking tools (e.g., MLflow, Kubeflow, Weights & Biases)
- Strong coding skills, especially in Python, and familiarity with software engineering best practices
- Knowledge of monitoring, logging, and alerting systems for ML models in production
Responsibilities
- Design, implement, and maintain platforms for seamless deployment, management, and monitoring of ML models and AI agents
- Develop and optimize CI/CD pipelines tailored specifically for AI and machine learning workflows
- Collaborate closely with data scientists, software engineers, and product teams to streamline ML model productionization
- Ensure infrastructure is scalable, secure, and adheres to best practices in reliability and observability
- Provide technical leadership in adopting best practices for model governance, versioning, testing, and validation
- Continuously improve platform performance, efficiency, and ease-of-use to accelerate development cycles
- Mentor team members on MLOps standards, practices, and emerging technologies
Preferred Qualifications
Experience in MCP (model context protocol); any specific experience with Databricks MCP or AWS MCP is a plus
Benefits
- Access to low-cost, high-quality health care options through Cigna and Kaiser (due to coverage limitations, Kaiser is currently only available in CA & CO)
- Access to a 401k to help save for the future
- Open vacation policy for employees to rest and recharge
- 8 days of fully-paid sick leave, to take the time to heal and or recover
- Family-friendly benefits, including 12 weeks of parental leave for non-birthing parents and 18-20 weeks for birthing parents; 4-week ramp-up period for when employees return from a leave of 6 weeks or more; as well as employer-paid short-term and long-term disability, employer-sponsored life insurance, fertility and caregiving benefits
- Well-rounded wellness benefits including free and low cost mental health resources and financial wellbeing support services
- Education benefits and tuition assistance to help your future development and growth
Share this job:
Similar Remote Jobs


