DevOps Engineer

Loop Returns Logo

Loop Returns

πŸ’΅ $123k-$184k
πŸ“Remote - United States

Summary

Join Loop as a DevOps (MLOps) Engineer and pioneer our machine learning operations capabilities. You will build robust infrastructure and deployment pipelines in AWS, owning the infrastructure for all productionalized ML models. Collaborate with ML engineers to ensure models are up-to-date, scalable, and observable. Design and implement CI/CD pipelines, establish ML operational best practices, and implement comprehensive monitoring solutions. Maintain model repositories and drive the adoption of Infrastructure as Code (IaC) principles. This role requires strong AWS expertise, proficiency in Python and key ML libraries, and experience with containerization and ML lifecycle management platforms. The position offers a blended work environment with options for in-office or remote work.

Requirements

  • 5+ years of experience in DevOps or MLOps Engineering roles, with at least 2+ years dedicated to hands-on experience specifically with machine learning operations, including enterprise-grade deep learning architectures (e.g., transformers, graph NNs, VAEs)
  • Bachelor’s degree or higher in Computer Science, Mathematics, Statistics, or a related quantitative discipline, or equivalent practical experience, is highly preferred
  • Deep expertise in AWS infrastructure and services, with a proven track record of deploying and managing scalable ML workloads in the cloud
  • Strong proficiency in Python and extensive experience with key machine learning libraries such as PyTorch, Pandas, and scikit-learn
  • Extensive experience with containerization technologies (ex: Docker) for packaging and deploying ML models
  • Demonstrated experience with ML lifecycle management platforms such as MLflow
  • Proven ability to thrive as a self-starter in ambiguous, greenfield environments, taking initiative and delivering solutions with minimal oversight
  • Excellent collaboration and communication skills, with a demonstrated ability to act as a critical bridge between machine learning, data science, and core engineering teams

Responsibilities

  • Design and implement scalable CI/CD pipelines for machine learning models within our AWS infrastructure, driving automated builds, deployments, and engineering excellence
  • Establish and evolve ML operational best practices in a greenfield environment, defining the standards for model versioning, reproducibility, and MLOps maturity
  • Collaborate closely with Machine Learning Engineers to understand model requirements and provide expert guidance on infrastructure, deployment strategies, and operationalizing their models
  • Implement and manage comprehensive monitoring and observability solutions for deployed ML models using tools like Datadog, ensuring high performance, accuracy, and quick issue resolution
  • Maintain and optimize ML model repositories to ensure efficient versioning and management of all model artifacts throughout their lifecycle
  • Drive the adoption of Infrastructure as Code (IaC) principles for ML infrastructure, ensuring reusability, consistency, and reliability across environments
  • Participate in the broader DevOps team ceremonies and planning, integrating ML Ops initiatives seamlessly into the overall engineering roadmap

Benefits

  • Medical, dental, and vision insurance
  • Flexible PTO
  • Company holidays
  • Sick & safe leave
  • Parental leave
  • 401k
  • Monthly wellness benefit
  • Home workstation benefit
  • Phone/internet benefit
  • Equity

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.