Cloud DevOps/Site Reliability Engineer

closed
Inworld AI

💵 $124k-$160k
📍 Remote - Canada

Summary

This is a Staff Cloud DevOps/Site Reliability Engineer role at Inworld, a well-funded AI and game startup. The role covers infrastructure, DevOps, and site reliability for the company's platform, using Terraform, Helm, Kubernetes, a public cloud (AWS, Azure, or GCP), and CI/CD with GitOps.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer (or as a software engineer with relevant experience)
  • At least 2 years of experience with each of the following: Terraform, Helm, Kubernetes, a public cloud (AWS, Azure, or GCP), and CI/CD using modern tools (GitOps)

Responsibilities

  • Infrastructure: Maintain and contribute to Infrastructure-as-Code (Terraform)
  • DevOps and CI/CD Pipelines: Orchestrate pipelines using GitHub Actions, Helm, and ArgoCD
  • Microservices scalability: Kubernetes Administration
  • Cloud Administration
  • Site Reliability: Measure and monitor availability, latency, and overall service health; drive incident management and post-mortem analysis

Preferred Qualifications

  • MLOps (building, orchestrating, and maintaining Machine Learning Pipelines)
  • Prometheus / Grafana
  • Multi-cloud deployments (2 or more)
  • ArgoCD
  • Network management and VPNs

Benefits

  • In-office location: Vancouver, Canada
  • Remote location: Canada
  • The Canada base salary range for this full-time position is CAD $170,000 - $220,000. In addition to base pay, total compensation includes bonus, equity and benefits
This job is filled or no longer available