πPortugal
Cloud DevOps/Site Reliability Engineer
closed
Inworld AI
π΅ $124k-$160k
πRemote - Canada
Summary
The job is for a Staff Cloud DevOps/Site Reliability Engineer at Inworld, a well-funded AI and game startup. The role involves managing the infrastructure, DevOps, and Site Reliability of their platform using Terraform, Helm, Kubernetes, AWS, Azure, or GCP, and CI/CD with GitOps.
Requirements
- Bachelor's degree in Computer Science, Engineering, or a related field
- 7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer (or as a software engineer with relevant experience)
- At least 2 years experience each with: Terraform, Helm, Kubernetes, AWS, Azure, or GCP, and CI/CD using modern tools (GitOps)
Responsibilities
- Infrastructure: Maintain and contribute to Infrastructure-as-Code (Terraform)
- DevOps and CI/CD Pipelines: Orchestrate pipelines using Github Actions, Helm, ArgoCD
- Microservices scalability: Kubernetes Administration
- Cloud Administration
- Site Reliability: Measure and monitor availability, latency, and overall service health, drive incident management and post-mortem analysis
Preferred Qualifications
- MLOps (building, orchestrating, and maintaining Machine Learning Pipelines)
- Prometheus / Grafana
- Multi-cloud deployments (2 or more)
- ArgoCD
- Network management and VPNs
Benefits
- In-office location: Vancouver, Canada
- Remote location: Canada
- The Canada base salary range for this full-time position is CAD $170,000 - $220,000. In addition to base pay, total compensation includes bonus, equity and benefits
This job is filled or no longer available