DevOps Engineer
AllCares
Summary
Join our team as a DevOps Engineer to design, build, and maintain scalable, secure, and production-grade infrastructure for a high-load SaaS platform. You will own cloud infrastructure, deployment automation, system reliability, and performance. Engineer infrastructure using automation, observability, cost-efficiency, and high availability. Design and maintain AWS infrastructure, implement IaC using Terraform or AWS CDK, and create CI/CD pipelines for automatic deployments. Monitor infrastructure health, investigate and resolve incidents, and optimize infrastructure for performance and cost. Collaborate with engineers to align infrastructure with software architecture. This role requires 5+ years of hands-on DevOps experience with a strong focus on AWS.
Requirements
- 5+ years of hands-on DevOps experience, with strong focus on AWS infrastructure
- Proven experience in a SaaS environment is mandatory: you understand the expectations around uptime, multi-tenancy, deployments, and rapid iteration
- Deep knowledge of AWS services: EC2, EKS, RDS, IAM, S3, Lambda, WAF, etc
- Experience setting up and maintaining robust CI/CD pipelines (with fast, safe deployments)
- Proficiency with IaC tools like Terraform or AWS CDK
- Solid skills with Docker, Kubernetes (EKS)
- Understanding of relational databases (especially MySQL/PostgreSQL) and advanced SQL
- Comfort with monitoring, alerting, and incident response in a production environment
- Familiarity with scripting/automation in Python or Bash
- Strong Linux/Unix administration skills
- English proficiency at least Pre-Intermediate
Responsibilities
- Design and maintain a secure, scalable, and highly available AWS infrastructure
- Implement Infrastructure as Code (IaC) using tools like Terraform or AWS CDK
- Create and maintain CI/CD pipelines for automatic, zero-downtime deployments (every 2 weeks)
- Monitor and manage infrastructure health, logs, metrics, and set up effective alerting systems
- Investigate and resolve incidents, perform root cause analysis, and prevent recurrences
- Continuously optimize infrastructure for performance and cost
- Collaborate closely with backend/frontend engineers to align infrastructure with software architecture
Preferred Qualifications
- Experience managing high-load and distributed systems in production
- Experience working with ClickHouse
- Exposure to GitOps practices and tools like ArgoCD or FluxCD
- Enthusiasm for clean code, automation, and operational excellence
Benefits
- Lead infrastructure efforts on a next-gen high-load SaaS platform
- Work in a product-driven environment with bi-weekly releases and full DevOps ownership
- Build with the latest cloud-native stack (AWS, EKS, IaC, GitOps)
- Collaborate with a senior, passionate engineering team
- Enjoy full remote flexibility and a product-first culture