DevOps/QA Engineer

Sleeper
Summary
Join Sleeper's Risk & Trading team as a DevOps/QA Engineer and contribute to the development and maintenance of critical infrastructure and testing pipelines for our real-money gaming platform. You will design, build, and maintain CI/CD pipelines, develop infrastructure as code, build automated QA pipelines, collaborate with engineers and data scientists, monitor system performance, create fault-injection tests, and participate in post-mortem reviews. This role requires strong Python skills, experience with CI/CD and infrastructure automation tools, familiarity with cloud environments, and strong analytical and troubleshooting skills. The ideal candidate will have 3+ years of experience in DevOps, SRE, or QA roles, preferably in high-availability or finance/gaming environments. Sleeper offers a competitive salary and stock options, comprehensive health insurance, 401k, flexible working hours, and remote-first culture.
Requirements
- 3+ years of experience in DevOps, SRE, or QA roles—preferably in high-availability or finance/gaming-related environments
- Strong Python skills with experience in test frameworks (e.g., Pytest, Hypothesis)
- Experience with CI/CD pipelines (GitHub Actions, CircleCI, or similar) and infrastructure automation tools (Terraform, Ansible)
- Familiarity with cloud environments (GCP preferred) and container orchestration (Docker, Kubernetes)
- Strong analytical and troubleshooting skills; a passion for finding edge cases and preventing failure modes
Responsibilities
- Design, build, and maintain CI/CD pipelines for trading and pricing services (Python & Elixir)
- Develop infrastructure as code (Terraform, GCP-native tools) for deploying and scaling trading infrastructure
- Build automated QA pipelines to test settlement, pricing accuracy, and edge-case simulations across sporting events
- Work closely with engineers and data scientists to validate data integrity and model behavior in production
- Monitor system performance and implement observability tools (Prometheus, Grafana, etc.) to identify and mitigate production risks
- Create fault-injection tests and run load-testing simulations that replicate real-world market stress
- Participate in post-mortem reviews and help create actionable feedback loops for continuous improvement
Preferred Qualifications
- Understanding of real-time systems, pricing engines, or risk models is a big plus
- Bonus: Experience with Elixir or functional programming environments
Benefits
- Competitive salary and stock options
- Comprehensive health, dental, and vision insurance
- 401(k)
- Flexible working hours and remote-first culture
- Clear paths for career growth and leadership