Summary
Join Coalition's Platform SRE team as a Senior Site Reliability Engineer and build and operate the infrastructure, tools, and paved roads that empower developers to deliver scalable, secure, and reliable software. You will work across the stack, collaborating with software engineers and security teams to improve platform resilience, security, and operability. Your contributions will evolve infrastructure as code, enhance CI/CD pipelines, and scale the internal developer platform. This high-impact role allows you to influence infrastructure development and operations across the company. Coalition values pragmatism and engineering excellence, with systems primarily written in Python and Go and operating entirely in AWS.
Requirements
- 6+ years of experience in SRE, DevOps, Cloud Engineering, or Software Development roles
- Hands-on experience operating production environments in AWS
- Proficiency in Go or Python, with experience building production-grade automation, tooling, or libraries
- Strong experience with Terraform or similar infrastructure as code tools
- Experience working with CI/CD tools such as GitHub Actions
- Experience with container orchestration platforms like ECS or Kubernetes
- Experience designing and implementing reusable platform components based on team requirements
- Experience implementing observability practices, including system metrics, distributed tracing, and SLOs
- Exposure to failure-based testing approaches and automated recovery strategies
- Strong communication skills, both written and verbal
- Experience mentoring engineers on reliability best practices
Responsibilities
- Build and operate the infrastructure, tools, and paved roads that empower developers to deliver scalable, secure, and reliable software with speed and confidence
- Work across the stack, from infrastructure automation and observability to developer enablement and system reliability
- Collaborate closely with software engineers and security teams to improve the resilience, security, and operability of our platform
- Help evolve our infrastructure as code, enhance CI/CD pipelines, and scale our internal developer platform to support a growing engineering organization
- Help shape our approach to decoupling systems, building self-service capabilities, and reducing toil through automation
Preferred Qualifications
- Experience with microservices architectures
- Exposure to Kafka or other event streaming systems
- Experience building internal developer platforms or self-service infrastructure
- Familiarity with systems security, compliance requirements, or hardening practices
Benefits
- 100% medical, dental, and vision coverage
- Flexible PTO
- Annual home office stipend and WeWork access
- Mental and physical wellness programs such as Headspace, Lumino, and more
- Competitive compensation and opportunity for advancement