Senior Infrastructure Engineer

Docker, Inc
Summary
Join Docker's Infrastructure Engineering team and build, operate, and evolve the cloud-native platform powering Docker products. You will design resilient services, automate processes, and measure key performance indicators to support hundreds of engineers serving millions of users. Responsibilities include designing, developing, and shipping internal platform services using Go or Python, codifying infrastructure with Terraform and Go, and evolving Docker's ingress stack. The ideal candidate possesses strong software development skills, significant experience in cloud application operations, and a solid foundation in Linux, networking, and cloud security. The role offers opportunities for growth and leadership, including leading major infrastructure initiatives and mentoring junior engineers. Docker provides a remote-first work environment with various benefits.
Requirements
- Strong software development skills in Go, Python, or similar (design, testing, and code review)
- Significant experience shipping and operating cloud applications/services in production (typically 5+ years of relevant work)
- Solid foundation in Linux, networking, and cloud security
- Excellent written and verbal communication in a remote environment
Responsibilities
- Build and run internal platform services (provisioning APIs, cost-optimisation tools, observability pipelines) on AWS
- Evolve our multi-tenant Kubernetes environment and networking layer to deliver secure, reliable, and cost-effective compute at global scale
- Drive reliability through code, embracing GitOps, Infrastructure as Code, and SLO-based operations
- Design, develop, and ship internal platform services (e.g. provisioning, cost insights, rate-limiting) in Go or Python
- Partner with product and engineering teams to provide paved-road patterns for deployment, observability, and security
- Codify infrastructure with Terraform and Go; champion GitOps best practices
- Define SLOs, lead on-call rotations, conduct blameless post-mortems, and implement preventive actions
- Evolve Docker’s ingress stack—Envoy Gateway, ALB/NLB, AWS VPC CNI—to deliver secure, reliable, and cost-efficient request routing
- Operate and scale multi-tenant EKS clusters; guide the evaluation and adoption of new infrastructure technologies
Preferred Qualifications
- Kubernetes ecosystem (EKS, ingress, CNI, service mesh)
- Observability tooling (OpenTelemetry, Prometheus, Grafana)
- CI/CD & release automation (GitHub Actions, Argo CD)
- Cost optimisation at scale (FinOps, capacity modelling)
- Distributed systems, containers, and Go-based platform tooling
Benefits
- Freedom & flexibility; fit your work around your life
- Designated quarterly Whaleness Days
- Home office setup; we want you comfortable while you work
- 16 weeks of paid Parental leave
- Technology stipend equivalent to $100 net/month
- PTO plan that encourages you to take time to do the things you enjoy
- Quarterly, company-wide hackathons
- Training stipend for conferences, courses and classes
- Equity; we are a growing start-up and want all employees to have a share in the success of the company
- Docker Swag
- Medical benefits, retirement and holidays vary by country