Senior Cloud Data Infrastructure Engineer

ClickHouse
Summary
Join ClickHouse's Cloud AutoScaling team as a software engineer to build a cutting-edge Cloud Native platform. You will improve the metrics pipeline, build auto-scaling algorithms, and work on the Kubernetes operator for seamless vertical and horizontal auto-scaling. Collaborate with the core database team and other data plane teams to support auto-scaling use cases and internal infrastructure improvements. Architect and build a robust, scalable, and highly available distributed infrastructure. This role requires 5+ years of relevant software development experience and expertise in building scalable, fault-tolerant, distributed systems. The position offers competitive salary and benefits, including flexible work environment, healthcare contributions, equity, flexible time off, and a home office setup allowance.
Requirements
- 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
- Experience building operators with Kubernetes, controller runtime
- Production experience with programming languages like Go, C++
- You are not a stranger to PagerDuty On-call, debugging things in production and are a strong problem-solver
- Expertise with a public cloud provider (AWS, GCP, Azure) and their infrastructure as a service offering (e.g., EC2)
- Experience with Data Storage, Ingestion, and Transformation (Spark, Kafka or similar tools)
- You are passionate about solving data problems at Scale
- You have excellent communication skills and the ability to work well within and across engineering teams
Responsibilities
- Build a cutting-edge Cloud Native platform on top of the public cloud
- Improve the metrics pipeline and build algorithms to generate better autoscaling statistics and recommendations
- Work on the autoscale and Kubernetes operator to support seamless Vertical and Horizontal Auto-scaling
- Work closely with our ClickHouse core development team and other data plane teams, partnering with them to support auto-scaling use cases as well as other internal infrastructure improvements
- Architecting and building a robust, scalable, and highly available distributed infrastructure
Preferred Qualifications
Experience with Python (uv, rye, fastAPI) Data Science (Pandas, NumPy etc) is good to have
Benefits
- Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
- Healthcare - Employer contributions towards your healthcare
- Equity in the company - Every new team member who joins our company receives stock options
- Time off - Flexible time off in the US, generous entitlement in other countries
- A $500 Home office setup if youβre a remote employee