Senior Cloud Data Infrastructure Engineer

ClickHouse
Summary
Join ClickHouse's Cloud AutoScaling team as a software engineer to build a cutting-edge Cloud Native platform on top of the public cloud. You will improve the metrics pipeline, build algorithms for better autoscaling, and work on the autoscale and Kubernetes operator to support seamless vertical and horizontal auto-scaling. Collaborate with the core database team and other data plane teams to support auto-scaling use cases and internal infrastructure improvements. Architect and build a robust, scalable, and highly available distributed infrastructure. This role requires 5+ years of relevant software development experience and expertise in building and operating scalable, fault-tolerant, distributed systems. ClickHouse offers a flexible work environment, healthcare contributions, company equity, flexible time off, a home office setup allowance, global gatherings, and the opportunity to shape the company culture.
Requirements
- 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
- Experience building operators with Kubernetes, controller runtime
- Production experience with programming languages like Go, C++
- You are not a stranger to PagerDuty On-call, debugging things in production and are a strong problem-solver
- Expertise with a public cloud provider (AWS, GCP, Azure) and their infrastructure as a service offering (e.g., EC2)
- Experience with Data Storage, Ingestion, and Transformation (Spark, Kafka or similar tools)
- You are passionate about solving data problems at Scale
- You have excellent communication skills and the ability to work well within and across engineering teams
Responsibilities
- Build a cutting-edge Cloud Native platform on top of the public cloud
- Improve the metrics pipeline and build algorithms to generate better autoscaling statistics and recommendations
- Work on the autoscale and Kubernetes operator to support seamless Vertical and Horizontal Auto-scaling
- Work closely with our ClickHouse core development team and other data plane teams, partnering with them to support auto-scaling use cases as well as other internal infrastructure improvements
- Architecting and building a robust, scalable, and highly available distributed infrastructure
Preferred Qualifications
Experience with Python (uv, rye, fastAPI) Data Science (Pandas, NumPy etc) is good to have
Benefits
- Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
- Healthcare - Employer contributions towards your healthcare
- Equity in the company - Every new team member who joins our company receives stock options
- Time off - Flexible time off in the US, generous entitlement in other countries
- A $500 Home office setup if youβre a remote employee
- Global Gatherings β We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites