Senior Python Engineer - ML and Data Science

ClickHouse
Summary
Join ClickHouse, a leading provider of open-source column-oriented database systems, as a Senior Python Software Engineer. You will play a key role in enhancing ClickHouse's Python ecosystem, ensuring seamless integration for data scientists and machine learning teams. Your responsibilities will include improving the data science Python experience, contributing to Python integrations, working with the open-source ecosystem, optimizing performance, collaborating with internal and external teams, and advocating for best practices. This position requires strong proficiency in Python, experience with databases and data engineering workflows, and a deep understanding of distributed systems and OLAP databases. The ideal candidate will also have experience with data science libraries, open-source contributions, and strong communication skills. ClickHouse offers a flexible work environment, healthcare contributions, equity in the company, flexible time off, a home office setup allowance, global gatherings, and a chance to shape the company culture.
Requirements
- Strong proficiency in Python and familiarity with data science libraries and data processing frameworks such as Pandas, Polars, Scikit-learn, PyTorch
- Experience working with databases, query engines, or analytical tools (Snowflake, BigQuery, ClickHouse, DuckDB, etc.)
- Understanding of distributed systems and OLAP databases
- Hands-on experience with data engineering workflows and integrating databases with machine learning or analytics pipelines
- Contributions to open-source projects or experience working in an open-source development environment
- Strong problem-solving skills and ability to work in a cross-functional team
- Strong communication skills
Responsibilities
- Improve ClickHouse's Data Science Python experience: Design and implement features that simplify data ingestion, transformation, and analysis for data scientists using Python
- Contribute to ClickHouse's Python Integrations: Enhance existing ClickHouse’s Python integrations to provide a seamless data science experience
- Work with Open-Source Ecosystem: Contribute to ClickHouse’s open-source repositories, ensuring compatibility with popular data science toolkits
- Performance Optimization: Ensure efficient query execution and data handling when interfacing with Python
- Collaborate with Internal & External Teams: Work closely with product managers, engineers, and the data science community to gather feedback and refine features
- Advocate for Best Practices: Educate users on best practices for leveraging ClickHouse in data science applications via examples and reference architectures
Preferred Qualifications
- Prior experience with ClickHouse or cloud-based OLAP analytics systems
- Understanding of ML workloads and integration with analytical databases
- Prior experience working in a developer-first or database technology company
Benefits
- Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
- Healthcare - Employer contributions towards your healthcare
- Equity in the company - Every new team member who joins our company receives stock options
- Time off - Flexible time off in the US, generous entitlement in other countries
- A $500 Home office setup if you’re a remote employee
- Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites