Lead Infrastructure Engineer
closed
Pallon
Summary
Join Pallon, a spin-off from ETH Zurich, and become a seasoned infrastructure engineer responsible for our infrastructure, from high-performance GPU clusters to cloud systems. You will lead critical decisions around architecture, performance, and scale, collaborate with platform and computer vision teams, and solve real-world issues. This hands-on role requires 5+ years of experience owning infrastructure end-to-end, ideally in startups, and strong Linux fundamentals. You should be comfortable with all layers, from bare-metal servers to cloud-native tools, and possess strong coding skills for automation and debugging. The ideal candidate thrives with autonomy, is a fast learner, and holds a university degree in Computer Science or a related field. Pallon offers a chance to contribute to a positive societal impact, work on a novel product, and be part of a supportive team.
Requirements
- Youβve spent 5+ years owning infrastructure end-to-end, ideally in startup environments
- Youβre comfortable at every layer β from bare-metal servers and NVMe drives to container orchestration and cloud-native tools
- You have strong Linux fundamentals, and you know your way around networking, storage, and distributed systems
- You can code well enough to automate, debug, and build tooling across a variety of languages
- You communicate clearly and collaborate well β especially with engineers who arenβt infra specialists
- You thrive with autonomy and can manage your own priorities effectively
- Youβre curious and fast-learning, especially when tackling new tools or challenges
- You have a university degree in Computer Science or a related field
Responsibilities
- Design and build a custom GPU cluster for deep learning workloads
- Decide how we manage and scale our infrastructure β both on-prem and in the cloud
- Keep systems running smoothly and securely β from data pipelines to distributed training jobs
- Troubleshoot weird kernel errors, configure systemd units, or debug Kubernetes evictions
- Make calls on when to script, when to automate, and when to just fix the thing
Preferred Qualifications
- Experience with machine learning infrastructure or HPC clusters
- Familiarity with data engineering workflows and ETL pipelines
Benefits
- Contribute to a positive impact on society and the environment
- Develop a novel product that changes a whole industry
- Be part of a motivated, smart, fun, and supportive team of software engineers and AI researchers
- Own a part of Pallon and have a part in our success with our Employee Stock Option Plan (ESOP)
- Work from home or enjoy access to our beautiful office space located in ZΓΌrich
Similar Remote Jobs









