Lead Infrastructure Engineer

closed
Pallon Logo

Pallon

πŸ“Remote - Switzerland

Summary

Join Pallon, a spin-off from ETH Zurich, and become a seasoned infrastructure engineer responsible for our infrastructure, from high-performance GPU clusters to cloud systems. You will lead critical decisions around architecture, performance, and scale, collaborate with platform and computer vision teams, and solve real-world issues. This hands-on role requires 5+ years of experience owning infrastructure end-to-end, ideally in startups, and strong Linux fundamentals. You should be comfortable with all layers, from bare-metal servers to cloud-native tools, and possess strong coding skills for automation and debugging. The ideal candidate thrives with autonomy, is a fast learner, and holds a university degree in Computer Science or a related field. Pallon offers a chance to contribute to a positive societal impact, work on a novel product, and be part of a supportive team.

Requirements

  • You’ve spent 5+ years owning infrastructure end-to-end, ideally in startup environments
  • You’re comfortable at every layer β€” from bare-metal servers and NVMe drives to container orchestration and cloud-native tools
  • You have strong Linux fundamentals, and you know your way around networking, storage, and distributed systems
  • You can code well enough to automate, debug, and build tooling across a variety of languages
  • You communicate clearly and collaborate well β€” especially with engineers who aren’t infra specialists
  • You thrive with autonomy and can manage your own priorities effectively
  • You’re curious and fast-learning, especially when tackling new tools or challenges
  • You have a university degree in Computer Science or a related field

Responsibilities

  • Design and build a custom GPU cluster for deep learning workloads
  • Decide how we manage and scale our infrastructure β€” both on-prem and in the cloud
  • Keep systems running smoothly and securely β€” from data pipelines to distributed training jobs
  • Troubleshoot weird kernel errors, configure systemd units, or debug Kubernetes evictions
  • Make calls on when to script, when to automate, and when to just fix the thing

Preferred Qualifications

  • Experience with machine learning infrastructure or HPC clusters
  • Familiarity with data engineering workflows and ETL pipelines

Benefits

  • Contribute to a positive impact on society and the environment
  • Develop a novel product that changes a whole industry
  • Be part of a motivated, smart, fun, and supportive team of software engineers and AI researchers
  • Own a part of Pallon and have a part in our success with our Employee Stock Option Plan (ESOP)
  • Work from home or enjoy access to our beautiful office space located in ZΓΌrich
This job is filled or no longer available

Similar Remote Jobs