Hugging Face is hiring a
Site Reliability Engineer

Logo of Hugging Face

Hugging Face

πŸ’΅ ~$117k-$176k
πŸ“Remote - France

Summary

Hugging Face is seeking a Site Reliability Engineer to maintain and scale their product infrastructure. The role involves designing, developing, deploying, and maintaining reliable and scalable infrastructure, managing large Kubernetes clusters, measuring and optimizing system performance, patching infrastructure to avoid vulnerabilities, keeping important systems up and running, and providing primary operational support for multiple teams.

Requirements

  • 7+ years of experience in a Site Reliability Engineer or Infrastructure Engineer role
  • Strong knowledge of cloud providers such as AWS, GCP, infra-as-code frameworks and observability tools
  • Strong communication, collaboration, and documentation skills
  • Experience with Linux, Git, containers, networking and command line tools
  • Collaborate and communicate asynchronously

Responsibilities

  • Design, develop, deploy, and maintain reliable and scalable infrastructure
  • Manage large Kubernetes clusters
  • Measure and optimize system performance
  • Patch infrastructure to avoid vulnerabilities
  • Keep important, revenue-critical systems up and running despite outages and configuration errors
  • Provide primary operational support and engineering for multiple teams

Benefits

  • Health, dental, and vision benefits for employees and their dependents
  • Parental leave and flexible paid time off
  • Flexible working hours and remote options

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Jobs

Please let Hugging Face know you found this job on JobsCollider. Thanks! πŸ™