Guardant Health is hiring a
Senior HPC Infrastructure Engineer

Logo of Guardant Health

Guardant Health

πŸ’΅ $138k-$187k
πŸ“Remote - United States

Summary

Join Guardant Health's HPC team as a strong technical engineer to maintain and grow the HPC infrastructure during its aggressive expansion, working with corporate IT, SQA, and DevOps/SRE teams.

Requirements

  • 2+ years of Linux/Unix administration, knowledge of Unix network protocols, TCP/IP network fundamentals, core infrastructure technologies and virtualization
  • 2+ years of large-scale data storage and compute clusters (HPC) infrastructure
  • 2+ years working in and with on-premise and cloud-based (AWS, Google, IBM and Azure) data-centers
  • 2+ years of building software release and ops processes and automation toolset
  • 2+ years providing documentation of system administration

Responsibilities

  • Assist in managing the HPC interconnect
  • Assist in integrating the HPC systems with the bandwidth on-demand system
  • Work with the networking infrastructure team to manage and optimize the connectivity to and from the HPC systems and locales
  • Help manage multiple HPC clusters and cluster file systems
  • Help research, develop and implement the next generation HPC solution
  • Troubleshoot the production system stack down to source code level e.g. shell scripts, python and others
  • Maintain, monitor, and support the infrastructure environment and/or facilities
  • Use and maintain enhanced production monitoring and additional capability
  • Support improvements for increased system reliability and performance
  • Support multiple systems or applications of medium to high complex (complexity defined by size, technology used, and system feeds and interfaces) with multiple concurrent users, ensuring control, integrity, and accessibility
  • Support systems at remote locations, including internationally
  • Work with offsite consultants to maintain the infrastructure
  • Work with vendors to troubleshoot, upgrade and repair systems as needed
  • Participate in a 24/7 on-call rotation

Preferred Qualifications

  • Experience administering IBM’s General Parallel File System
  • Experience administering Grid Engine scheduler
  • Experience administering SLURM scheduler
  • Experience with using Bright Cluster Manager
  • Experience with cloud bursting technologies
  • Experience with wide area file systems
  • Experience with docker and container technologies
  • Experience with Kubernetes, preferably with Certified Kubernetes Administrator (CKA)

Benefits

  • Hybrid Work Model: At Guardant Health, we have defined days for in-person/onsite collaboration and work-from-home days for individual-focused time
  • US base salary range for this full-time position is $138,700 to $187,300

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Jobs

Please let Guardant Health know you found this job on JobsCollider. Thanks! πŸ™