Remote Staff HPC Infrastructure Engineer

closed
Logo of Guardant Health

Guardant Health

πŸ’΅ $148k-$199k
πŸ“Remote - United States

Job highlights

Summary

Join Guardant Health's HPC team as a strong technical engineer to maintain and grow the company's computational technology backbone, including scalable data storage, high-performance compute clusters, and software infrastructure.

Requirements

  • 4+ years of Linux/Unix administration, knowledge of Unix network protocols, TCP/IP network fundamentals, core infrastructure technologies and virtualization
  • 4+ years of large-scale data storage and compute clusters (HPC) infrastructure
  • 2+ years working in and with on-premise and cloud-based (AWS, Google, IBM and Azure) data-centers
  • 3+ years of building software release and ops processes and automation toolset
  • 4+ years providing documentation of system administration

Responsibilities

  • Help manage multiple HPC clusters and cluster file systems
  • Help research, develop and implement the next generation HPC solution
  • Troubleshoot the production system stack down to source code level e.g. shell scripts, python and others
  • Maintains, monitors, and supports the infrastructure environment and/or facilities
  • Used and maintained enhanced production monitoring and additional capability
  • Support improvements for increased system reliability and performance
  • Supports in a senior role multiple systems or applications of medium to high complex (complexity defined by size, technology used, and system feeds and interfaces) with multiple concurrent users, ensuring control, integrity, and accessibility
  • Work with offsite consultants to maintain the infrastructure
  • Work with vendors to troubleshoot, upgrade and repair systems as needed
  • Participate in a 24/7 on-call rotation

Preferred Qualifications

  • Experience administering IBM’s General Parallel File System
  • Experience administering Grid Engine scheduler
  • Experience administering SLURM scheduler
  • Experience with using Bright Cluster Manager
  • Experience with cloud bursting technologies
  • Experience with wide area file systems
  • Experience with docker and container technologies
  • Experience with Kubernetes, preferably with Certified Kubernetes Administrator (CKA)

Benefits

Hybrid Work Model: At Guardant Health, we have defined days for in-person/onsite collaboration and work-from-home days for individual-focused time

This job is filled or no longer available

Similar Remote Jobs