Manager, Bare Metal Infrastructure Support & Systems Operations

CoreWeave

💵 $120k-$160k
📍 Remote - United States

Summary

Join CoreWeave, a leading AI hyperscaler, as a versatile manager overseeing Infrastructure Support and Systems Operations teams. You will lead teams ensuring smooth hardware operations, build a new support group for dedicated infrastructure solutions, and manage incident resolution. This role requires 3+ years of experience leading physical infrastructure support teams, knowledge of Linux and server hardware, and excellent communication skills. Preferred qualifications include experience in cloud/data center environments and familiarity with NVIDIA GPU technologies. CoreWeave offers competitive salaries ($120,000-$160,000), comprehensive benefits (medical, dental, vision, life insurance, disability insurance, 401k, PTO), and a hybrid work environment with flexibility for remote work.

Requirements

  • 3+ years of experience leading teams focused on physical infrastructure support and incident resolution
  • Knowledge of Linux environments and basic networking
  • Strong understanding of server hardware, configuration, and troubleshooting
  • Excellent communication skills for collaboration across technical and non-technical teams
  • Strong organizational and project management skills

Responsibilities

  • Manage Systems Operations Team: Lead a skilled team responsible for maintaining and optimizing physical infrastructure across multiple client environments
  • Build and Lead a Dedicated Infrastructure Support Team: Develop and manage a team focused on supporting key infrastructure, handling escalations, and ensuring smooth hardware operations
  • Incident and Escalation Management: Oversee the resolution of infrastructure-related incidents, collaborating with internal teams to deliver effective solutions
  • Operational Efficiency: Improve support processes to enhance efficiency and reduce downtime, ensuring the infrastructure meets client expectations
  • Cross-functional Collaboration: Work closely with product, infrastructure, and other teams to ensure seamless delivery of infrastructure resources
  • Client Communication: Manage client communication during escalations and issue resolution, ensuring transparency and client satisfaction
  • Team Leadership and Development: Mentor team members, developing their skills to manage and maintain critical infrastructure effectively

Preferred Qualifications

  • Experience managing technical operations or infrastructure support teams in cloud or data center environments
  • Familiarity with distributed computing environments, networking, and storage infrastructure
  • Experience with NVIDIA GPU technologies
  • Knowledge of Kubernetes, Slurm, and Bright Cluster Manager technologies

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short- and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption
  • At CoreWeave, we are committed to operating as a hybrid workplace, offering employees flexibility in how they structure their time between in-office and remote work
