Senior Engineer, Compute Services

CoreWeave

💵 $185k-$210k
📍 Remote - United States

Summary

Join CoreWeave, a leading AI hyperscaler, as a Senior Engineer in Compute Services. You will design, develop, and maintain automated tooling for Kubernetes control planes, utilizing Python, Golang, and Bash. Responsibilities include performing day-2 lifecycle tasks, identifying and implementing fault-tolerant architectures, optimizing reliability, and designing automated testing. The ideal candidate possesses proven Kubernetes provisioning experience, advanced Linux skills, and extensive DevOps experience. CoreWeave offers a competitive salary, comprehensive benefits including medical, dental, vision, life insurance, and paid parental leave, and a flexible hybrid work environment.

Requirements

  • Proven experience provisioning Kubernetes using tools such as kubeadm, Cluster API, Kubeception, Kubespray, or similar
  • Demonstrated ability to debug complex Kubernetes cluster issues and carry out upgrades
  • Proficiency in Golang, Bash, and Python
  • Advanced Linux OS troubleshooting skills
  • Extensive experience with Ansible
  • Advanced DevOps experience (e.g., GitLab CI, GitHub Actions)
  • Demonstrated ability to collaborate effectively on shared codebases
  • Excellent documentation skills and high attention to detail
  • Strong analytical and problem-solving abilities
  • Experience participating in an on-call rotation to support production services

Responsibilities

  • Design, develop, and maintain automated tooling to provision Kubernetes control planes on bare-metal
  • Use Python, Golang, and Bash to create tooling and Go operators
  • Perform day-2 lifecycle tasks and maintenance on running clusters
  • Identify gaps and implement fault-tolerant architectures
  • Optimize reliability using the Grafana ecosystem
  • Design automated testing to validate build quality and stability
  • Participate in an on-call rotation every two months, serving as the point of contact

Preferred Qualifications

  • Bare-metal OS provisioning experience
  • Kubernetes operator coding experience
  • Advanced Linux networking expertise
  • AWX/Ansible Tower knowledge

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption
