Staff Software Engineer

CoreWeave Logo

CoreWeave

πŸ’΅ $230k-$275k
πŸ“Remote - United States

Summary

Join CoreWeave, a leading AI hyperscaler, as a Staff Software Engineer to lead efforts in building, maintaining, and optimizing highly scalable, reliable, and secure systems. You will be responsible for deploying and maintaining critical infrastructure, including internal production infrastructure, CoreWeave Kubernetes Service (CKS), and Bare Metal Node Management. This role involves working with some of the largest Kubernetes clusters and data centers, ensuring their uptime, reliability, and performance. You will lead and mentor engineers, design and implement highly available systems, and develop monitoring and alerting solutions. The position requires 7+ years of experience in software engineering or a related field and strong expertise in Kubernetes and containerization. CoreWeave offers a competitive salary, comprehensive benefits, and a hybrid work environment.

Requirements

  • 7+ years of experience in Software Engineering, Site Reliability Engineering, DevOps, or a related field
  • Strong expertise in Kubernetes, containerization, and microservices architectures
  • Expertise in monitoring and observability tools such as Prometheus, Grafana, Datadog, or Splunk
  • Strong scripting and automation skills using Python, Go, Bash, or similar languages
  • Strong Understanding of Linux fundamentals and principals
  • Deep understanding of networking, security best practices, and compliance frameworks (SOC 2, ISO 27001, etc.)
  • Proven track record of leading incident management and post-mortem analysis
  • Excellent problem-solving, analytical, and communication skills

Responsibilities

  • Lead and mentor engineers, fostering a culture of collaboration and continuous improvement
  • Design, implement, and maintain highly available, scalable, and secure computing environments in Kubernetes
  • Develop and refine monitoring, alerting, and observability solutions to enhance system reliability and performance
  • Manage Production Clusters and ensure development teams follow best practices for deployments and lifecycle of applications
  • Develop Applications and Kubernetes Operators in Go
  • Implement and Promote proper GitOps management for applications
  • Support the deployment and operations of CoreWeave’s Compute Infrastructure layer
  • Develop tooling and systems which bridge the gap between Linux, Networking, and Kubernetes
  • Develop software applications in GoLang

Preferred Qualifications

  • Knowledge of distributed systems, databases, and caching strategies
  • Experience working with large scale computing clusters

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.