Staff Site Reliability Engineer

closed

MongoDB

💵 $147k-$289k
📍 Remote - United States

Summary

Join MongoDB's Fabric team as a Site Reliability Engineer (SRE) and contribute to building and maintaining a robust, secure, and globally connected multi-cloud network. This pivotal role requires 10+ years of experience in software and distributed systems, with deep networking expertise. You will collaborate with service-owning teams, participate in on-call rotations, and leverage your skills in automation to ensure system resilience and scalability. The position offers hybrid work arrangements or fully remote options within North America. MongoDB provides a supportive and enriching culture with various benefits, including generous parental leave and a comprehensive compensation package.

Requirements

  • Have 10+ years of experience working on software and operating distributed systems, with deep expertise in networking fundamentals and a good understanding of how the internet works, e.g. TCP/IP (including IPv6), DNS, TLS/mTLS, BGP, tunnels, overlays, and SDN principles
  • Possess a customer-focused mindset, driving improvements that benefit end-users
  • Value efficiency in processes and operations, and display a strong preference for automation over manual processes (“allergic to ops work”)
  • Be intimately familiar with modern cloud-based infrastructure and the network design primitives of at least one of AWS, Azure, or GCP, e.g. VPCs, subnetting, routing, VPNs, peering, private link / private service connect, and CDNs
  • Have a strong knowledge of service mesh and load-balancing concepts, and be eager to implement these in a multi-cloud environment

Responsibilities

  • Participate in the development of a reliable and resilient multi-cloud, globally connected network that is crucial for MongoDB’s services
  • Collaborate with service-owning teams to provide internal support, addressing technical issues and offering guidance on best practices for service-to-service connectivity
  • Participate in a 24/7 on-call rotation to swiftly resolve issues related to network architecture and service-to-service connectivity, ensuring minimal disruption and high availability

Benefits

  • Flexible paid time off
  • 20 weeks fully-paid gender-neutral parental leave
  • Fertility and adoption assistance
  • 401(k) plan
  • Mental health counseling
  • Access to transgender-inclusive health insurance coverage
  • Health benefits offerings

This job is filled or no longer available.