LaunchDarkly is hiring a
Lead Site Reliability Engineer

closed
Logo of LaunchDarkly

LaunchDarkly

πŸ’΅ ~$170k-$200k
πŸ“Remote - United States

Summary

LaunchDarkly is a company that empowers teams to deliver software faster and more reliably by providing feature flags. The Software Reliability Engineer will lead the development of SRE tools and processes, improve service health and reliability metrics, help incident management, mentor team members, and drive technology adoption. The role requires experience with large-scale distributed systems, server-side web development, cloud providers, observability tooling, RDBMS technologies, and security practices.

Requirements

  • Demonstrable experience building and operating large-scale, highly available distributed systems
  • Comfort with server-side web development (e.g., in Java / Scala, Ruby, Python, Golang, Node.js)
  • Experience guiding the architectural direction and scalability considerations for new projects
  • Strong understanding and proactive management of security practices related to SRE, coordinating with our Security team to fortify infrastructure
  • Extensive experience working with major cloud providers, observability tooling, and RDBMS technologies is crucial for this role
  • Experience leading team ceremonies: project ideation, planning, grooming, and project retrospectives

Responsibilities

  • Lead the development and continuous refinement of SRE tools and processes to improve software delivery, observability, reliability and operational efficiency
  • Define and standardize service health and reliability metrics that align with business goals
  • Help improve the effectiveness of our incident management lifecycle and drive initiatives to train key roles involved in incident response and post-incident review process
  • Partner with various team members to define and mature our SRE culture through principles, technical frameworks, tooling, and processes
  • Drive the adoption of new technologies, system designs and best practices in code health, testing, observability, and service maintainability across teams
  • Proactively identify and resolve potential performance and scalability bottlenecks in our front-end and back-end systems and underlying infrastructure
  • Analyze the performance of SQL queries, suggest improvements and build guardrails for teams

Benefits

  • Target pay ranges based on Geographic Zones for Levels P4-P5: Zone 1: $183,600 - $235,000, Zone 2: $165,600 - $212,000, Zone 3: $156,510 - $200,000
  • Restricted Stock Units (RSUs), health, vision, and dental insurance, and mental health benefits in addition to salary
This job is filled or no longer available

Similar Jobs