Summary
The job description is for a Database Reliability Engineer specializing in CockroachDB at LaunchDarkly. The role involves managing and maintaining CockroachDB clusters, monitoring performance, collaborating with development teams, addressing database issues, participating in on-call rotations, and more.
Requirements
- Extensive experience with CockroachDB administration, operations, and performance tuning
- Proficiency in SQL and database schema design, with a strong understanding of distributed database systems
- Familiarity with cloud services (AWS, GCP, Azure), containerization (Docker, Kubernetes),Β and infrastructure as code (Terraform)
- Extensive observability expertise with a major platform, ideally Datadog
- Demonstrated ability to work in a fast-paced, collaborative environment, with excellent problem-solving and communication skills
Responsibilities
- Manage and maintain CockroachDB clusters across multiple environments with high availability and cost efficiency
- Implement and oversee backup, restore, and DR procedures
- Monitor database performance, adjusting configuration and resources to meet evolving needs
- Conduct regular performance analysis and tuning of CockroachDB instances to ensure health and efficiency
- Collaborate with development teams to implement indexing strategies, improve schema design, and optimize application queries
- Engineer database solutions for scalability, employing best practices in clustering, partitioning, and replication
- Address and resolve database-related issues, ensuring minimal downtime and impact on users
- Participate in on-call rotations, providing front-line support and escalation for critical database incidents
- Work closely with engineering peers, security, and GRC teams to align database strategies with organizational goals
- Own database-related incidents including root cause analysis, observing trends, and diagnosing complex production issues
- Contribute to the development and refinement of database standards, guidelines, and procedures
- Own the full developer experience around the database, including operational stability, SQL tuning, capacity planning, consultation, education, and the client driver and code abstractions
- Mentor junior team members, fostering a culture of learning and continuous improvement
Benefits
Restricted Stock Units (RSUs), health, vision, and dental insurance, and mental health benefits in addition to salary