Senior Site Reliability Engineer

Calendly
Summary
Join Calendly's Engineering team and become a Site Reliability Engineer, responsible for designing, building, maintaining, and operating our next-generation infrastructure platform. You will empower application engineering teams by leveraging your infrastructure expertise, enabling monitoring best practices, and advising on optimal infrastructure use. A typical day involves building tools and applications, evaluating and deploying open-source tools, exercising expertise in cloud infrastructure, and ensuring resilient infrastructure through Infrastructure as code. You will also maintain and improve infrastructure observability, participate in on-call rotations, and define standard practices and tooling. This role requires collaboration with application engineering teams and a commitment to continuous learning and knowledge sharing. Calendly offers a competitive salary and benefits package.
Requirements
- An eagerness and drive to learn, and the skills to share your knowledge with and mentor others
- Creative problem-solving skills, a keen eye for detail, and the ability to think your way through complex issues
- Comfortable working directly with internal-facing customers to understand needs/requirements and collaborate on solutions
- A strong understanding of the Linux operating system
- Strong technical knowledge of cloud infrastructure (especially GCP), distributed systems, and reliability practices
- Deep experience designing, building, and running highly available production infrastructure
- Strong Golang or Python development experience, especially writing APIs to build, orchestrate, and manage cloud infrastructure
- Solid working knowledge of patterns and principles for designing and implementing cloud native applications on Kubernetes, such as Controllers and Operators
- Robust knowledge of computer networking principles and extensive experience with cloud networking technologies to create scalable and secure environments
- Extensive working experience with software and infrastructure monitoring tools (especially Datadog)
- Authorized to work lawfully in the United States of America as Calendly does not engage in immigration sponsorship at this time
Responsibilities
- Building tools and applications to extend Calendly's infrastructure platform
- Evaluating and deploying cloud native open source tools
- Exercising expertise in cloud infrastructure concepts and patterns
- Instituting resilient infrastructure through Infrastructure as code
- Maintaining and improving observability of our infrastructure platform, and offering patterns for application teams to consume for application observability
- Participating in an on-call rotation to support the infrastructure platform
- Defining standard practices and tooling around new services, changes, incidents, postmortems, and capacity management, and working with application engineering teams to adopt those practices
Benefits
- Quarterly Corporate Bonus program (or Sales incentive)
- Equity awards
- Competitive benefits