Reliability Lead
League
๐ต $80k-$113k
๐Remote - Canada
Please let League know you found this job on JobsCollider. Thanks! ๐
Job highlights
Summary
Join League's Reliability Team as a hands-on leader, contributing at least 75% individually while mentoring a Reliability Engineer. You will advance reliability capabilities, including reactive and proactive tools. Collaborate with cross-functional teams, report on system reliability, manage roadmaps, and design/build improvements. This role requires 8+ years of relevant experience, team lead experience, and expertise in cloud providers, monitoring systems, observability, and container orchestration. League offers comprehensive benefits, growth opportunities, equity participation, and wellness days.
Requirements
- 8+ years of relevant work experience in the reliability field
- Experience as a team lead or people manager, and interest to grow your career into management
- Experience with Cloud Providers - GCP preferred
- Experience with monitoring systems - Grafana/Prometheus preferred
- Experience implementing observability best practices
- Experience with container orchestration systems - Kubernetes preferred
- You know how to write high-quality and testable code in Python, Go, or other languages
- Experience working with automation in critical production environments, ensuring their reliability
- Experience analyzing and troubleshooting distributed systems
Responsibilities
- Collaborate with cross-functional teams to facilitate the adoption of reliability tooling and processes
- Report on and provide insights and recommendations on the reliability of League's systems
- Manage roadmap priorities, deadlines, and deliverables
- Design, build and improve capabilities to improve Leagueโs reliability posture and practices
- Support the adoption of Service Level Objectives (SLO) and error budgets
- Enhance the way availability, latency and overall system health is measured and monitored
- Improve tooling for observability, alerting and incident response across backend, web and mobile platforms
- Improve tooling and guidelines for Load and Performance testing
- Uphold quality standards by performing code reviews and monitoring performance
- Mentor other team members on reliability standards and best practices
- Compliance with Information Security Policies
- Compliance with Leagueโs secure coding practice
- Responsibility and accountability for executing League's policies and procedures
- Notification of HR, Legal, Compliance & Security of any incidents, breaches or policy violations
Preferred Qualifications
- Experience with Cloud Providers - GCP preferred
- Experience with monitoring systems - Grafana/Prometheus preferred
- Experience with container orchestration systems - Kubernetes preferred
Benefits
- Comprehensive Benefits : Generous health coverage for you and your family
- Growth Opportunities : Mentorship and a learning and development budget to support your professional growth
- Equity Participation : Share in the success of a high-growth company
- Wellness days : Take time off to reset and recharge
- Work flexibility: Flexibility to work from our Toronto HQ office or fully remote (within Canada only)
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
- ๐Worldwide
- ๐ฐ$170k-$240k๐United States
- ๐ฐ$100k-$202k๐United States, Worldwide
- ๐Worldwide
- ๐United States
- ๐United States
- ๐United States
- ๐United States
Please let League know you found this job on JobsCollider. Thanks! ๐