Site Reliability Engineer

Tenderly
Summary
Join Tenderly, a leading Web3 infrastructure platform, as a Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our platform. You will architect and scale infrastructure, monitor and optimize system performance, lead incident response, collaborate with engineering and product teams, and innovate with new technologies. Leveraging our sophisticated tech stack (Kubernetes, Prometheus, GCP, PostgreSQL, Kafka, etc.), you will maintain our cutting-edge infrastructure and optimize services for seamless user experiences. We are seeking a seasoned SRE with extensive experience in SRE or DevOps roles, a proven track record with blockchain clients, and deep knowledge of Unix/Linux environments. This role requires strong problem-solving skills, excellent communication, and a collaborative spirit. Tenderly offers a comprehensive benefits package including stock options, transparent compensation, annual bonuses, fully covered parental leave, flexible hours, health insurance, and continuous learning opportunities.
Requirements
- Proven Expertise: Extensive experience in SRE or DevOps roles, with a track record of working with blockchain clients and supporting production systems at scale
- Problem-Solving Prowess: A systematic approach to problem-solving, coupled with strategic decision-making skills to ensure scalability and reliability
- Technical Proficiency: Deep knowledge of Unix/Linux environments and proficiency in scripting languages
- Team Collaboration: Excellent communication skills and a collaborative spirit to work effectively across teams and drive infrastructure improvements
Responsibilities
- Architect and Scale: Design high-level schematics for infrastructure and processes, ensuring scalability and alignment with business priorities
- Monitor and Optimize: Proactively monitor system performance, analyze scalability metrics, and optimize our platform for peak efficiency
- Lead Incident Response: Develop strategies and procedures for mitigating risks, preparing for contingencies, and minimizing disruptions
- Collaborate Across Teams: Work closely with engineering and product teams to implement infrastructure improvements and streamline deployment processes
- Innovate: Stay at the forefront of Web3 innovation, integrating new technologies and refining existing systems to maintain industry leadership
Benefits
- Stock Options - Join the ride! Own a piece of our journey with stock options for everyone
- Transparent Compensation - No secrets here. Enjoy fair, equal, and transparent salaries
- Annual Bonus - Performance-based bonus of up to 20% of your annual base salary, paid out twice a year based on KPIs
- Fully Covered Parental Leave - Welcome your little one with fully paid parental leave
- Tech Gear - MacBook and additional budget for your setup
- Flexible, No-Less-Than Vacation - We know that unlimited vacation policies usually have the opposite effect, so we set a minimum of 21.5 days to be used within a 12-month timeframe
- Flexible Hours - Work when you work best. Collaboration is key
- Health Insurance - Keep yourself and your family healthy with our comprehensive coverage
- Continuous Learning - Elevate your skills with regular training and a professional development budget. We invest in your growth