Senior Site Reliability Engineer

Thumbtack Logo

Thumbtack

πŸ“Remote - Canada

Summary

Join Thumbtack's Site Reliability Engineering team and contribute to building a reliable, secure, and scalable platform. You will design and support resilient systems, prioritizing high performance and availability. This role involves collaborating across the business to develop and enhance existing capabilities, ensuring scalability and reliability of infrastructure and software. You will work with various engineering teams to build platform services. The position requires extensive fluency in AWS and Linux, expertise in large-scale distributed systems, and strong coding abilities. The role also involves troubleshooting, capacity planning, and on-call duties.

Requirements

  • Extensive fluency in AWS and Linux
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems across web technologies like: DNS, TLS, HTTP/S, TCP/IP
  • Ability to decompose complex problems while understanding the tradeoffs necessary to deliver impact
  • Demonstrable knowledge of instrumenting, operating, and observing a distributed system of microservices in a production cloud environment
  • Ability to effectively read, write, and debug code in programming languages like but not limited to: Python, Go, PHP, Javascript
  • Ability to communicate clearly and effectively to cross functional partners of various technical levels
  • Passion for reducing toil and improving developer experience

Responsibilities

  • Design, create, and maintain software and systems to improve the availability, scalability, and efficiency of Thumbtack's services
  • Set the architectural direction of infrastructure and platform services while supporting the engineering organization
  • Design and implement tools and processes used for deployment, change, service, and infrastructure management
  • Troubleshoot and debug critical systems throughout the SDLC
  • Contribute to the evolution and performance of capabilities we provide to engineering as a platform organization
  • Capacity planning and demand forecasting, anticipating performance bottlenecks
  • Participate in rotating on-call duties

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.