Staff Engineer, Reliability Insights and Excellence

Stripe

๐Ÿ“Remote - United States

Summary

Join Stripe's reliability infrastructure team as an experienced distributed systems engineer. You will design, build, test, and maintain the core reliability platforms and tools used across the company, and collaborate with various teams to ensure Stripe's services are highly reliable and scalable. The role involves leading initiatives, mentoring engineers, influencing engineering teams to improve service reliability, and shaping the future of Stripe's reliability infrastructure. The ideal candidate has extensive experience in distributed systems, leadership skills, and a passion for customer success.

Requirements

  • 9+ years of engineering experience or equivalent combined work experience reflecting domain expertise
  • Hands-on experience designing and building large-scale distributed systems
  • Demonstrated experience leading initiatives that span multiple teams and using deep domain expertise to influence technical roadmap planning and execution
  • Demonstrated ability to effectively collaborate across multiple teams and stakeholders to drive business outcomes
  • Experience mentoring and investing in the development of engineers and peers

Responsibilities

  • Design, build, test, and operationalize end-to-end distributed systems reliability infrastructure and solutions that will be integrated into various services
  • Liaise with teams using this core infrastructure to ensure it meets their needs and expectations
  • Work cross-functionally to ensure Stripe can scale to meet our biggest customers' needs
  • Shape the plan for the growth of Stripe's reliability infrastructure
  • Mentor other engineers in the organization and review code
  • Manage projects, including measuring their impact and success and creating a maintenance and reliability plan for the future

Preferred Qualifications

  • Genuine interest and/or experience in debugging and troubleshooting complex distributed systems problems
  • Familiarity with the common patterns and practices for building reliable software
  • Experience with Kubernetes and Golang
