Senior Manager of Reliability Engineering

Verasity Logo

Verasity

πŸ“Remote - Worldwide

Summary

Join PrizePicks, a rapidly growing sports company, as the Senior Manager of Reliability Engineering. You will lead and mentor a team of DBREs and SREs, developing and implementing a comprehensive reliability strategy. Responsibilities include overseeing incident response, performance optimization, capacity planning, and fostering collaboration across teams. This remote position, based anywhere in the U.S., offers a competitive compensation package and numerous benefits. If you are a highly motivated leader with a passion for building high-performance systems, apply now and become part of our innovative team.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field or equivalent experience
  • 8+ years of experience in a reliability engineering or related role, with at least 3 years in a leadership capacity
  • Deep understanding of reliability principles, methodologies, and best practices
  • Hands-on experience with database technologies (e.g., SQL, NoSQL) and cloud platforms (e.g., AWS, GCP, Azure)
  • Proficiency in scripting languages (e.g., Python, Bash) and automation tools (e.g., Ansible, Terraform)
  • Strong analytical, problem-solving, and troubleshooting skills
  • Excellent communication, interpersonal, and leadership abilities
  • Passion for technology and a commitment to excellence
  • You must be authorized to work for any employer in the U.S

Responsibilities

  • Lead Reliability Strategy: Develop and execute a comprehensive reliability strategy that aligns with our business objectives and ensures the high availability, performance, and scalability of our systems and applications
  • Team Leadership: Provide inspirational leadership and guidance to a team of DBREs and SREs, fostering their professional growth and development
  • Monitoring and Observability: Implement comprehensive monitoring and observability solutions to gain real-time insights into system health and performance
  • Incident Response: Oversee the incident response process, ensuring swift resolution of incidents, review and documentation of causes and contributing factors, and implementation of measures to prevent recurrence
  • Performance Optimization: Drive initiatives to optimize system performance, identify and resolve bottlenecks, and proactively address potential issues
  • Capacity Planning: Conduct capacity planning and forecasting to ensure adequate resources are available to meet current and future demands
  • Automation and Tooling: Champion the adoption of automation and tooling to enhance efficiency, reduce toil, and improve operational effectiveness
  • Collaboration: Foster strong partnerships with cross-functional teams, including development, operations, and security, to ensure alignment and collaboration on reliability initiatives
  • Continuous Improvement: Cultivate a culture of continuous improvement, encouraging experimentation, learning, and the adoption of best practices

Benefits

  • Company-subsidized medical, dental, & vision plans
  • 401(k) plan with company match
  • Stock options and bi-annual bonus
  • Uncapped PTO to encourage a healthy work/life balance (2-week MINIMUM required!)
  • Generous paid leave programs, including 16-week paid parental leave and disability benefits
  • Workplace flexibility and modern work schedules focused on getting the job done, not hours clocked
  • Company-wide in-person events and team outings
  • Lifestyle enhancement program
  • Company equipment provided (Windows & Mac options)
  • Annual performance reviews with opportunities for growth and career development
  • Annual bonus
  • Flexible PTO to encourage a healthy work/life balance (2 weeks STRONGLY encouraged!)

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.