Senior Manager, Site Reliability Engineering

Precisely Logo

Precisely

πŸ“Remote - United States

Summary

Join Precisely, a leader in data integrity, as a Senior Site Reliability Engineering Manager to lead a team of SREs ensuring the stability, reliability, and efficiency of our global SaaS platform. You will collaborate with senior management and various teams to enhance product reliability, build the SRE team, and implement SaaS best practices. Responsibilities include managing cloud infrastructure for cost-efficiency, improving automation and incident management, and fostering a culture of continuous improvement. This role supports a FedRAMP authorized Cloud Service Offering, requiring collaboration with cross-functional teams to meet stringent compliance requirements. Precisely offers a 'work from anywhere' culture and is committed to employee career development.

Requirements

  • At least 5 years of experience in a global multi-tenanted production environment and at least 3 years of experience leading a diverse engineering team
  • Hands on skills on Kubernetes, AWS/GCP/Azure, Terraform/Cloudformation/Ansible
  • Strong knowledge on Linux fundamentals and experience troubleshooting production issues
  • Experience working in a 24x7 production environment
  • Strong understanding of SRE and general SaaS service management principles
  • Experience with cloud infrastructure tools (monitoring, deployment, security)
  • Strong leadership, collaboration, communication and interpersonal skills
  • The ability to operate calmly in challenging and stressful situations
  • Strong problem-solving and decision-making abilities
  • Ability to work effectively with cross-functional teams and stakeholders

Responsibilities

  • Ensure high availability, scalability, and security of cloud services across multiple geographies
  • Implement and improve automation, incident management, and capacity planning practices
  • Lead and mentor a team of Site Reliability Engineers
  • Oversee the management and optimization of cloud infrastructure for cost-efficiency
  • Maintain and improve observability, logging, and alerting systems
  • Collaborate closely with product development teams to facilitate delivery of new functionality and capabilities to our SaaS platform
  • Drive the adoption of DevOps and continuous delivery practices
  • Evaluate and implement new technologies and tools to enhance cloud infrastructure and operations
  • Foster a culture of continuous improvement, collaboration, and innovation

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.