Senior Site Reliability Engineer

Logo of Precisely

Precisely

📍Remote - United States

Job highlights

Summary

Join Precisely, a leader in data integrity, as a Senior Site Reliability Engineer. This role focuses on enhancing the stability, reliability, and efficiency of our global SaaS platform, working closely with various teams. You will design and implement tools and systems to improve resilience and address incidents, ensuring compliance with FedRAMP requirements. The position requires extensive experience in a global production environment, strong technical skills, and excellent collaboration abilities. Precisely offers a 'work from anywhere' culture and opportunities for career development.

Requirements

  • At least 5 years of experience in a global multi-tenanted production environment
  • Hands on skills on Kubernetes, AWS/GCP/Azure, Terraform/Cloudformation/Ansible
  • Strong knowledge on Linux fundamentals, experience troubleshooting production issues
  • Experience working in a 24x7 production environment
  • Strong understanding of SRE and general SaaS service management principles
  • Past experience working with SRE teams and handling on-call coordination challenges
  • Strong collaboration, communication and interpersonal skills
  • The ability to operate calmly in challenging and stressful situations
  • A deep understanding of Kubernetes and Cloud Networking or previous experience in infrastructure

Responsibilities

  • Partner closely with SaaS Development, Pipeline Engineering, and Platform Engineering teams to ensure that SRE is an integral part of Precisely’s Continuous Delivery model for SaaS applications
  • Design and build necessary tooling and automation to ensure that we are able to manage our cloud native infrastructure in a reliable, maintainable, observable and secure way
  • Establish a 24x7 incidence response process that addresses Precisely’s SLA for SaaS Products through efficient alerting, playbook documentation and blameless postmortems
  • Be part of the Global SRE team and build relationships across product management, development, and support organizations to socialize the culture of SRE
  • Drive the culture of observability through the SaaS development organization
  • Leads prioritization of reliability features and contributes to the design, development and delivery of effective tooling, alerts, and automated responses to identify and address reliability risks
  • Ensure appropriate security cloud tooling is planned for and implemented in the production environment
  • Regularly defend the quality, scalability and reliability of Precisely’s production SaaS environment
  • Collaborate with Federated Security and compliance teams to implement and maintain security controls and measures
  • Participate in Annual FedRAMP assessments and Significant Change assessments as necessary
  • Continuously monitor systems for compliance with FedRAMP security controls and report any deviations
  • Implement and manage strict access controls to ensure that only authorized personnel or systems have access to sensitive data and systems within the FedRAMP boundary

Preferred Qualifications

Exposure to solutioning for Big Data Applications in Cloud

Benefits

  • Work from anywhere culture
  • Opportunities for growth, learning and building community

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let Precisely know you found this job on JobsCollider. Thanks! 🙏