Senior Site Reliability Engineer - Automation and Tooling

Ivanti
Summary
Join Ivanti as a Site Reliability Engineer (SRE) and contribute to the reliability, deployment, and operation of their SaaS product offerings. You will be responsible for researching, developing, and automating SRE tools and processes, driving solutions to reduce toil, and participating in code reviews and analysis. This role involves deploying, managing, and securing Ivanti's production SaaS environments in AWS and Azure, collaborating with cross-departmental teams to solve complex problems, and providing mentorship to junior SREs. The ideal candidate will have a strong software engineering background, experience with continuous deployment, SaaS delivery, and production incident response, and proficiency in Python, Java, Golang, C#, Bash, PowerShell, and public cloud platforms.
Requirements
- A BSc in Computer Science, a related field, or equivalent practical experience
- 7+ years relevant industry experience (4+ with an achieved BSc in Computer Science or Equivalent Degree)
- Demonstrated proficiency building applications and tooling with Python and at least one of the following languages: Java
- Golang
- C#
- Proficiency working with Bash interactively and programmatically
- Experience with PowerShell
- Proficiency working with public cloud platforms (AWS or Azure preferred)
- Experience with core Linux distributions (Debian, RHEL, SUSE, Slackware)
- Strong container orchestration experience (EKS or AKS preferred)
- Strong Operating System troubleshooting abilities
- Experience analyzing and troubleshooting scalable distributed computing systems
- Experience with deployment pipeline tools such as Ansible, Jenkins, and/or GitHub Actions
- Proficiency working with Infrastructure as Code (IaC) using Terraform, CloudFormation, and/or ARM
- A strong understanding of DevOps practices
- A desire to adopt and implement emergent technologies and best practices
- Strong verbal and written communication skills in English for the purposes of global collaboration
Responsibilities
- Researching, developing, maintaining, and automating both existing and new SRE tools and processes
- Driving solutions to reduce toil within SRE
- Participating in code review and analysis with SRE peers
- Composing and reviewing contributions to SRE documentation such as system documentation, technical guides, and runbooks
- Deploying, managing, and securing Ivantiโs production Software-as-a-Service (SaaS) environments in AWS and Azure
- Working with geographically dispersed, cross-departmental teams to solve difficult problems
- Provide mentorship and technical guidance to junior SREs to foster their professional growth
- Participate in on-call rotations for 24x7 coverage (follow-the-sun model) for incident response, issue triage, and
Preferred Qualifications
- Prior experience as a Site Reliability Engineer
- Certificates in one or more of the following categories or demonstrated certificate-equivalent knowledge: Cloud Development and architecture
- Kubernetes Administration
- Linux Administration
- Software engineering disciplines
- Experience developing for Azure Functions, AWS Lambda, or other โserverlessโ frameworks
- Experience with compliance frameworks such as SOC 2 Type 2, ISO-27001, FedRAMP, IRAP, ISMAP, etc
- Experience with user privacy regulations such as GDPR and PIPEDA