Senior Site Reliability Engineer

ServiceNow
Summary
Join ServiceNow's SRE team as a highly technical engineer responsible for maintaining and enhancing the reliability, scalability, and performance of the ServiceNow cloud infrastructure. You will leverage your expertise in software development, systems engineering, and networking to proactively prevent issues, drive initiatives to improve infrastructure reliability, and contribute to a highly automated environment. This role requires a strong understanding of Linux systems, networking, container security, and experience with infrastructure-as-code tools like Terraform and Ansible. You will also need experience with programming languages like Python, Go, Bash, and JavaScript, as well as expertise in operating and scaling Kubernetes in production environments.
Requirements
- Experience in leveraging or critically thinking about how to integrate AI into work processes, decision -making, or problem-solving. This may include using AI-powered tools, automating workflows, analyzing AI- driven insights, or exploring AIβs potential impact on the function or industry
- Solid understanding of Linux systems, networking, and container security
- Proficiency with infrastructure-as-code tools like Terraform and Ansible
- 4+ years of experience in SRE, DevOps, or cloud infrastructure role
- 4+ years of experience programming/scripting skills in Python, Go, Bash and JavaScript
- 4+ years of experience with Linux System Administration with deep knowledge of Linux systems
- 4+ years of experience operating and scaling Kubernetes in production environments
- Knowledge of database technologies including MySQL, MariaDB, and PostgreSQL
- Expertise with GitLab CI/CD and modern software delivery practices
- Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, etc.)
- Experience with Cloud technologies, Azure, AWS, and GCP
- Ability to leverage AI technologies to enhance system reliability, automate operational tasks, and optimize performance monitoring and incident response processes
- Team-first attitude and an uncompromising attention to detail
- Excellent collaboration and communication skills
Responsibilities
- Provide relief and sustainable resolution to issues within our infrastructure
- Use your experience in software development, systems engineering, and networking to proactively prevent repeatable issues
- Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design
- Join a culture of intolerance to manual activity which results in a highly automated environment delivering scalable solutions
Preferred Qualifications
Experience developing on the ServiceNow Platform is a bonus!
Benefits
- Health plans, including flexible spending accounts
- A 401(k) Plan with company match
- ESPP
- Matching donations
- A flexible time away plan
- Family leave programs