Senior Site Reliability Engineer

ServiceNow
Summary
Join ServiceNow's SRE team as a highly technical engineer responsible for maintaining and developing the reliability, scalability, and performance of our cloud infrastructure. You will leverage software development, networking, and systems engineering expertise to resolve issues, prevent recurring problems, and drive initiatives to improve infrastructure reliability. The role demands a passion for software development, in-depth Linux systems knowledge, automation skills, and experience in troubleshooting complex issues. Success requires a team-first attitude, excellent communication, and a commitment to automation. This position offers a competitive salary and benefits package, including health plans, 401(k) matching, and flexible time away.
Requirements
- Experience in leveraging or critically thinking about how to integrate AI into work processes, decision -making, or problem-solving. This may include using AI-powered tools, automating workflows, analyzing AI- driven insights, or exploring AIβs potential impact on the function or industry
- Solid understanding of Linux systems, networking, and container security
- Proficiency with infrastructure-as-code tools like Terraform and Ansible
- 4+ years of experience in SRE, DevOps, or cloud infrastructure role
- 4+ years of experience programming/scripting skills in Python, Go, Bash and JavaScript
- 4+ years of experience with Linux System Administration with deep knowledge of Linux systems
- 4+ years of experience operating and scaling Kubernetes in production environments
- Knowledge of database technologies including MySQL, MariaDB, and PostgreSQL
- Expertise with GitLab CI/CD and modern software delivery practices
- Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, etc.)
- Experience with Cloud technologies, Azure, AWS, and GCP
- Ability to leverage AI technologies to enhance system reliability, automate operational tasks, and optimize performance monitoring and incident response processes
- Team-first attitude and an uncompromising attention to detail
- Excellent collaboration and communication skills
Responsibilities
- Provide relief and sustainable resolution to issues within our infrastructure
- Use your experience in software development, systems engineering, and networking to proactively prevent repeatable issues
- Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design
- Join a culture of intolerance to manual activity which results in a highly automated environment delivering scalable solutions
Preferred Qualifications
Experience developing on the ServiceNow Platform is a bonus!
Benefits
- Health plans, including flexible spending accounts
- A 401(k) Plan with company match
- ESPP
- Matching donations
- A flexible time away plan
- Family leave programs
Share this job:
Similar Remote Jobs
