Summary

Join Abnormal Security as a Senior Site Reliability Engineer (SRE) and contribute to the reliability, scalability, and operational excellence of our systems and services. You will lead initiatives to improve operational maturity, driving change across the organization. As a senior team member, you will independently define and execute quarterly goals, create roadmaps, and own cross-functional projects. You will advocate for reliability, providing technical leadership, analysis, and mentorship. The ideal candidate possesses strong technical depth in distributed systems, a product-focused mindset, and excellent communication skills. You will be responsible for the operational maturity of services, partnering with product teams, and defining quarterly goals for the SRE team. You will also design and maintain systems, lead incident reviews, collaborate with engineering leadership, and mentor other engineers.

Requirements

8+ years of experience in infrastructure, DevOps, or Site Reliability Engineering roles
Deep knowledge of production-grade distributed systems and cloud-native architectures
Demonstrated experience managing service availability, latency, and incident response in production environments
Strong programming skills in Python, Go, or similar languages
Experience with Kubernetes, Terraform, and observability tools (e.g., Prometheus, Grafana, Datadog)
Proven ability to lead complex, multi-team initiatives and influence system design for reliability

Responsibilities

Own the operational maturity of services in the SRE software stack, driving architectural and tooling improvements
Proactively partner with product teams to embed SRE best practices and support services with operational challenges
Independently define and drive quarterly goals for the SRE team with measurable impact on system reliability and developer productivity
Design and maintain systems that promote observability, automated recovery, scalability, and resilience
Lead incident reviews and root cause analyses; ensure follow-up actions are implemented and shared across teams
Collaborate with engineering leadership to shape the team roadmap and contribute to company-wide reliability goals
Mentor other engineers and drive adoption of SRE principles throughout the engineering organization

Preferred Qualifications

Prior experience embedding with product engineering teams to support operational goals
Familiarity with AWS and multi-cloud environments (e.g., Azure, GCP)
Experience in regulated environments or with FedRAMP-compliant systems
Contributions to open-source SRE tooling or community knowledge sharing

Benefits

Bonus
Restricted stock units (RSUs)
Benefits

Senior Site Reliability Engineer

Abnormal Security

Summary

Requirements

Responsibilities

Preferred Qualifications

Benefits

Remote

DevOps

Senior

Share this job:

Similar Remote Jobs

Remote

DevOps

Senior

Remote

Software Development

Senior

Remote

Software Development

Senior

Remote

Software Development

Senior

Remote

DevOps

Senior

Remote

DevOps

Senior

Remote

DevOps

Senior

Remote

DevOps

Senior

Remote

DevOps

Senior