πAustralia
Senior Software Engineer - Site Reliability

Abnormal Security
π΅ $176k-$230k
πRemote - United States
Please let Abnormal Security know you found this job on JobsCollider. Thanks! π
Summary
Join Abnormal Security as a Senior Software Engineer - Site Reliability and be responsible for the reliability, scalability, and operational excellence of our systems and services. Lead initiatives to improve operational maturity, driving change for stable operations. Define and execute quarterly goals, create roadmaps, and own cross-functional projects. Serve as a key advocate for reliability, providing technical leadership, deep analysis, and mentorship. The ideal candidate has strong technical depth in distributed systems, a product-focused mindset, and is a strong communicator and mentor. This role requires leading broad technical initiatives and improving service ownership and incident response practices.
Requirements
- 8+ years of experience in infrastructure, DevOps, or Site Reliability Engineering roles
- Deep knowledge of production-grade distributed systems and cloud-native architectures
- Demonstrated experience managing service availability, latency, and incident response in production environments
- Strong programming skills in Python, Go, or similar languages
- Experience with Kubernetes, Terraform, and observability tools (e.g., Prometheus, Grafana, Datadog)
- Proven ability to lead complex, multi-team initiatives and influence system design for reliability
Responsibilities
- Own the operational maturity of services in the SRE software stack, driving architectural and tooling improvements
- Proactively partner with product teams to embed SRE best practices and support services with operational challenges
- Independently define and drive quarterly goals for the SRE team with measurable impact on system reliability and developer productivity
- Design and maintain systems that promote observability, automated recovery, scalability, and resilience
- Lead incident reviews and root cause analyses; ensure follow-up actions are implemented and shared across teams
- Collaborate with engineering leadership to shape the team roadmap and contribute to company-wide reliability goals
- Mentor other engineers and drive adoption of SRE principles throughout the engineering organization
Preferred Qualifications
- Prior experience embedding with product engineering teams to support operational goals
- Familiarity with AWS and multi-cloud environments (e.g., Azure, GCP)
- Experience in regulated environments or with FedRAMP-compliant systems
- Contributions to open-source SRE tooling or community knowledge sharing
Benefits
- Bonus
- Restricted stock units (RSUs)
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
π°$160k-$200k
πWorldwide
π°$190k-$220k
πWorldwide
π°$180k-$230k
πUnited States
πIndia
π°$175k-$210k
πUnited States
π°$225k-$255k
πUnited States
π°$190k-$267k
πUnited States
π°$180k-$220k
πUnited States
πIndia