Senior Site Reliability Engineer
SecurityScorecard
Summary
Join SecurityScorecard as a Senior SRE and play a key role in improving our deployment automation, test frameworks, and system reliability. You will collaborate with development teams, establish best practices for continuous delivery, and uphold high quality standards. Responsibilities include designing and maintaining CI/CD pipelines, optimizing infrastructure as code, improving deployment rollbacks and incident response, developing automated testing strategies, and building robust monitoring and alerting solutions. You will also participate in on-call rotations. This role requires proven SRE or DevOps experience, strong knowledge of CI/CD and cloud platforms, proficiency with infrastructure as code tools, and hands-on experience with automated testing frameworks. SecurityScorecard offers a competitive salary, stock options, health benefits, unlimited PTO, parental leave, and tuition reimbursement.
Requirements
- Proven experience as an SRE, DevOps Engineer, or similar role
- Strong background in CI/CD tools (Jenkins, GitHub Actions, GitLab CI, etc.)
- Experience with cloud platforms (AWS, GCP, Azure) and with containers and container orchestration (Docker, Kubernetes)
- Proficiency with infrastructure as code tools (Terraform, Ansible, etc.)
- Hands-on experience with automated testing frameworks and tools (Selenium, JUnit, etc.)
- Knowledge of scripting languages (Python, Bash, etc.) for automation
- Familiarity with monitoring and observability tools (Prometheus, Grafana, Datadog)
- Excellent problem-solving skills and a proactive attitude toward incident management
Responsibilities
- Design, implement, and maintain CI/CD pipelines to automate deployment processes
- Enhance infrastructure as code practices using tools like Terraform or CloudFormation
- Optimize deployment rollbacks and improve incident response procedures
- Develop automated testing strategies, including integration, load, and performance testing
- Collaborate with developers to improve application reliability through testing and monitoring
- Build robust monitoring and alerting solutions to ensure system health and availability
- Drive improvements in observability, logging, and metrics collection
- Participate in on-call rotations to manage incidents and ensure rapid recovery
Preferred Qualifications
- Experience with Node.js and the JVM
- Experience with chaos engineering or fault injection testing
- Familiarity with security testing and compliance requirements
- Contributions to open-source SRE or DevOps projects
Benefits
- Competitive salary
- Stock options
- Health benefits
- Unlimited PTO
- Parental leave
- Tuition reimbursement
- Annual performance-based incentive compensation awards
- Equity