Senior SRE Lead

Juniper Square
Summary
Join Juniper Square as a Staff Site Reliability Engineer and help us expand our global domain expertise, providing 24x7 support to enable development velocity. You will automate infrastructure provisioning, partner with engineering teams to improve developer experiences and data posture, evolve deployment pipelines, and improve service metrics. Responsibilities include enabling observability, responding to incidents, adopting AWS Well-Architected frameworks, identifying performance bottlenecks, proactively solving problems, building a load testing environment, enforcing security controls, and participating in roadmap planning and architecture reviews. You will also train and mentor engineers. This role requires a Bachelor's degree in Computer Science or equivalent experience, 4-8 years of AWS experience, Kubernetes fluency, CI/CD pipeline experience, and PostgreSQL experience. Juniper Square offers a variety of work arrangements, from fully remote to working in physical offices.
Requirements
- Bachelorโs degree in Computer Science or similar or equivalent experience
- A profound love for solving hard problems and overcoming challenging obstacles
- Putting your customers first, whether they be internal or external, and making them more productive, happy, and successful
- 4 to 8 years of experience with AWS. Other public cloud providers are a bonus
- Some sort of infrastructure-as-code system: Ansible, Terraform, CloudFormation, etc
- Multi-year hands-on experience and fluency with Kubernetes and helm charts are an absolute skill requirement. We live and breathe the k8s ecosystem
- Experience with a CI/CD pipeline. We use a combination of Github Actions, ArgoCD, Helm and GitOps in our deployment process, but again, any are fine
- 3+ Experience with PostgreSQL is a must
- 3+ Experience with cloud security best practices (CSPM, CDR, CWPP, SIEM, etc) to keep our customers and cloud posture secure
- Experience with containers (builds, registries, vulnerabilities scanning, run-time with docker-compose, run-time with TILT, run-time in schedulers/orchestration systems)
- 3+ years of experience managing Linux oriented production environments
Responsibilities
- Automate the provisioning of all of Juniper Squareโs infrastructure in code. Everything we do is in code!
- Partner with our Platform Engineering team on building developer tooling / improving developer experiences via joint initiatives and enhancements
- Partner with our Data Engineering team on improving our data posture and driving operational excellence
- Evolve our deployment pipelines to automate infrastructure deployments with the latest and greatest (and reliable) technologies
- Improve metrics on our main services, and act as a subject matter expert for our global dev teams
- Enable observability, SLO/SLI reporting, and respond to business impacting incidents as it pertains to infrastructure
- Adopt and drive solutions that align with AWS Well Architected frameworks and Juniper Squareโs business objectives
- Identify performance bottlenecks and provide recommendations for improvement
- Proactively identify and solve problems that we didnโt even know we had
- Help build, deploy, and scale a load testing environment that is analogous to production
- Enforce security and operational safety controls
- Participate in technical roadmap planning and estimation
- Participate and contribute in production readiness and architecture review board (ARB) meetings and forums
- Train and mentor future engineers in the same region
- Contribute to the architectural improvements to meet future scaling and observability requirements
Preferred Qualifications
- We use Python and Bash, so knowledge and exposure with either is a strong plus
- Experience breaking up monolithic architectures into microservices
- Experience with service meshes and service discovery solutions
- Experience with an observability solution: New Relic, Prometheus, DataDog, etc
- Experience with logging systems: CloudWatch, ELK, Splunk, etc
- Strong knowledge in general programming skills
- Strong knowledge of data retention, backups and recovery processes
- Additional experience with document databases is a nice-to-have
Benefits
Juniper Square offers employees a variety of ways to work, ranging from a fully remote experience to working full-time in one of our physical offices
Share this job:
Similar Remote Jobs

