Senior SRE Lead

Juniper Square

📍Remote - India

Summary

Join Juniper Square as a Staff Site Reliability Engineer and help grow our global domain expertise. You will play a key role in enabling 24x7 development velocity by automating infrastructure provisioning, partnering with engineering teams, and evolving deployment pipelines. Responsibilities include improving service metrics, enabling observability, responding to incidents, and adopting the AWS Well-Architected Framework. You will also identify and solve problems, help build a load testing environment, enforce security controls, and participate in technical planning and training. Juniper Square offers a variety of work arrangements, from fully remote to working in one of our physical offices.

Requirements

  • Bachelor’s degree in Computer Science or a similar field, or equivalent experience
  • A profound love for solving hard problems and overcoming challenging obstacles
  • Putting your customers first, whether they be internal or external, and making them more productive, happy, and successful
  • 3 to 5 years of experience with AWS
  • 3+ years of experience with PostgreSQL
  • 3+ years of experience with cloud security best practices (CSPM, CDR, CWPP, SIEM, etc.)
  • Experience with containers (builds, registries, vulnerability scanning, run-time with docker-compose, run-time with TILT, run-time in schedulers/orchestration systems)
  • 3+ years of experience managing Linux-oriented production environments
  • Multi-year hands-on experience and fluency with Kubernetes and Helm charts
  • Experience with a CI/CD pipeline
  • Experience with an infrastructure-as-code system: Ansible, Terraform, CloudFormation, CDK, etc.
  • Strong general programming skills
  • Strong knowledge of data retention, backup, and recovery processes

Responsibilities

  • Automate the provisioning of all of Juniper Square’s infrastructure in code
  • Partner with our Platform Engineering team on joint initiatives and enhancements that build developer tooling and improve the developer experience
  • Partner with our Data Engineering team on improving our data posture and driving operational excellence
  • Evolve our deployment pipelines to automate infrastructure deployments with the latest and greatest (and reliable) technologies
  • Improve metrics on our main services, and act as a subject matter expert for our global dev teams
  • Enable observability and SLO/SLI reporting, and respond to business-impacting incidents as they pertain to infrastructure
  • Adopt and drive solutions that align with the AWS Well-Architected Framework and Juniper Square’s business objectives
  • Identify performance bottlenecks and provide recommendations for improvement
  • Proactively identify and solve problems that we didn’t even know we had
  • Help build, deploy, and scale a load testing environment that is analogous to production
  • Enforce security and operational safety controls
  • Participate in technical roadmap planning and estimation
  • Participate in and contribute to production readiness and architecture review board (ARB) meetings and forums
  • Train and mentor future engineers in the same region
  • Contribute to architectural improvements that meet future scaling and observability requirements

Preferred Qualifications

  • Experience with other public cloud providers
  • Additional experience with document databases
  • We use Python and TypeScript, so knowledge of or exposure to either is a strong plus
  • Experience breaking up monolithic architectures into microservices
  • Experience with service meshes and service discovery solutions
  • Experience with an observability solution: New Relic, Prometheus, Datadog, etc.
  • Experience with logging systems: CloudWatch, ELK, Splunk, etc.

Benefits

Juniper Square offers employees a variety of ways to work, ranging from a fully remote experience to working full-time in one of our physical offices.
