Everbridge is hiring a
Senior Site Reliability Engineer

Logo of Everbridge

Everbridge

πŸ’΅ ~$150k-$222k
πŸ“Remote - United States

Summary

The job is for a Kubernetes Platform Engineer in the Everbridge Federal team, responsible for ensuring service quality and availability of Everbridge's solutions by designing, deploying, managing Kubernetes at scale, implementing best practices, collaborating with teams, and participating in an on-call rotation. The role requires 3+ years of technical AWS experience, 2+ years of Kubernetes experience, 3+ years of Terraform or similar IaC experience, and a Secret Clearance or the ability to obtain one.

Requirements

  • 3+ years of technical AWS experience, managing and owning systems in a production environment
  • 2+ years of Kubernetes experience (EKS, AKS, GKE, Self managed)
  • 3+ years of Terraform or similar IaC experience
  • Experience with the following tooling: GitLab CICD, Packer, Docker, EKS, Kubernetes, Spinnaker, Helm, Argo, Jenkins
  • Experience with Telemetry tools such as Datadog, SumoLogic, Grafana, Prometheus
  • Experience writing automation in languages such as Python, Go, Bash, Java
  • Experience with configuration management tools such as Salt, Ansible, AWS user_data
  • Experience with a DevOps/SRE production environment
  • Experience with Agile practices
  • Large scale production UNIX/Linux experience
  • Experience working on DoD IL4 programs
  • Currently hold a Secret Clearance or a be a US citizen with the ability to obtain a Secret Clearance
  • Must have or be able to obtain and maintain DoD 8140 β€œIntermediate” level or higher certification (formally DoD 8170 IAM Level II)

Responsibilities

  • Keep people safe and businesses running
  • Be an integral member of the team implementing our platform in a DoD IL4 cloud environment
  • Own and maintain the Kubernetes infrastructure from conception to completion within AWS
  • Build upon the operational availability, security, scalability, efficiency, monitoring, instrumentation, and overall service reliability of Everbridge's Kubernetes solutions
  • Collaborate across Agile teams with Architects, Developers, Quality, Data, Security, and other engineers on designing and implementing highly reliable solutions
  • Research and implement SRE and Kubernetes best practices and by creating automation, cross-functional collaboration, and data-driven decisions to reinforce the integrity and reliability of our systems
  • Participate in a rotating on-call rotation to resolve production escalations

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Jobs

Please let Everbridge know you found this job on JobsCollider. Thanks! πŸ™