Senior Site Reliability Engineer

Wisp
Summary
Join Wisp, a fully-remote healthcare company, as a Senior Site Reliability Engineer. You will be responsible for ensuring the availability and reliability of Wisp's systems, working with infrastructure as code, Kubernetes environments, and CI/CD best practices. You will also manage traffic ingress, establish observability solutions, and collaborate with security and software engineers to implement best practices for system reliability, performance, and security. This role requires 5+ years of experience in Python, Go, or Shell scripting, a deep understanding of networking fundamentals, proven ability to implement SRE principles, and expertise in container orchestration technologies. You must be self-directed, have exceptional interpersonal skills, and be authorized to work in the United States.
Requirements
- Demonstrated proficiency with 5+ years of experience in Python, Go, or Shell scripting
- Deep understanding of networking fundamentals spanning TCP/IP, firewalls, routing, DNS, and load balancing
- Proven ability to implement key SRE principles including monitoring systems, performance optimization, and automation workflows
- Comprehensive expertise in container orchestration technologies including Docker, Buildpack, and/or Kubernetes
- Self-directed professional capable of working autonomously, identifying potential issues, and implementing effective solutions with minimal oversight
- Exceptional interpersonal skills with a track record of clear communication and productive collaboration across technical and non-technical teams
Responsibilities
- Use infrastructure as code practices (Terraform, Terragrunt, etc.), to build, launch, and maintain AWS infrastructure solutions that are resilient, secure, and transparent
- Oversee and optimize Kubernetes environments with an emphasis on security hardening and efficient load distribution
- Enable engineering teams by being the champion of CI/CD best practices and owning the workflows, migrations, and rollbacks of software delivery at Wisp
- Manage all traffic ingress for Wispβs ecosystem including edge caching, application load balancing, and proxy configurations (Nginx, Envoy)
- Establish comprehensive observability solutions that deliver actionable insights to both engineering teams and key business stakeholders
- Collaborate with security and software engineers to implement best practices for system reliability, performance, and security across our infrastructure