Principal Site Reliability Engineer

Logo of SonicWall

SonicWall

πŸ“Remote - United States

Job highlights

Summary

Join our team as a Site Reliability Engineer (SRE) to ensure our systems are highly reliable, scalable, and performant. You will collaborate with engineers to optimize infrastructure, design monitoring solutions, audit Kubernetes workloads, and implement CI/CD pipelines. Your expertise in AWS/GCP will be used to architect solutions enhancing the reliability of our large-scale cloud infrastructure. This role requires data analysis skills and a passion for technology within a dynamic environment. You will contribute to shaping the future of our operations.

Requirements

  • Apply Engineering Principles to infrastructure management
  • Be proficient in identifying and addressing bottlenecks and failure points in large-scale distributed systems
  • Have hands-on experience with Kubernetes (GKE) clusters and workloads using tools like Lens, K9s, and FluxCD
  • Translate business needs into actionable metrics, pulling data from multiple sources like AWS, GCP, and custom applications
  • Be skilled in building and maintaining dashboards using tools like Datadog, Grafana, Prometheus and Statsd to provide critical insights to business leaders
  • Expertly use performance analysis and debugging tools such as: tcpdump, ss/netstat, top, sar, ab, etc
  • Be proficient in at least one scripting language (e.g., Python, Bash, Perl)
  • Administer MySQL databases

Responsibilities

  • Lead the charge in ensuring our systems are highly reliable, scalable, and performant
  • Collaborate with seasoned engineers to optimize our infrastructure through data-driven decisions, automation, and innovative problem-solving
  • Design robust monitoring solutions
  • Audit Kubernetes-based workloads
  • Drive the implementation of CI/CD pipelines
  • Architect solutions that enhance the reliability of our large-scale cloud-based infrastructure

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let SonicWall know you found this job on JobsCollider. Thanks! πŸ™