Senior Site Reliability Engineer - Network Operations

Fastly
Summary
Join Fastly's Technical Operations team as a Senior Site Reliability Engineer (Networking) and contribute to building and operating the infrastructure powering the Fastly Edge Cloud Platform. You will build, operate, and maintain the global network, respond to traffic incidents and lead network incident resolution, innovate monitoring methods, and partner in developing automation systems. The role involves deep-dive performance analysis, collaboration with partner teams, and mentoring team members on global routing. This remote position, based in the UK (preferably near London), requires occasional travel. The estimated salary is ยฃ100,000 to ยฃ120,000 plus bonus, and the role includes eligibility for equity and discretionary bonus programs.
Requirements
- Extensive experience in the protocols and practices that make up the fabric of the global internet, including IP, BGP, Anycast and DNS
- Proficiency in Tier 1 Internet service providers, Internet exchanges and cloud providers
- Ability to analyze traffic patterns across multiple dimensions using flow-based tools
- Experience working with alerting, monitoring and visibility tools (such as Graphite/Grafana, Prometheus, or Splunk)
- Experience in code and design reviews and Scripting abilities in a common language such as Python, etc
- Experience with Linux/Unix
- Knowledge across cloud hosting solutions (i.e., GCP, AWS and Azure)
- Knowledge of DevOps practices and CI / CD pipelines (ie. Git, Jenkins, Ansible)
- Adept at knowledge sharing and creating comprehensive documentation to empower teams
- Able to collaborate with cross-functional teams to shape the technical roadmap, prioritizing initiatives to optimize automation tooling and the network
Responsibilities
- Build, operate, and maintain the continually growing global network footprint of Fastlyโs Edge Cloud Platform
- Response to significant traffic incidents and lead network incidents, resolving edge cases and failure scenarios with your expertise in IP routing, particularly BGP
- Innovate new methods for monitoring network performance, focusing on the end-user experience, and proactively address potential issues
- Partner in the development and iteration of tools and automation systems that improve how we operate and build the network
- Continual deep-dive of performance-based analytics and close involvement with partner teams to maintain a performant global network
- Advocate for the operational stability of the network by identifying opportunities and partnering with engineering teams to shape their roadmaps and software solutions
- Mentor team members on the complexities of global routing, especially in an anycast-heavy environment
Benefits
This role may be eligible to participate in Fastlyโs equity and discretionary bonus programs
Share this job:
Similar Remote Jobs
