Remote Staff Site Reliability Engineer
Vultr
๐ต $120k-$135k
๐Remote - Worldwide
Please let Vultr know you found this job on JobsCollider. Thanks! ๐
Job highlights
Summary
Join Vultr as a Staff Site Reliability Engineer to help automate and make an impact working with cross-functional teams, designing state-of-the-art cloud provider solutions, and enhancing the resilience and stability of our systems.
Requirements
- 3+ years of experience in a hands-on SRE role delivering distributed architectures
- 2+ years working with and maintaining Kubernetes clusters for highly available and regulated environments
- 2+ years of hands-on experience with a modern Grafana stack, including Mimir, Loki, and Tempo
- Comfortable working with complex CI/CD Pipelines (Gitlab/Jenkins), configuration management (Puppet/Salt), and IaC solutions such as Terraform
- Experience working with observability pipelines or Open Telemetry is a plus
- A background in performance optimization for Webstacks, including components such as PHP-FPM, Ningx, and Mysql
- Boasts strong programming chops in Python, Golang, or PHP and thrives when picking up new technologies
Responsibilities
- Collaborate with cross-functional teams to craft and implement a modern observability stack and refine our incident-handling processes
- Design and contribute to state-of-the-art cloud provider solutions for high-performance computing, AI training, and inference workloads, focusing on Observability and MLOps
- The platform team aims to enhance the resilience and stability of our systems through thoughtful software improvements, architecture, and automation
- Contribute to solutions for various challenges ranging in nature from low-level hardware issues to high-level distributed application scale challenges and everything in between
- Champion DevOps and SRE principles through automation, thought leadership, and close collaboration within our engineering team
- Enhance customer experience by improving case handlingโstrive for proactive responses, rich insights, and automated resolutions
- Develop robust documentation to streamline the handling of recurring reliability issues, paving the way for junior SREs to take the helm confidently
- Identify and implement scalable solutions to address technical challenges within our stack, setting new benchmarks for innovation
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
- ๐ฐ$195k-$220k๐United States
- ๐ฐ$129k-$161k๐Canada
- ๐ฐ$159k-$239k๐United States
- ๐ฐ$135k-$178k๐Worldwide
- ๐Brazil
- ๐ฐ$168k๐Worldwide
- ๐Worldwide
- ๐Australia
- ๐ฐ$165k-$210k๐United States
Please let Vultr know you found this job on JobsCollider. Thanks! ๐