Senior Site Reliability Engineer

Gelato

📍 Remote - Switzerland

Summary

Join Gelato, a leading Rollup-as-a-Service platform, as a Senior Site Reliability Engineer. You will play a key role in maintaining and operating our multi-cloud infrastructure, contributing to improvements in incident management, postmortems, and DevOps culture. Responsibilities include deploying and maintaining core components, modernizing infrastructure, enhancing CI/CD pipelines, and providing on-call support. You will also provide insights on system design and scalability, focusing on reliability, security, and efficiency in a Web3 context. Gelato offers a fully remote work environment with a competitive package, including a generous token package and the chance to participate in the Gelato DAO.

Requirements

  • At least 4 years of experience maintaining cloud infrastructure with modern technologies
  • At least 1 year of experience maintaining Web3-related infrastructure
  • GitOps principles at heart
  • Ability to lead and positively influence peers in the decision-making process
  • Ability to maintain high performance and accuracy in rapidly changing and evolving work settings
  • Experience in operating infrastructure on at least one major cloud provider (GCP, AWS, Azure, etc.)
  • Experience with Docker and containerized applications
  • Experience with Unix-based systems
  • Experience in operating and optimizing Kubernetes clusters
  • Experience with Git, Helm, Terraform, kubectl, and similar tools
  • Experience in networking, CDN, Gateways and deployment strategies
  • Experience in operating highly available infrastructure
  • Understanding of microservice-based architecture and operations
  • Experience in advanced debugging, logging, monitoring and alerting using tools such as Prometheus, Grafana, Splunk, Datadog
  • Experience in implementing and maintaining cost-optimized solutions
  • Experience with at least one programming language (e.g. Go, Python, Rust, PHP, TypeScript) and demonstrated software development capabilities
  • Understanding of Web3 technologies and related challenges, including Rollups-as-a-Service (RaaS)
  • Eager to learn and grow professionally

Responsibilities

  • Maintain and operate Gelato infrastructure in a multi-cloud environment
  • Contribute to improving our incident management lifecycle for overall reliability
  • Contribute to improving our postmortem philosophy
  • Contribute to improving our DevOps culture
  • Deploy and maintain Rollups-as-a-Service (RaaS) core components and related observability stacks
  • Evaluate and modernize our existing infrastructure and deployment strategies to align with the latest industry standards
  • Maintain and enhance our CI/CD pipeline and its governance
  • Join the on-call rotation to provide operational support and maintain service availability
  • Participate in and conduct regular team meetings
  • Provide insights and recommendations on system design and scalability, focusing on reliability, security and efficiency in a Web3 context
  • Be an active team member by always looking out for cost-effective, innovative solutions and by facilitating the adoption of industry standards

Benefits

  • Competitive package with a generous token package
  • A share of the network token and the ability to participate in the Gelato DAO
  • Chance to help shape the future of Web3 by working with the biggest projects in this space that use Gelato, such as Optimism, Polygon, Arbitrum, Celestia, and Eigenlayer
  • Fully remote team, with team members in Zug, Paris, New York, London, Singapore, and many other cool places