Remote Site Reliability Engineer

BlaBlaCar

📍Remote - France

Job highlights

Summary

Join our Foundations department as an SRE to provide best-in-class Observability, Alerting and Incident management tools and processes to service teams.

Requirements

  • Working in a multidisciplinary environment requires strong communication skills: you'll need to adapt your communication level to other teams' expertise and be able to understand their needs
  • Basic knowledge of observability tools (e.g., Datadog) and understanding of metrics, logging, and tracing
  • Troubleshooting/on-call experience in production environments, diagnosing and resolving technical issues effectively
  • Full working proficiency in English
  • Fit with our BlaBlaPrinciples
  • Thriving in a collaborative, fast-growing and innovative environment
  • Ability to take ownership, aligned with business priorities

Responsibilities

  • Support software engineers by creating, maintaining, and improving observability and alerting tools and frameworks
  • Assist in the design and maintenance of Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to ensure service reliability
  • Own the incident management process by defining best practices, standards, and ensuring continuous improvement through post-mortems and chaos engineering
  • Develop and maintain tools, such as Terraform modules, to help automate and enhance reliability across services
  • Provide reporting on operational metrics and incidents to drive continuous improvement

Preferred Qualifications

  • Familiarity with incident management platforms (e.g., PagerDuty) is a bonus
  • Experience working with Service Level Objectives (SLOs) and Service Level Indicators (SLIs)
  • Exposure to programming in Go or a strong interest in learning it

Benefits

  • Full remote possible in Metropolitan France (+ access to BlaBlaCar co-working spaces in Bordeaux, Toulouse, Lyon, Nantes and Sophia Antipolis)
  • 4 additional weeks of parental leave, 100% paid
  • Financial support for home office equipment
  • Relocation package and visa support
  • Free unlimited carpooling & bus rides
  • Employee Stock Ownership plan
  • 25 days holiday per year + RTT
  • Local meal plan policies (Swile card in France)
  • 50% transportation paid in France (Forfait Mobilité Durable)
  • Mental health support through Moka.care

Job description

About BlaBlaCar

BlaBlaCar is the world’s leading community-based travel app enabling 27 million members a year to carpool or travel by bus in 21 countries. Our team of 800 employees counts over 50 nationalities and is spread across our 5 global offices, 30% working fully remotely.

Your Mission

By joining our Foundations department, you will work alongside talented individuals grouped in small agile teams, each with strong ownership of its part of the department’s goals. Foundations is composed of seven teams which “provide consistent, easy to use, infrastructures, services, and expertise to support BlaBlaCar’s growth and evolution”.

The Site Reliability Engineering team (aka SRE) is responsible for providing best-in-class Observability, Alerting and Incident management tools and processes to service teams. As an enabling team, we help BlaBlaCar engineers efficiently improve the reliability of their services. Empowering developers and sharing our reliability expertise with them are at the core of our daily work.

Technical stack:

• Core Infrastructure: Kubernetes, Google Cloud Platform

• GitOps/Delivery: GitHub, Terraform, Flux, Helm, Jenkins

• Observability/Incident Management: Datadog, OpenTelemetry, PagerDuty

• Languages: Go for Tooling

Your responsibilities

• Support software engineers by creating, maintaining, and improving observability and alerting tools and frameworks

• Assist in the design and maintenance of Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to ensure service reliability.

• Own the incident management process by defining best practices, standards, and ensuring continuous improvement through post-mortems and chaos engineering. While developers handle incidents within their scope, you may be called on to step in as Incident Commander during high-severity incidents, leading coordination efforts.

• Develop and maintain tools, such as Terraform modules, to help automate and enhance reliability across services.

• Provide reporting on operational metrics and incidents to drive continuous improvement.

Your qualifications

• Working in a multidisciplinary environment requires strong communication skills: you’ll need to adapt your communication level to other teams’ expertise and be able to understand their needs

• Basic knowledge of observability tools (e.g., Datadog) and understanding of metrics, logging, and tracing.

• Troubleshooting/on-call experience in production environments, diagnosing and resolving technical issues effectively (experience with GKE or Kubernetes is a plus).

• Full working proficiency in English

• Fit with our BlaBlaPrinciples

• Thriving in a collaborative, fast-growing and innovative environment

• Ability to take ownership, aligned with business priorities

Nice to have:

• Familiarity with incident management platforms (e.g., PagerDuty) is a bonus

• Experience working with Service Level Objectives (SLOs) and Service Level Indicators (SLIs)

• Exposure to programming in Go or a strong interest in learning it.

• Backend services are built using multiple programming languages: while development skills aren’t required, familiarity with object-oriented programming and scripting languages is an advantage.

What we have to offer

• Full remote possible in Metropolitan France (+ access to BlaBlaCar co-working spaces in Bordeaux, Toulouse, Lyon, Nantes and Sophia Antipolis)

• 4 additional weeks of parental leave, 100% paid

• Financial support for home office equipment

• Relocation package and visa support

• Free unlimited carpooling & bus rides

• Employee Stock Ownership plan

• 25 days holiday per year + RTT

• Local meal plan policies (Swile card in France)

• 50% transportation paid in France (Forfait Mobilité Durable)

• Mental health support through Moka.care

Interested in joining the ride?

• a 45-min video call with Maxime, Talent Acquisition Manager, to get to know you, understand your career expectations and answer your questions

• a 60-min video call with Damien Bertau, Hiring Manager (n+1), to discuss your experience and share more details about the team

• a fully remote exercise to evaluate your technical skills

• a 90-min video call with 2 team members to discuss your exercise and your technical expertise in more depth

• a 45-min video call with Maxime Fouilleul, Head of Foundations (n+2), to get a wider vision of the department and its strategy

Our hiring process lasts 25-30 days on average; offers usually come within 48 hours.

Not sure yet?

Check out our 100 reasons to join BlaBlaCar!

BlaBlaCar is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. If you don’t meet 100% of the qualifications outlined above, tell us why you’d still be a great fit for this role in your application.
