Site Reliability Engineer

StarTree Logo

StarTree

πŸ“Remote - India

Summary

Join StarTree, a passionate team building a cloud analytics system, as a Site Reliability Engineer (SRE). You will manage and tune large-scale, highly available distributed systems, primarily focusing on Apache Pinot and SQL DBs. This role involves collaborating with engineers and customers to resolve incidents, execute disaster recovery, and improve system performance. The ideal candidate possesses extensive experience in managing production systems, cloud platforms, and container orchestration. StarTree offers a dynamic environment and the opportunity to work on cutting-edge technology.

Requirements

  • 5+ years of experience as an engineer (SRE, SDET, or development)
  • Experience with cloud platforms such as AWS, GCP, or Azure
  • Experience with Kubernetes and container orchestration
  • Familiarity with streaming systems, such as Kafka, Pulsar, Flume, Flink, Spark, or similar
  • Knowledge of standard methodologies related to security, performance, and disaster recovery
  • Strong troubleshooting and critical thinking skills

Responsibilities

  • Leverage various monitoring and alerting services to solve intricate programming problems at scale
  • Manage and tune multiple critical customer-facing Apache Pinot clusters
  • Monitor availability, read/write latencies, and other key telemetry to proactively identify SLO misses and help mitigate issues
  • Build a rapport with and work closely with customers to mitigate and resolve incidents
  • Execute disaster recovery strategies with minimal downtime
  • Collaborate with other engineers to understand and troubleshoot systems and use the experience gained to influence the roadmap of other teams

Preferred Qualifications

Experience managing highly available production facing distributed systems and in-depth knowledge of Java are a plus

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.