Senior Site Reliability Engineer

Finary

πŸ“Remote - Worldwide

Summary

Join Finary as their first Site Reliability Engineer and become the cornerstone of their technical infrastructure, ensuring high availability, performance, and security. Work directly with the VP of Engineering to build and implement systems, processes, and a culture of reliability. Establish and enforce reliability standards, implement robust observability solutions, and bridge the gap between development and operations. Architect and implement solutions using GCP, Kubernetes, Terraform, and Docker, creating scalable infrastructure. Define and measure SLOs/SLIs, build automated recovery systems, and continuously optimize infrastructure costs. Collaborate with engineering teams to embed reliability thinking into the development lifecycle. This highly independent role requires extensive experience in high-availability architectures and deep knowledge of GCP, particularly Kubernetes.

Requirements

  • Have extensive experience designing and maintaining high-availability architectures in production environments
  • Possess deep knowledge of Google Cloud Platform, particularly Kubernetes
  • Are proficient with infrastructure as code tools, especially Terraform
  • Have strong experience with containerization technologies like Docker
  • Excel at implementing and utilizing observability tools such as Datadog, Prometheus, and Grafana
  • Have a proven track record implementing effective monitoring, alerting, and incident response processes
  • Possess strong analytical skills for troubleshooting complex system failures and performance bottlenecks
  • Apply security principles (least privilege, secure defaults, etc.) as a foundational aspect of system design
  • Have experience defining and tracking reliability metrics (SLOs/SLIs/SLAs) that align with business goals
  • Demonstrate a bias for automation, consistently replacing manual processes with scalable solutions
  • Are comfortable being on-call and handling production incidents calmly and methodically
  • Communicate effectively with both technical and non-technical stakeholders
  • Work collaboratively across teams while maintaining independence and ownership
  • Make pragmatic operational decisions, balancing ideal solutions with business needs

Preferred Qualifications

  • Have experience as the first or founding SRE in a fast-growing company
  • Possess experience with fintech or financial systems where reliability is critical
  • Have experience optimizing cloud infrastructure costs without sacrificing performance
  • Have experience mentoring developers on reliability best practices
  • Are a team player, are humble, and like to help others grow
  • Are genuinely friendly and ambitious
  • Like to take initiative and find new creative solutions to problems
  • Are an excellent communicator and know how to coordinate teams, collect feedback, and identify bottlenecks
  • Are fluent in English (written and spoken)
  • Break down big projects into small deliverables that are clearly presented to engineers, move fast based on user feedback, prefer small solutions, and know when to improve something
  • Are comfortable with product analytics (Amplitude is a plus)
  • Are passionate about personal finance & investing
  • Have experience in fintech

Benefits

  • Working with brilliant people in a motivating and fast-paced environment
  • Having the opportunity to have a real impact in a sector that touches everyone's life: personal finance
  • Being empowered and trusted to bring and become the best version of yourself
  • Gathering at the office or in nice places every 6 weeks
  • A competitive compensation package
