Senior Software Engineer, Site Reliability Engineering

Crypto.com

📍 Remote - China

Summary

Join our team to design, develop, maintain, and improve software for various ventures projects. You will be actively involved in designing scalable applications, from frontend UI to backend infrastructure. You will ensure the entire stack operates optimally, perform deep dives into reliability issues, and continuously improve availability and reliability. You will also lead SRE initiatives, represent the team in design reviews, and cultivate relationships to drive impact. We offer a supportive and flexible work environment with opportunities for growth and development, along with a competitive salary and benefits.

Requirements

  • Experience coding in Ruby and/or Go
  • Familiar with GitOps principles and tools (GitHub Actions, Docker, Kubernetes)
  • Experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • Curiosity about finding root causes in incidents and outages
  • Ability to build alignment, cultivate relationships, and drive impact
  • Mindset for designing fault-tolerant system architecture
  • Comfort with being uncomfortable in ambiguous situations
  • Involvement with incident management and response
  • Desire to grow expertise, inform, and educate others
  • Able to pick up various technologies; a fast learner with a “get things done” mentality
  • Humble enough to embrace better ideas from others, eager to make things better, and open to challenges and possibilities

Responsibilities

  • Ensure the entire stack is healthy, with hardware, software, application, and network all operating at optimal performance
  • Perform deep dives into both systemic and latent reliability issues, partnering with other software and DevOps engineers across the organization to design, implement, and roll out fixes
  • Continuously improve availability, reliability, and observability and reduce the burden of human toil with tooling and automation
  • Lead and drive SRE initiatives to improve operational efficiency
  • Represent the SRE team in system design reviews and operational readiness exercises for new and existing services

Preferred Qualifications

  • Familiar with cloud platforms and microservice-based architecture (AWS is a big plus)
  • Familiar with monitoring tools (e.g. Datadog, OpenTelemetry)
  • Familiar with CI/CD tools (e.g. GitHub Actions)
  • Familiar with IaC tools (e.g. Terraform, Spacelift)
  • Experience in designing resilient system architecture
  • Experience in optimizing the performance of large-scale production systems

Benefits

  • Competitive salary
  • Attractive annual leave entitlement, including birthday and work anniversary leave
  • Work flexibility: flexible working hours and a hybrid or remote set-up
  • Career alternatives: our internal mobility program can offer employees a diverse scope
  • Work perks: Crypto.com Visa card provided upon joining
