
Senior Software Engineer

Airbnb
Summary
Join Airbnb's Transactional Storage Services team and help design, build, and operate a new, open-source NewSql database. This crucial database will host critical user, listing, and financial data, serving as a unified storage backend for Airbnb's online data. As a Senior engineer, you will tackle broad and deep technical challenges, focusing on areas such as control plane and operations design, automated database operations, platform delivery for partner teams, migration and adoption of new systems, and robust backup and restore solutions. You will work with talented engineers on a modern distributed database system, contributing to a cutting-edge data ecosystem. This role offers the opportunity to shape the future of data systems for a global company. The position is US-remote eligible, with occasional office work or offsites.
Requirements
- 5+ years of relevant industry experience
- Solid understanding of distributed systems and infrastructure fundamentals
- Experience in deep diving and then owning a complex code base
- Knack for writing clean, readable, testable, maintainable code
- Ability to decompose large-scale distributed systems and figure out monitoring metrics, failure scenarios and debug them in an efficient manner
- Strong collaboration and communication skills in a remote-working environment
- Expertise with a public cloud provider (AWS, GCP, Azure) and their Storage, VM, Networking, Security offerings. E.g. external-dns, route53, ebs etc
Responsibilities
- Design frameworks and maintain the general ecosystem around our NewSql database’s monitoring, permissions, service discovery integration, etc
- Design, automate critical database operations such as centralized and hierarchical config management system, fully automated image building and release certification for major version upgrades, zero-downtime Blue/Green deployment
- Be part of the team that defines and delivers a generalized database platform for partner KVStore, ORM, MySql teams
- Deliver a zero-downtime forward and reverse replication pipeline with near-real-time consistency between two transactional databases, with correctness guarantee across transactional boundaries
- Deliver a robust failover/failback mechanism to guarantee correctness and continuity during unexpected outages
- Conduct case study of all Airbnb’s disaster recovery scenarios, leverage existing open source and/or design and implement software that satisfies Airbnb’s requirements on database backup and restore, cross-region data resiliency, PiTR, etc
- Design the right cluster topology, restore logic, and ransomware policy to safeguard Airbnb’s business continuity
Preferred Qualifications
- Experience in Java, Go, Rust or C++
- Experience with writing robust automation frameworks and tooling
- Experience with Kubernetes, operator pattern, helm, etc; experience with Infrastructure as Code, such as Chef and Terraform
Benefits
- Bonus
- Equity
- Benefits
- Employee Travel Credits
Share this job:
Similar Remote Jobs

