πUnited States
Senior Data Engineer, ML Infrastructure

Airbnb
π΅ $191k-$223k
πRemote - United States
Please let Airbnb know you found this job on JobsCollider. Thanks! π
Summary
Join Airbnb's ML Infrastructure team and build the essential AI/ML data foundations powering all AI/ML use cases across the company. You will design, build, and maintain robust data pipelines, develop and optimize data models, collaborate with other teams, and contribute to scalable GenAI infrastructure. Your work will accelerate AI/ML innovation and enable the rapid development and deployment of high-quality solutions. You'll leverage technologies like Spark, Airflow, Ray, MLFlow, TensorFlow, and PyTorch. This role is US-remote eligible with occasional office work.
Requirements
- 5+ years of relevant industry experience (BS/Masters) or 2+ years with a PhD
- Strong coding skills in Python, Java, or equivalent languages
- Hands-on experience with distributed processing technologies (Spark, Kafka, Flink, Hadoop) and distributed storage (HDFS, S3)
- Solid knowledge of data warehousing concepts and databases (e.g. PostgreSQL, MySQL, Redshift, BigQuery, ClickHouse)
- Expertise building scalable ETL pipelines using schedulers like Airflow, Luigi, Oozie, or AWS Glue
- Proven ability to analyze large datasets, identify insights, and drive impactful product solutions
- Excellent written and verbal communication skills; comfortable collaborating cross-functionally
- Experience building end-to-end Machine Learning platforms and deploying ML models
- Familiarity with Kubernetes, Docker, and modern infrastructure tools
- Deep understanding of distributed systems and engineering best practices
Responsibilities
- Design, build, automate, and maintain robust, scalable data pipelines using SparkSQL, Scala, and Airflow
- Develop and optimize data models ensuring high-quality, consistent, and accurate data to support broad AI/ML product feature decisions
- Collaborate closely with peer ML Infra teams to deliver automated data solutions driving AI/ML acceleration
- Contribute to scalable GenAI infrastructure by leveraging foundational language and vision models to create high quality datasets that power cutting edge GenAI applications
- Partner with key customer teams to deliver high-impact, high-quality datasets core to Airbnb's roadmap
- Utilize leading open-source technologies including Spark, Airflow, Ray, MLFlow, TensorFlow, PyTorch, Docker, Kubernetes, and more
Benefits
This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
π°$200k-$240k
πUnited States
π°$155k-$207k
πUnited States
πCanada
π°$180k-$230k
πUnited States
πBrazil
πArgentina
πIndia
π°$170k-$200k
πUnited States