Staff Software Engineer

Airbnb Logo

Airbnb

πŸ’΅ $204k-$255k
πŸ“Remote - United States

Summary

Join Airbnb's Core Machine Learning team as a Staff Software Engineer (GenAI) and contribute to the development and optimization of scalable engineering evaluation frameworks and systems for Airbnb's Generative AI products. Work closely with cross-functional teams, including modeling engineers, product managers, data scientists, and operations, to define and shape the future of Airbnb's Community Support experience. You will design, productionize, and optimize end-to-end data systems to improve the effectiveness and efficiency of the AI evaluation automation framework. Collaborate with machine learning infrastructure engineering teams to evolve how evaluation frameworks are built and tested for Airbnb's Conversational AI products. Lead all phases of software development, from architecture design to implementation and testing. Foster a culture of engineering excellence by supporting teammates and sharing knowledge.

Requirements

  • 9+ years of industry experience in applied machine learning, with a track record of technical leadership and delivering complex, high-impact AI/ML systems
  • MS or PhD in Computer Science, Machine Learning, Artificial Intelligence, or a related technical field
  • Deep expertise in Large Language Models (LLMs), including experience with LLM model evaluation methodologies, and agent-based applications
  • Solid programming skills in Python and at least one other language (e.g., Java, Go, or Scala), with a strong foundation in software design, testing, and code quality
  • Strong AI/ML system design skills with a track record of building scalable, extensible AI systems
  • Familiarity with ML infrastructure and operations, including model deployment, serving, monitoring, and experimentation
  • Proven ability to work in cross-functional teams, collaborating with modeling engineers, product managers, data scientists, and operations to deliver end-to-end solutions
  • Excellent communication, mentorship, and technical leadership skills; able to drive alignment, set direction, and influence engineering culture across teams

Responsibilities

  • Work closely with Core Modeling engineers to understand pain points in the LLM development process, and develop LLM-as-a-judge solutions and models to address metric-related challenges in a scalable and efficient way
  • Design, productionize, and optimize end-to-end data systems to improve the effectiveness and efficiency of the AI evaluation automation framework
  • Collaborate with machine learning infrastructure engineering teams to evolve how we build and test evaluation framework for Airbnb Conversational AI products
  • Lead all phases of software development including architecture design, implementation and testing
  • Work collaboratively with cross-functional partners including product managers, operations and data scientists, identify opportunities for business impact, understand and prioritize requirements for machine learning systems and data pipelines, drive engineering decisions and quantify impact
  • Foster a culture of engineering excellence by supporting teammates in writing high-quality code, ensuring operational reliability, and sharing knowledge across the team

Benefits

  • This position is US - Remote Eligible
  • The role may include occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager
  • This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits
  • $204,000 β€” $255,000 USD

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Remote Jobs