Senior Staff Machine Learning Engineer AI Safety and Guardrail

Airbnb
Summary
Join Airbnb's Community Support Products (CSP) Machine Learning team as a Guardrail and AI Safety Engineer. Ensure the reliability and safety of AI-powered systems, including chatbots and AI assistants, by collaborating with cross-functional teams. Design and implement guardrails to mitigate risks such as hallucinations and privacy breaches. Set up continuous risk monitoring and collaborate with various teams to manage risks and ensure compliance. Document processes and partner with ML infrastructure to scale safety features. This role requires a PhD/Master's degree in CS or equivalent experience, 7+ years of experience in deploying machine learning models, and 2+ years in areas like Content Safety or Responsible AI. The position is US-remote eligible with occasional office work.
Requirements
- PhD/Master’s degree, preferably in CS, or equivalent experience
- 7/10+ years of work experience in developing and deploying machine learning models in production
- Strong understanding of machine learning principles and algorithms
- Hands-on programming experience in python and in-depth knowledge of machine learning frameworks
- 2+ years of experience with one or more of the following broader areas: Content Safety/Integrity, ML Fairness and Bias, Responsible AI, AI Model Security, or related areas
Responsibilities
- Collaborate with cross functional teams to identify issues, evaluate risks, design monitoring systems, tailor safeguard measures and deploy efficient solutions to ensure safe, robust and responsible use of AI adoption in CS products
- Design and implement appropriate guardrails to mitigate risks like hallucinations, privacy breaches, prompt injections, harmful responses, or bias
- Set up continuous risk monitoring pipelines and alerting to enable human-in-the-loop feedback and mitigation
- Collaborate with trust, security, legal and operation teams to enable risk management
- Collaborate with evaluation and data platform to design and build out data flywheel for fixing model failure modes and guardrail improvement
- Partner with product, design, trust & safety, and legal teams to ensure AI-driven features meet global privacy and compliance standards
- Own documentation, guardrail white papers, and onboarding materials to support knowledge sharing and auditability
- Partner with ML infrastructure to scale safety features across Airbnb products
Preferred Qualifications
Experience with AI technologies in Customer Support Products, LLM alignment techniques (SFT, RLHF, DPO, etc) and LLM evaluation
Benefits
- Bonus
- Equity
- Benefits
- Employee Travel Credits