Engineering Manager, Gen AI

closed

Rad AI

📍 Remote - United States

Summary

Join Rad AI's Gen AI Engineering team and lead a growing team of engineers, pioneering new approaches to healthcare-specific LLM deployment. You will define and execute the long-term Gen AI engineering strategy, drive innovative training and inference architectures, and collaborate with radiologists and ML researchers. This role requires 6+ years of software engineering experience, including 3+ years focusing on ML infrastructure and 2+ years leading technical teams. You'll need proficiency in Python and infrastructure-as-code tools. Rad AI offers a variety of benefits for US-based full-time roles, including comprehensive insurance, 401(k), flexible PTO, and more. The company values diversity and provides equal employment opportunities.

Requirements

  • 6+ years of experience in software engineering, including at least 3 years focused on ML infrastructure
  • 2+ years of experience leading technical teams
  • Working knowledge of advanced model customization and fine-tuning methodologies
  • Experience deploying and operating production ML systems at scale
  • Proficiency with Python and at least one infrastructure-as-code tool (Terraform, CloudFormation, etc.)
  • Strong communication skills with the ability to translate complex technical concepts across teams
  • Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
  • Passion for applying AI to meaningful problems with direct impact on patient care

Responsibilities

  • Lead and mentor a growing team of Gen AI engineers, helping them develop professionally while tackling healthcare's most challenging AI problems
  • Pioneer new approaches to healthcare-specific LLM deployment that balance clinical accuracy, performance, and regulatory requirements
  • Define and execute the long-term Gen AI engineering strategy, balancing performance, cost, and reliability across healthcare AI applications
  • Drive innovative training and inference architectures that can scale to support thousands of radiologists simultaneously
  • Collaborate with radiologists and ML researchers to translate clinical insights into technical solutions that meaningfully improve patient care
  • Establish technical partnerships with leading AI infrastructure providers to optimize our unique healthcare AI compute requirements
  • Champion a culture of technical excellence, continuous learning, and ethical AI development

Preferred Qualifications

  • Strong understanding of cloud infrastructure (AWS, GCP, or Azure) and containerization technologies
  • Familiarity with specialized ML hardware such as TPUs, GPUs, or custom accelerators
  • Knowledge of model optimization techniques, such as quantization and distillation, to reduce inference costs in real-world deployments
  • Experience working with open-source LLM ecosystems (Hugging Face, LangChain, etc.)
  • Experience in a fast-growing startup environment
  • Healthcare technology experience or demonstrated interest in healthcare applications
  • Background in building AI systems for regulated environments (HIPAA, FDA, etc.)
  • Track record of innovation and thought leadership in the AI community

Benefits

  • Comprehensive Medical, Dental, Vision & Life insurance
  • HSA (with employer match), FSA, & DCFSA
  • 401(k)
  • 11 Paid Company Holidays
  • Location Flexibility (Remote-first company!)
  • Flexible PTO policy
  • Annual company-wide offsite
  • Periodic team offsites
  • Annual equipment stipend

This job is filled or no longer available.