Senior Staff Technical Lead Manager, ML Fundamentals

Stack AV Logo

Stack AV

πŸ“Remote - United States

Summary

Join Stack's ML Fundamentals team as a Technical Lead Manager (TLM) and lead the development of core tooling for ML engineers. You will drive the vision and development of key tools powering ML development, focusing on autonomous driving. Responsibilities include leading a team of engineers, setting technical direction, collaborating cross-functionally, defining cloud resource utilization strategy, and proactively identifying and resolving bottlenecks. You will also drive execution of the team's roadmap and ensure features are delivered in alignment with key partners. The ideal candidate possesses strong leadership experience, expertise in distributed ML training, and experience with ML release workflows and tooling. A background in software engineering and architecture, proficiency in Python and C++, and experience with Deep Learning Models are essential.

Requirements

  • Experience with strong, hands-on leadership of a team of engineers
  • Experience with distributed ML training frameworks and ML training optimization
  • Experience with common ML release workflows, and building tooling that improves developer efficiency and product quality
  • Strong experience in software engineering and software architecture
  • Knowledge of Python and C++

Responsibilities

  • Lead a team of engineers to develop critical ML tooling and algorithms to 1) support autonomous driving development and release, 2) increase training and inference performance and reliability; and 3) optimize the ML development lifecycle
  • Set the technical direction for the team, and work cross-functionally to develop tooling that improves safety
  • Partner with other teams across the company to identify key requirements, dependencies, and prioritize the key use cases that support business outcomes
  • Define the strategy and optimize cloud resource utilization at Stack, focusing on maximizing ML training performance
  • Proactively identify bottlenecks and limitations in ML developer efficiency and product quality across the company, and develop novel solutions to improve dev iteration speed and safety/reliability
  • Drive execution of the team’s roadmap, review their code and SW architecture, and deliver features in lockstep with key partners

Preferred Qualifications

  • Experience with architecting, training, and deploying Deep Learning Models is a plus
  • Experience with CUDA a plus

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Remote Jobs