Remote Staff Software Engineer, ML Platform

Logo of Stack AV

Stack AV

πŸ“Remote - United States

Job highlights

Summary

Join our ML Training and DevEx team at Stack to provide a reliable, scalable, and easy-to-use training framework for modeling needs of Stack AV. The team is responsible for the overall developer experience of ML engineers, including building tools for testing, validation, and understanding models and the data used to train them.

Requirements

  • Experience with both ML Platforms and building ML-based applications
  • Experience building scalable, reliable infra at a fast-paced environment
  • Ability to work across teams
  • Experience building or using ML infra built for a large number of customer teams
  • A deep understanding of design tradeoffs and ability to articulate those tradeoffs and work with others on getting alignment

Responsibilities

  • Provide a reliable, scalable, and easy-to-use training framework for modeling needs of Stack AV
  • Be responsible for the overall developer experience of ML engineers, including building tools for testing, validation, and understanding models and the data used to train them

Preferred Qualifications

  • Knows how to push the GPU to its limit from Python to CUDA kernel level
  • Built the inference or training loop for a large model (ideally with LLM flavor)
  • Shipped ML products (NLP, computer vision, recommender systems, etc.) at scale to make business impact
  • Knows how to build low latency / high throughput batch or stream processing pipelines
  • Knows how to write (readable) high performance C++
  • Prior AV experience

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let Stack AV know you found this job on JobsCollider. Thanks! πŸ™