Staff Software Engineer, ML Platform

closed

Stack AV

📍 Remote - United States, Remote

Job highlights

Summary

The job is for a Machine Learning Engineer on Stack AV's ML Data team. The role involves building infrastructure for machine learning training and implementing a high-throughput inference service. The ideal candidate has experience with ML platforms and scalable infrastructure, and with working across teams.

Requirements

  • Experience with both ML Platforms and building ML-based applications
  • Ability to work across teams
  • Experience building or using ML infra built for a large number of customer teams

Responsibilities

  • Help build infrastructure to support machine learning training
  • Implement a high-throughput inference service using LLMs and a vector database
  • Set the direction for auto-labeling

Preferred Qualifications

  • Deep understanding of design tradeoffs and ability to articulate those tradeoffs and work with others on getting alignment
  • Experience with building ML models or ML infra in the domains of autonomous vehicles, perception, and decision making
  • Experience with model training, model optimization, or large data processing pipelines

Bonus Qualifications

  • Knows how to push the GPU to its limit from Python to CUDA kernel level (bonus point)
  • Built the inference or training loop for a large model (ideally with LLM flavor)
  • Shipped ML products (NLP, computer vision, recommender systems, etc.) at scale to make business impact
  • Has data platform experience building infrastructure for real-time querying / vector databases, batch/stream processing using Ray, Spark, or similar, and Parquet-based object storage (data lake / data warehouse)
  • Knows how to build low-latency / high-throughput batch or stream processing pipelines
  • Knows how to write (readable) high performance C++
  • Has prior AV experience
This job is filled or no longer available