Principal Machine Learning Engineer

The Browser Company
Summary
Join The Browser Company and become a Principal Machine Learning Engineer, working on Dia, the next LLM-powered interface for the internet. Collaborate with a diverse team to fine-tune LLMs, optimize on-device and cloud-based inference, and build evaluation pipelines. You will experiment with new LLMs, integrate them into browser-based use cases, and collaborate with product teams to build AI-powered features. The role involves optimizing model architecture, improving datasets, and prototyping new features. This position requires significant experience in ML model optimization and productionization, particularly with transformer models. The Browser Company offers a competitive salary, comprehensive benefits, flexible work arrangements, and a supportive remote-first environment.
Requirements
- 8+ years of experience optimizing and productionizing modern ML models, especially ones that run in a real-world product environment (bonus if youβve worked closely with transformer models)
- Deep experience fine-tuning open-source LLMs and going beyond simple LoRA fine-tuning
- Production experience with a modern coding language like Python
- Passion for on-device performance and excitement to push the boundaries of what's possible in a browser
- Experience independently running critical projects, shipping ML features, and leading initiatives with minimal guidance
- Pragmatism, motivation by nebulous problems, and excitement to work in a startup environment with quick product validation cycles
- 4+ hours of overlap time with team members in Eastern Time Zone
Responsibilities
- Fine-tune, distill, and optimize LLMs to improve performance, reduce latency, and enhance efficiency for on-device and cloud-based inference
- Improve our on-device model architecture, leveraging frameworks like MLX, ONNX, and TFLite to ensure models run efficiently across different devices
- Experiment with and integrate new LLMs, fine-tuning them for specific browser-based use cases while balancing quality, speed, and resource constraints
- Build evaluation pipelines to track model performance, accuracy, and real-world effectiveness over time
- Collaborate with product ops teams to build and improve datasets that accurately match product needs
- Collaborate with product engineers and designers to prototype and ship AI-powered features that enhance user experience
- Optimize inference strategies, including running models on-device, in the cloud, or in hybrid configurations to maximize throughput and resource usage
Preferred Qualifications
Experience working closely with transformer models
Benefits
- Flexible compensation model with options for salary-optimized, equity-optimized, and balanced offers
- Annual salary range of $250,000 - $330,000 USD
- Comprehensive benefits package with 100% employee premium coverage for medical, dental, and vision, and up to 95% for dependents
- 401k plan
- Flexible vacation policy (15-20 days on average, plus federal holidays)
- Remote-friendly working environment with core working hours of 11 AM-2 PM Eastern Time
- 12 weeks of paid parental leave
- $1,500 USD home office stipend
- Free annual memberships to One Medical (where available), Talkspace, Teladoc, and HealthAdvocate (for US-based employees)