Principal Machine Learning Engineer

The Browser Company Logo

The Browser Company

πŸ’΅ $250k-$330k
πŸ“Remote - United States

Summary

Join The Browser Company and become a Principal Machine Learning Engineer, working on Dia, the next LLM-powered interface for the internet. Collaborate with a diverse team to fine-tune LLMs, optimize on-device and cloud-based inference, and build evaluation pipelines. You will experiment with new LLMs, integrate them into browser-based use cases, and collaborate with product teams to build AI-powered features. The role involves optimizing model architecture, improving datasets, and prototyping new features. This position requires significant experience in ML model optimization and productionization, particularly with transformer models. The Browser Company offers a competitive salary, comprehensive benefits, flexible work arrangements, and a supportive remote-first environment.

Requirements

  • 8+ years of experience optimizing and productionizing modern ML models, especially ones that run in a real-world product environment (bonus if you’ve worked closely with transformer models)
  • Deep experience fine-tuning open-source LLMs and going beyond simple LoRA fine-tuning
  • Production experience with a modern coding language like Python
  • Passion for on-device performance and excitement to push the boundaries of what's possible in a browser
  • Experience independently running critical projects, shipping ML features, and leading initiatives with minimal guidance
  • Pragmatism, motivation by nebulous problems, and excitement to work in a startup environment with quick product validation cycles
  • 4+ hours of overlap time with team members in Eastern Time Zone

Responsibilities

  • Fine-tune, distill, and optimize LLMs to improve performance, reduce latency, and enhance efficiency for on-device and cloud-based inference
  • Improve our on-device model architecture, leveraging frameworks like MLX, ONNX, and TFLite to ensure models run efficiently across different devices
  • Experiment with and integrate new LLMs, fine-tuning them for specific browser-based use cases while balancing quality, speed, and resource constraints
  • Build evaluation pipelines to track model performance, accuracy, and real-world effectiveness over time
  • Collaborate with product ops teams to build and improve datasets that accurately match product needs
  • Collaborate with product engineers and designers to prototype and ship AI-powered features that enhance user experience
  • Optimize inference strategies, including running models on-device, in the cloud, or in hybrid configurations to maximize throughput and resource usage

Preferred Qualifications

Experience working closely with transformer models

Benefits

  • Flexible compensation model with options for salary-optimized, equity-optimized, and balanced offers
  • Annual salary range of $250,000 - $330,000 USD
  • Comprehensive benefits package with 100% employee premium coverage for medical, dental, and vision, and up to 95% for dependents
  • 401k plan
  • Flexible vacation policy (15-20 days on average, plus federal holidays)
  • Remote-friendly working environment with core working hours of 11 AM-2 PM Eastern Time
  • 12 weeks of paid parental leave
  • $1,500 USD home office stipend
  • Free annual memberships to One Medical (where available), Talkspace, Teladoc, and HealthAdvocate (for US-based employees)

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.