Engineering Member, Fine-tuning

Logo of poolside

poolside

πŸ“Remote - United States

Job highlights

Summary

Join poolside's Applied Research team and contribute to transforming pre-trained LLMs into highly capable AI systems for coding. This hands-on role involves building data pipelines, researching and implementing fine-tuning algorithms, and training reward models. You will leverage thousands of GPUs and collaborate with a team focused on creating AI-assisted software development tools. The ideal candidate possesses extensive experience with LLMs, deep learning, and distributed training, along with a strong research background. Poolside offers a remote-first work environment with flexible hours, generous vacation time, health insurance allowance, and various other benefits.

Requirements

  • Experience with Large Language Models (LLM)
  • Deep knowledge of Transformers
  • Strong deep learning fundamentals
  • Good taste in data
  • Fine-tuning experience with LLMs
  • Extensively used and probed LLMs, familiarity of their capabilities and limitations
  • Knowledge of distributed training
  • Strong machine learning and engineering background
  • Research experience
  • Experience in proposing and evaluating novel research ideas
  • Familiar with, or contributed to the state of the art in multiple of the following topics: Fine-tuning and alignment of LLMs, synthetic data generation, continual learning, RLHF, code generation
  • Is comfortable in a rapidly iterating environment
  • Is reasonably opinionated
  • Programming experience
  • Linux
  • Strong algorithmic skills
  • Python with PyTorch or Jax
  • Use modern tools and are always looking to improve
  • Strong critical thinking and ability to question code quality policies when applicable

Responsibilities

  • Research and experiment on ways to specialize foundational models to coding use cases
  • Build and maintain data and training pipelines
  • Keep up with latest research, and be familiar with state of the art in LLMs, alignment, synthetic data generation, code generation
  • Design, analyze, and iterate on training/fine-tuning/data generation experiments
  • Write high-quality, pragmatic code
  • Work as part of a team: plan future steps, discuss, and communicate clearly with your peers

Preferred Qualifications

Prior experience in non-ML programming, especially not in Python

Benefits

  • Fully remote work & flexible hours
  • 37 days/year of vacation & holidays
  • Health insurance allowance for you and dependents
  • Company-provided equipment
  • Wellbeing, always-be-learning and home office allowances
  • Frequent team get togethers
  • Great diverse & inclusive people-first culture

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.