Engineering Member, Fine-tuning at poolside

Summary

Join poolside's Applied Research team and contribute to transforming pre-trained LLMs into highly capable AI systems for coding. This hands-on role involves building data pipelines, researching and implementing fine-tuning algorithms, and training reward models. You will leverage thousands of GPUs and collaborate with a team focused on creating AI-assisted software development tools. The ideal candidate possesses extensive experience with LLMs, deep learning, and distributed training, along with a strong research background. Poolside offers a remote-first work environment with flexible hours, generous vacation time, health insurance allowance, and various other benefits.

Requirements

Experience with Large Language Models (LLM)
Deep knowledge of Transformers
Strong deep learning fundamentals
Good taste in data
Fine-tuning experience with LLMs
Extensively used and probed LLMs, familiarity of their capabilities and limitations
Knowledge of distributed training
Strong machine learning and engineering background
Research experience
Experience in proposing and evaluating novel research ideas
Familiar with, or contributed to the state of the art in multiple of the following topics: Fine-tuning and alignment of LLMs, synthetic data generation, continual learning, RLHF, code generation
Is comfortable in a rapidly iterating environment
Is reasonably opinionated
Programming experience
Linux
Strong algorithmic skills
Python with PyTorch or Jax
Use modern tools and are always looking to improve
Strong critical thinking and ability to question code quality policies when applicable

Responsibilities

Research and experiment on ways to specialize foundational models to coding use cases
Build and maintain data and training pipelines
Keep up with latest research, and be familiar with state of the art in LLMs, alignment, synthetic data generation, code generation
Design, analyze, and iterate on training/fine-tuning/data generation experiments
Write high-quality, pragmatic code
Work as part of a team: plan future steps, discuss, and communicate clearly with your peers

Preferred Qualifications

Prior experience in non-ML programming, especially not in Python

Benefits

Fully remote work & flexible hours
37 days/year of vacation & holidays
Health insurance allowance for you and dependents
Company-provided equipment
Wellbeing, always-be-learning and home office allowances
Frequent team get togethers
Great diverse & inclusive people-first culture

Engineering Member, Fine-tuning

poolside

Summary

Requirements

Responsibilities

Preferred Qualifications

Benefits

Remote

Software Development

Mid-level

Share this job:

Similar Remote Jobs

poolside

Remote

Data

Mid-level

Remote

Software Development

Manager

Remote

Software Development

Mid-level

Remote

Cybersecurity

Mid-level

Apollo.io

Remote

Software Development

Mid-level

Remote

Software Development

Mid-level

Remote

Cybersecurity

Mid-level

Remote

Software Development

Senior

Sana

Remote

Software Development

Senior