Engineering Member, Fine-tuning
poolside
πRemote - United States
Please let poolside know you found this job on JobsCollider. Thanks! π
Job highlights
Summary
Join poolside's Applied Research team and contribute to transforming pre-trained LLMs into highly capable AI systems for coding. This hands-on role involves building data pipelines, researching and implementing fine-tuning algorithms, and training reward models. You will leverage thousands of GPUs and collaborate with a team focused on creating AI-assisted software development tools. The ideal candidate possesses extensive experience with LLMs, deep learning, and distributed training, along with a strong research background. Poolside offers a remote-first work environment with flexible hours, generous vacation time, health insurance allowance, and various other benefits.
Requirements
- Experience with Large Language Models (LLM)
- Deep knowledge of Transformers
- Strong deep learning fundamentals
- Good taste in data
- Fine-tuning experience with LLMs
- Extensively used and probed LLMs, familiarity of their capabilities and limitations
- Knowledge of distributed training
- Strong machine learning and engineering background
- Research experience
- Experience in proposing and evaluating novel research ideas
- Familiar with, or contributed to the state of the art in multiple of the following topics: Fine-tuning and alignment of LLMs, synthetic data generation, continual learning, RLHF, code generation
- Is comfortable in a rapidly iterating environment
- Is reasonably opinionated
- Programming experience
- Linux
- Strong algorithmic skills
- Python with PyTorch or Jax
- Use modern tools and are always looking to improve
- Strong critical thinking and ability to question code quality policies when applicable
Responsibilities
- Research and experiment on ways to specialize foundational models to coding use cases
- Build and maintain data and training pipelines
- Keep up with latest research, and be familiar with state of the art in LLMs, alignment, synthetic data generation, code generation
- Design, analyze, and iterate on training/fine-tuning/data generation experiments
- Write high-quality, pragmatic code
- Work as part of a team: plan future steps, discuss, and communicate clearly with your peers
Preferred Qualifications
Prior experience in non-ML programming, especially not in Python
Benefits
- Fully remote work & flexible hours
- 37 days/year of vacation & holidays
- Health insurance allowance for you and dependents
- Company-provided equipment
- Wellbeing, always-be-learning and home office allowances
- Frequent team get togethers
- Great diverse & inclusive people-first culture
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
- πUnited States
- πUnited States
- π°$175k-$234kπWorldwide
- πUnited States
- πWorldwide
- πUnited States
- πUruguay
- π°$175k-$225kπUnited States