Engineering Member

Logo of poolside

poolside

πŸ“Remote - United States

Job highlights

Summary

Join poolside, a company building AI to drive economically valuable work and scientific progress, and become part of our pre-training team focused on building out our distributed training of Large Language Models (LLMs). This hands-on role involves programming and implementing LLM architectures, distributed training code, and researching optimizations and new architectures. You will have access to thousands of GPUs. Your mission is to train the best foundational models for source code generation in minimum time and with maximum hardware utilization. The role requires deep knowledge of Transformers, experience with LLM training, and strong programming skills in Python, C/C++, CUDA, and Triton. We offer a fully remote work environment with flexible hours, generous vacation time, health insurance, company equipment, and various allowances.

Requirements

  • Experience with Large Language Models (LLM)
  • Deep knowledge of Transformers is a must
  • Knowledge/Experience with cutting-edge training tricks
  • Knowledge/Experience of distributed training
  • Trained LLMs from scratch
  • Coded LLMs from scratch
  • Knowledge of deep learning fundamentals
  • Strong machine learning and engineering background
  • Research experience
  • Programming experience
  • Linux
  • Strong algorithmic skills
  • Python with PyTorch or Jax
  • C/C++, CUDA, Triton
  • Use modern tools and are always looking to improve
  • Strong critical thinking and ability to question code quality policies when applicable

Responsibilities

  • Follow the latest research on LLMs and source code generation
  • Propose and evaluate innovations, both in the quality and the efficiency of the training
  • Do LLM-Ops: babysitting and analyzing the experiments, iterating
  • Write high-quality Python, Cython, C/C++, Triton, CUDA code
  • Work in the team: plan future steps, discuss, and always stay in touch

Preferred Qualifications

  • Author of scientific papers on any of the topics: applied deep learning, LLMs, source code generation, etc. - is a nice to have
  • Can freely discuss the latest papers and descend to fine details
  • Is reasonably opinionated
  • Prior experience in non-ML programming, especially not in Python - is a nice to have

Benefits

  • Fully remote work & flexible hours
  • 37 days/year of vacation & holidays
  • Health insurance allowance for you and dependents
  • Company-provided equipment
  • Wellbeing, always-be-learning and home office allowances
  • Frequent team get togethers
  • Great diverse & inclusive people-first culture

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let poolside know you found this job on JobsCollider. Thanks! πŸ™