Remote Technical Lead, Large Language Model Training Data

Logo of Turing

Turing

📍Remote - United States

Job highlights

Summary

Join Turing as a highly skilled research and engineering lead to collaborate with researchers in leading LLM companies, implement data generation processes, and design scalable data quality and throughput systems. The ideal candidate has a strong technical foundation, proficiency in multiple programming languages, experience with coding languages and environments, and a fast-iteration and fast-learning attitude.

Requirements

  • Strong background in coding, software development, or related fields
  • Proficiency across multiple programming languages with deep expertise across at least one of the following: Python, Java Script, Java, React
  • Experience with coding languages and environments, including the ability to review, correct, and explain code effectively
  • Understanding of data annotation workflows, especially for coding tasks, is a plus

Responsibilities

  • Collaborate to understand data needs: Work with the researchers in leading LLM companies to understand the training data needs for the next generation of LLMs
  • Implementation for data: Work with the internal R&D team and engineers to design the process and system that can generate the needed training data in the most effective ways
  • Data Quality and Throughput: Work with internal operational leaders to design a scalable process that can leverage the knowledge of hundreds of knowledge workers assisted with existing LLM capability to build high quality data efficiently

Benefits

  • Amazing work culture (Super collaborative & supportive work environment; 5 days a week)
  • Awesome colleagues (Surround yourself with top talent from Meta, Google, LinkedIn etc. as well as people with deep startup experience)
  • Competitive compensation
  • Flexible working hours
  • Full-time remote opportunity

Job description

About Turing

Based in Palo Alto, California, Turing is the world’s first AI-powered tech services company. It has reimagined tech services from the ground up with AI by offering AI-vetted and matched talent, AI-accelerated development, and access to AI transformation experts who have built many of the most iconic Silicon Valley companies.

Founded in 2018, the company has experienced tremendous growth with three million global developers on its Talent Cloud and 900+ clients. Turing has received numerous awards, including Forbes’s 2022 “One of America’s Best Startup Employers,” being ranked #1 in The Information’s 2021 Annual List of most promising B2B Companies and Fast Company’s “Annual List of the World’s Most Innovative Companies.”

The company’s leadership team comprises both AI technologists from leading organizations including Meta, Google, Microsoft, Apple, Amazon, Twitter, Stanford, Caltech, MIT as well as tech consulting veterans from Accenture, Cognizant, Capgemini, McKinsey, Bain, and more.

Job Overview:

We are seeking a highly skilled research and engineering lead to:

  • Collaboration to understand data needs: Work with the researchers in leading LLM companies to understand the training data needs for the next generation of LLMs, in the domain of coding skills, or advanced Maths, or Robotics;
  • Implementation for data: Work with the internal R&D team and engineers to design the process and system that can  generate the needed training data in the most effective ways;
  • Data Quality and Throughput: Work with internal operational leaders to design a scalable process that can leverage the knowledge of hundreds of knowledge workers assisted with existing LLM capability to build high quality data efficiently

Qualifications:

  • Technical Expertise:

    • You need to have a very strong technical foundation
    • Strong background in coding, software development, or related fields.
    • Proficiency across multiple programming languages with deep expertise across at least one of the following: Python, Java Script, Java, React
    • Experience with coding languages and environments, including the ability to review, correct, and explain code effectively.
    • Understanding of data annotation workflows, especially for coding tasks, is a plus.
  • Fast-iteration and fast-learning attitude:

    • You need to iterate fast with the leading LLM researchers in this cutting-edge space
    • Comfortable to work in a highly iterative pattern with the researchers and engineers – there won’t be a quarterly plan because we are exploring the unknown.
    • Learn quickly into the depth of LLM training domain – even though you may have taken classes or ran projects of machine learning, the LLM training domain has been rapdily changing. Every month, new sub-domain appears and new depth appears
    • Learn quickly into a new domain that you don’t know – it is fascinating how much software engineering knowledge can help advance AGI: coding, reasoning, planning, maths, physics, chemistry… you need to be able to learn a new domain quickly (with the help of chatGPT)!
  • Communication and collaboration

    • Brainstorm with researchers on what is the best dataset to develop a certain capability for the next generation of LLMs
    • Translate the research ideas into the fastest step-by-step way to iterate
    • Understand and verify the operational plan
    • Follow up with researchers on how to use the data in LLM trainings and how to measure the effectiveness of the dataset
  • Ownership & Urgency:

    • A high sense of ownership and responsibility for the team’s output and quality.
    • Strong problem-solving skills, with the ability to think critically and act quickly when needed.
  • Interpersonal Skills:

    • Exceptional interpersonal and communication skills, with the ability to work well with diverse teams and clients.
    • Ability to motivate and inspire a team, fostering a positive and productive work environment.

Advantages of joining Turing:

  • Amazing work culture (Super collaborative & supportive work environment; 5 days a week)
  • Awesome colleagues (Surround yourself with top talent from Meta, Google, LinkedIn etc. as well as people with deep startup experience)
  • Competitive compensation
  • Flexible working hours
  • Full-time remote opportunity

Don’t meet every single requirement? Studies have shown that women and people of color are less likely to apply to jobs unless they meet every single qualification. Turing is proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender identity, sexual orientation, age, marital status, disability, protected veteran status, or any other legally protected characteristics. At Turing we are dedicated to building a diverse, inclusive and authentic workplace  and celebrate authenticity, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyways. You may be just the right candidate for this or other roles.

For applicants from the European Union, please review Turing’s GDPR notice here.

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let Turing know you found this job on JobsCollider. Thanks! 🙏