LLM Data Researcher

Turing Logo

Turing

📍Remote - United States

Summary

Join Turing, a rapidly growing AI company, as a data-savvy leader to evaluate data quality and its impact on model performance. This high-leverage role focuses on maximizing client impact through smarter data practices. You will define and implement strategies to assess data ROI, build and maintain performance benchmarks, and collaborate with cross-functional teams. Responsibilities include developing data evaluation systems, driving quality processes, identifying data gaps, and translating research insights into client value. The ideal candidate possesses a strong grasp of LLMs and data-model dynamics, a proven track record in benchmark development, and experience designing and interpreting performance metrics. This is a full-time remote opportunity with flexible working hours.

Requirements

  • Strong grasp of LLMs and data-model dynamics (but this is not a model training role)
  • Knowledge of the latest trends in Generative AI and data that is useful for improving foundation models
  • Proven track record in benchmark development, model evaluation, or data-centric infrastructure
  • Strategic thinker with a bias toward impact: can connect data quality work directly to client value
  • Experience designing and interpreting metrics that inform delivery performance
  • Familiarity with annotation workflows, validation processes, and scalable QA systems
  • Solid ML or data science foundation—able to reason about training impact from a data point-of-view

Responsibilities

  • Define and implement strategies to assess the ROI of data across training and fine-tuning pipelines
  • Build and maintain benchmarks that measure performance across key client and internal objectives
  • Collaborate with data operations, research, and delivery teams to align on quality standards and data priorities
  • Develop systems and tooling for continuous data evaluation—measuring what matters, where it matters
  • Drive human-in-the-loop quality processes including pre-delivery validation and annotation feedback loops
  • Identify data gaps and lead targeted acquisitions or refinements, guided by performance metrics
  • Define and/or leverage comprehensive task taxonomy frameworks to structure data annotation efforts and improve training signal quality
  • Translate research insights and data evaluations into client-facing value through better delivery and prioritization

Preferred Qualifications

  • Experience with feedback-driven annotation loops and pre-delivery QA
  • Hands-on experience with taxonomy frameworks and structured data labeling
  • Background in data-centric AI or related research, Linguistics as applied to data annotation efforts, or related fields

Benefits

  • Amazing work culture (Super collaborative & supportive work environment; 5 days a week)
  • Awesome colleagues (Surround yourself with top talent from Meta, Google, LinkedIn etc. as well as people with deep startup experience)
  • Competitive compensation
  • Flexible working hours
  • Full-time remote opportunity

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.