Deepgram is hiring a Research Scientist, Remote - Worldwide

Research Scientist

🏢 Deepgram

💵 ~$188k-$283k
📍 Worldwide

Summary

Deepgram, a voice AI company, is hiring a Research Scientist. The role involves designing and carrying out experimental programs to build new speech and language AI foundation models, driving large-scale training jobs on massive distributed computing infrastructure, optimizing model architectures, documenting results, and staying up to date with the latest advances in deep learning. It also calls for strong communication and software engineering skills.

Requirements

  • PhD in Physics, Electrical Engineering, Computer Science or another related field
  • Prior experience in designing and conducting experimental programs aimed at understanding complex phenomena, with the ability to rapidly iterate and change course as needed
  • Proven experience building models from a blank page and owning the entire deep learning stack including data curation, characterization and cleaning, architecture design and model building, distributed large-scale training, and model optimization for inference
  • Strong communication skills and the ability to translate complex concepts into simple terms, depending on the target audience
  • Strong software engineering skills with particular emphasis on developing clean, modular code in Python and working with PyTorch

Responsibilities

  • Design and carry out experimental programs to build new speech and language AI foundation models across modalities and tasks
  • Drive large-scale training jobs successfully on massive distributed computing infrastructure
  • Optimize model architectures to make them as fast and memory-efficient as possible; deploy new models into production for use at massive scale
  • Document and present results and complex technical concepts clearly for internal and external audiences
  • Stay up to date with the latest advances in deep learning with a particular eye towards their implications and applications within our products

Preferred Qualifications

  • Prior industry experience in building deep learning models to solve complex problems, with a solid understanding of the applications and implications of different neural network types, architectures, and loss mechanisms
  • Deep understanding and experience working with state-of-the-art network architectures including transformers
  • Understanding of different parallelism paradigms for efficient distributed training
