GenAI Optimization Engineer

Modular

💵 $91k-$242k
📍 Remote - United States, Canada

Summary

Join Modular, a company revolutionizing AI infrastructure, and become part of a team building the next-generation AI platform, MAX. As an E2E Optimizations team member, you will design, implement, and tune features for Generative AI, lead cross-functional projects, collaborate with experts, and contribute to the MAX tech stack using various programming languages. You will monitor research and identify opportunities for framework improvements. This role requires in-depth Python knowledge, 3+ years of experience in ML/DL/Generative AI, and experience with framework-level optimizations and profiling. The position offers competitive compensation, including a significant equity component, along with benefits such as premier insurance plans, 401k matching, flexible paid time off, and team-building events. Remote work options are available for US and Canada-based candidates.

Requirements

  • In-depth knowledge of the Python programming language
  • 3+ years of working experience in Machine Learning, Deep Learning, or Generative AI
  • Experience implementing framework-level optimizations for Generative AI use cases
  • Experience profiling and optimizing GenAI applications
  • Deep interest in machine learning technologies and use cases
  • Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture

Responsibilities

  • Design, scope, implement, and tune features for Generative AI use cases in the MAX framework
  • Plan and lead cross-functional projects spanning multiple teams and domains
  • Collaborate with subject matter experts within Modular to enable features across different parts of the stack
  • Contribute to the MAX tech stack across multiple languages, including Mojo, Python, and C++
  • Monitor the latest research channels and identify potential opportunities for the MAX framework

Preferred Qualifications

  • Experience using Machine Learning frameworks like PyTorch, TensorFlow, etc.
  • CUDA/GPU Programming and Optimization experience
  • Experience with LLVM/MLIR/Compilers
  • Experience working with distributed/parallel programming models and an understanding of parallel hardware

Benefits

  • Premier insurance plans
  • Up to 5% 401k matching
  • Flexible paid time off
  • Competitive compensation, including stock options
  • Team-building events
  • Remote work options (US and Canada)
