Summary
The job description is for a Machine Learning Engineer position at Factored, specializing in Retrieval-Augmented Generation (RAG) models. The ideal candidate will design, develop, and optimize RAG models, enhance their performance, fine-tune LLMs, integrate them into production environments, and create features like predictive analytics and intelligent data extraction.
Requirements
- Bachelorβs or Masterβs degree in Computer Science, Statistics, Mathematics, or a related field
- 3-4 years of hands-on experience in developing and deploying machine learning models
- Strong Python skills are required
- 3+ years of experience with production NLP and deep learning models using PyTorch/TensorFlow
Responsibilities
- Design, develop, and optimize RAG models that combine retrieval-based and generation-based approaches to solve complex problems
- Enhance the performance of RAG models through innovative algorithmic techniques and fine-tuning
- Fine-tune and adapt LLMs for specific tasks and domains within the RAG framework
- Work with cross-functional teams to integrate RAG models into production environments and ensure seamless deployment
- Utilize machine learning methods and advanced techniques like LLMs to create effective AI solutions
- Design, deliver, and maintain features such as predictive analytics, automated risk assessments, intelligent data extraction, and personalized insights
- Write well-designed, maintainable, and testable code. Create clear and comprehensive design documentation
Preferred Qualifications
- Proven experience utilizing cutting-edge methodologies such as Retrieval-Augmented Generation (RAG) and other innovative techniques to enhance model performance and create sophisticated AI solutions
- Experience with cloud computing platforms (AWS, GCP) or equivalent on-premise platforms
Benefits
At Factored, we invest in you and support your career and professional growth in many meaningful ways