Summary
Join Wing's M32 Labs, a Silicon Valley-based AI company, as a Machine Learning/AI Engineer. You will design, build, and deploy AI agents using cutting-edge technologies like RAG and LangChain. This fast-paced role demands innovation and collaboration within small, elite teams. Expect rapid career growth and exceptional compensation for top performers. The position is remote, offering a flexible work environment and competitive benefits.
Requirements
- Bachelor’s or Master’s degree in Computer Science, Data Science, Machine Learning, or related field—or equivalent practical experience
- Proven track record (2+ years) in NLP, conversational AI, or machine learning roles, ideally with hands-on deployment experience
- Proficiency in Python; familiarity with frameworks like PyTorch, TensorFlow, or Hugging Face
- Experience with libraries such as spaCy, Rasa, or NLTK, and an understanding of dialog systems
- Hands-on experience designing retrieval-augmented architectures and working with vector stores (Pinecone, Weaviate, Chroma)
- Knowledge of orchestrating multi-step AI workflows, prompt engineering, and advanced conversation management
- Comfort with data engineering tasks—managing and preprocessing large text corpora or domain-specific datasets
- Familiarity with containerization (Docker) and Kubernetes-based deployments on AWS, GCP, or Azure
- Awareness of PhiData or similar data governance/privacy solutions, and compliance requirements (GDPR, HIPAA, etc.) relevant to enterprise AI
- Understanding of secure coding and data handling practices
- Thrives on exploring novel techniques and pushing the boundaries of what AI can achieve for specific industry verticals
- Comfortable with short iteration cycles, rapid prototyping, and a minimal-bureaucracy culture
- Excellent at sharing ideas with both technical and non-technical stakeholders, and open to feedback and knowledge sharing
Responsibilities
- Design, build, and refine NLP models (intent recognition, entity extraction, dialogue management) for advanced conversational or task-focused AI agents
- Integrate speech recognition (STT) and text-to-speech (TTS) capabilities when needed, leveraging cloud APIs or open-source solutions
- Architect and implement pipelines that retrieve relevant context from knowledge bases or document stores, enhancing LLM outputs with accurate, domain-specific information
- Utilize vector databases (e.g., Pinecone, Weaviate, Chroma) to index content for semantic search and real-time retrieval
- Use LangChain or similar frameworks to organize multi-step AI workflows, combining prompts, data retrieval, and dynamic decision-making in a coherent pipeline
- Develop strategies for prompt engineering, error handling, and iterative refinement to improve user interactions and AI accuracy
- Integrate PhiData (or comparable tools) to ensure strong data governance, security, and privacy across all stages of model training and deployment
- Contribute to best practices for handling sensitive or proprietary data, adhering to compliance requirements in various industries
- Train, fine-tune, and deploy LLMs or custom ML models using frameworks like PyTorch, TensorFlow, or Hugging Face Transformers
- Leverage containerization (Docker) and orchestration (Kubernetes) on cloud platforms (AWS, GCP, Azure) to ensure scalable, low-latency deployments
- Embrace a lightweight, iterative workflow to quickly develop proofs-of-concept, validate assumptions, and pivot based on user or stakeholder feedback
- Conduct ongoing experiments to benchmark performance, refine models, and stay ahead of emerging AI/ML trends
- Work closely with cross-functional teams (backend, product, design) to integrate AI solutions seamlessly into our vertical-focused platforms
- Document and share best practices in NLP, retrieval methods, and AI model deployment across the organization
Preferred Qualifications
- Familiarity with Cursor AI or similar rapid development tools
- Comfortable with distributed teams, asynchronous communication, and remote collaboration
- Excellent verbal and written communication in English
- Demonstrated ability to tackle problems from multiple angles and propose innovative solutions
- Experience in backend JavaScript/TypeScript development is a plus
- Exposure to or hands-on experience with PHP frameworks is also a plus
- Prior exposure to building or integrating large-scale web platforms and familiarity with modern web technologies
Benefits
- Competitive Pay: Above-market compensation for exceptional talent
- Rapid Pay Increases for top performers: For exceptional performance, we are willing to double your compensation within 1 year
- Health Benefits: Reimbursement for health insurance premiums
- Performance Bonuses: Significant rewards for exceptional contributions
- Upskilling Budget: Support for online professional development
- Flexible Work Environment: Remote-first flexibility with in-office collaboration when needed
- AI software licenses for faster development
- Food Delivery Reimbursement: Late night Swiggy/Zomato reimbursement of 2,000 INR per month
- Gym Reimbursement: Gym reimbursement of 4,000 INR per month
- Tech Setup: Budget for tech set up provided after 6 months of employment
- US HQ Opportunities: Top performers may have the opportunity to explore international roles within our US-based headquarters, including potential emigration opportunities, subject to availability and company needs, and after at least 2 years of employment