Data Annotation Specialist

DocPlanner
Summary
Join Docplanner Group as a Data Annotation Specialist and contribute to the development of AI-driven products in the medical domain. Your main focus will be preparing and curating linguistically and medically oriented datasets for machine learning and natural language processing (NLP) initiatives. Collaborate with annotators, NLP experts, and machine learning engineers to ensure data accuracy and relevance. This role requires a strong linguistic background and experience in data annotation, transcription, and text analysis, along with familiarity with medical terminology. You will be responsible for preparing and maintaining datasets, creating annotation conventions, performing audio transcriptions, developing text patterns, and evaluating data quality. The ideal candidate will possess exceptional attention to detail and strong collaboration skills. Docplanner offers a variety of benefits, including healthcare insurance, wellness programs, paid time off, an ESOP, and flexible work arrangements.
Requirements
- Languages: Spanish Native and Full Professional Level of English
- Linguistic Expertise: A degree or strong background in linguistics, computational linguistics, or a related field
- Medical Knowledge: Familiarity with medical terminology or prior experience in the healthcare domain
- Data Annotation: Hands-on experience with data annotation, transcription, and labeling tools
- Text Analysis: Proficiency in creating and managing text patterns and dictionaries for NLP purposes
- Attention to Detail: Exceptional ability to assess and improve data quality
- Collaboration Skills: Strong interpersonal skills to coordinate with annotators and technical teams
Responsibilities
- Prepare and maintain medically oriented datasets for machine learning purposes, ensuring high-quality data annotation and consistency
- Create and refine annotation and labeling conventions, as well as perform audio transcriptions (aligning audio with text and correcting text based on audio)
- Develop and implement text patterns to detect and extract named entities, such as names, addresses, organization names, ID numbers, and medical terms
- Prepare and maintain dictionaries and taxonomies for use in NLP systems
- Evaluate the correctness and accuracy of words, sentences, and text structures, ensuring linguistic quality and consistency
- Supervise the work of annotators and maintain close collaboration with NLP and machine learning engineers to address project needs and challenges
Preferred Qualifications
- Previous experience in AI, machine learning, or data preparation projects
- Knowledge of multiple languages or dialects beyond the primary working language
- Exposure to medical coding systems (e.g., ICD, CPT) or electronic health records (EHR)
- Experience leading linguistic teams
Benefits
- Healthcare insurance
- Wellness that works for you – from gym memberships to mental health support, we’ve got you covered
- Time off that counts – whether it’s a vacation, your birthday, or just a day to recharge, we believe in balance
- ESOP (Employee Stock Ownership Plan) after 6 months with us—because we believe in sharing our success!
- Local Perks – Depending on your location, you will be entitled to local benefits like meal vouchers (ticket restaurant), transport allowances, or extended parental leave
- Career Growth – We’re growing, and so can you! You’ll find lots of chances to learn, develop, and explore new paths—whether within your team or through cross-functional projects
- A Truly Global Team – Work with talented people from all over the world in a diverse and inclusive environment
- Flexibility That Works for You – Remote work and flexible hours aren’t just buzzwords here. While the extent of flexibility depends on your role and team, we value results over rigid schedules. Prefer an office setting? You're welcome at any of our hubs in Barcelona, Warsaw, Curitiba, Rio de Janeiro, Mexico City, Bogotá, Munich, Rome or Bologna