Summary
Join Verana Health's growing Quantitative Sciences team as a Senior Quantitative Scientist, ML/NLP! You will leverage cutting-edge natural language processing and machine learning to analyze clinical data, develop algorithms, and contribute to the development of Verana Qdata®. This role involves collaborating with cross-functional teams, mentoring team members, and presenting findings to a multidisciplinary audience. The ideal candidate possesses a doctorate or master's degree in a quantitative discipline with significant experience in machine learning, NLP, and working with clinical data. Verana Health offers a comprehensive benefits package, including health insurance, 401k matching, flexible vacation, and a wellness stipend.
Requirements
- Doctorate in a quantitative discipline (e.g., data science, computer science, machine learning, biostatistics, health economics) with 3+ years of experience, or Master's with 5+ years of experience
- 5+ years of hands-on experience with messy data (e.g., electronic health records, outcomes data) and analytical methodologies
- 3+ years of hands-on experience with machine learning model implementation & deployment, especially on clinical notes
- 3+ years of hands-on experience with state-of-the-art large language models (e.g., BERT, Longformer, RoBERTa) for use cases such as named entity recognition (NER), text classification, and entity relation extraction
- Strong familiarity with programming languages, especially Python, PySpark, R, SQL
- Strong familiarity with coding platforms, especially Databricks, Amazon SageMaker, Visual Studio Code
- Strong familiarity with unstructured text processing techniques
- Familiarity with clinical datasets and coding systems such as ICD, CPT, and RxNorm
- Ability to work effectively with cross-functional teams
- Clear communication skills and the ability to deliver internal and external presentations
- Ability to prioritize and manage multiple projects with high attention to detail
Responsibilities
- Develop and leverage state-of-the-art advances in natural language processing using pre-trained large language models (LLMs) for analyzing and reasoning over clinical notes and other unstructured data in the context of clinical problems
- Drive cutting-edge research on language modeling with emphasis on scientific accuracy and explainability
- Communicate analysis results via presentations to a multidisciplinary audience using clear, intuitive visualizations
- Establish and maintain best practices for data exploration, end-to-end model development and deployment lifecycle, and data/code/documentation management
- Work on Qdata development to enable commercial projects leveraging real-world data through responsibilities such as creation of study plans, implementation of analyses, development of algorithms, and/or writing of publications
- Collaborate cross-functionally with teams (e.g., Commercial, Product, Medical, Engineering/Technology, etc.) to translate clinical investigation questions into detailed data analytics requirements for internal and external projects
- Provide mentorship and knowledge sharing to team members in standardizing machine learning/natural language processing best practices
Benefits
- We provide 100% health, vision, and dental coverage for employees
- 401K Match
- Flexible vacation plans
- $700 learning and wellness annual stipend
- $25/wk in DoorDash credit