Data Scientist
Biogen
Summary
Join Biogen's Decision & Quality Analytics Innovation (DQAI) team as a 6-month student intern (July-December 2025)! Work alongside experienced data scientists and statisticians, implementing cutting-edge AI models and contributing to the entire AI product development lifecycle. You will collaborate on projects involving data visualization, clinical scenario simulation, and research into AI applications in pharmaceuticals. This remote role offers opportunities to learn and apply state-of-the-art AI methodologies, contributing to Biogen's mission of delivering life-changing medicines. Resume review begins January 2025. The position requires proficiency in programming and familiarity with various AI/ML concepts and tools.
Requirements
- Demonstrated proficiency in at least one programming language (Python, R, etc.)
- Familiarity with NLP/NLG, topic modeling, text analytics, and text mining concepts, and an understanding of their mathematical foundations
- Experience building web applications with the React framework
- Experience with Monte Carlo Simulation
- Experience with NLP packages in Python, such as NLTK, spaCy, Gensim, etc.
- Experience with deep learning frameworks, such as PyTorch, TensorFlow, HuggingFace
- Ability to explore, discover, and import data from multiple sources and prepare it for modeling with SQL and/or Pandas
- Ability to communicate complex technical concepts in a clear and actionable manner
- Willingness to work in a collaborative environment to define practical solutions
- Legal authorization to work in the U.S.
- At least 18 years of age prior to the scheduled start date
- Currently enrolled in an accredited community college, college, or university, with a graduation date no earlier than December 2025
- Currently pursuing a Master's degree in Data Science, Statistics, Bioinformatics, Computer Science, Computational Biology, or related field
Responsibilities
- Collaborate closely with senior data scientists and statisticians to implement and deploy cutting-edge AI models
- Facilitate clinical scenario simulation work
- Develop and prototype data visualizations and dashboards
- Conduct research on the latest AI applications in the pharmaceutical industry
- Engage with stakeholders to communicate key results and deliver predictive and prescriptive insights
- Provide ad-hoc statistical and machine learning support to business partners
- Develop explainable machine learning models and deploy them as interactive dashboards
- Reproduce the latest methodologies from top-tier machine learning research papers, apply them to Biogen's internal data and use cases, and create comprehensive evaluation reports on model performance and limitations
- Create a simulation model for clinical programs
Preferred Qualifications
- Strong data visualization skills and experience with the Streamlit and/or Dash framework in Python are a plus
- Experience with reproducing results from top-tier machine learning conferences is a plus
- Familiarity with GitHub and Linux shell scripting in a cloud-based environment is a plus
- Experience with quality and compliance data and clinical portfolio data in the pharmaceutical industry is a plus
Benefits
This role is remote