Data Engineer II

Valo Health

💵 $125k-$170k
📍Remote - Worldwide

Summary

Join Valo Health, a technology company revolutionizing drug discovery and development through AI and human-centric data, as a Data Engineer. You will be a core member of the Translational Data Sciences group, collaborating with data engineers and scientists to build computational tools and address critical scientific questions. Your contributions will focus on enhancing Valo’s EHR data engineering capabilities, transforming structured and unstructured data into analysis-ready products. This involves close collaboration with data engineers and scientists supporting epidemiology, patient clustering, and biomarker identification. You will work on data transformation pipelines, contribute to the data source backend for machine learning, build visualization and data extraction tools, and employ agile development techniques. The role requires a Bachelor’s degree and 2 years of experience or a Master’s degree in a relevant field, along with proficiency in Python and SQL.

Requirements

  • Bachelor’s degree plus 2 years of experience, or a recent Master’s degree, in computer science, information systems, computational sciences (e.g., bioinformatics, computational biology, epidemiology), or a related field
  • Familiarity with various clinical data types (e.g., EHR/EMR, clinical trials, claims)
  • Must have experience in Python and SQL (e.g., PostgreSQL, MySQL)
  • Familiarity with releases/versioning of datasets for internal users
  • Familiarity with data processing workflows, data platforms (e.g., Spark, Snowflake), and cloud environments (e.g., AWS, GCP)
  • Experience with data and software engineering best practices and testing methodologies (data provenance, collaborative development using source control management such as Git or Bitbucket, code versioning, reproducibility, etc.)

Responsibilities

  • Work on a team developing data transformation pipelines and systems to ingest and harmonize data in Valo’s data ecosystems
  • Contribute to the data source backend for our epidemiology and patient machine learning
  • Build visualization and data extraction tools
  • Work using agile development techniques
  • Learn to provide high-quality data through unit tests, validation tests, and code reviews
  • Be a dynamic and active team member

Preferred Qualifications

Familiarity with medical coding systems (e.g., ICD-9/10, CPT, ATC, LOINC, SNOMED, UMLS) is a plus

Benefits

$125,000 — $170,000 USD
