Remote Staff / Senior Staff Data Engineer

closed

Valo Health

๐Ÿ“Remote - Worldwide

Job highlights

Summary

Join Valo Health as a Staff / Senior Staff Data Engineer to lead the development of complex initiatives transforming real-world data into analysis-ready data products for internal teams.

Requirements

  • Bachelor's degree + 8 (Staff) / 10 (Senior Staff) years of experience, MS + 6/8 years, or PhD + 5/7 years, in computer science, information systems, or data science
  • 5+ years of experience in a technical software engineering or data engineering role: data ingestion, streaming technologies, troubleshooting data pipelines (e.g., Prefect, Airflow), and implementing CI/CD practices
  • Production programming experience in Python and SQL; experience with cloud compute and big data tools (e.g., Spark)
  • 3+ years of experience in a professional role gathering requirements and understanding the goals of customers and data users
  • Experience with EHR/EMR data and medical coding ontologies (e.g., ICD, ATC, LOINC, SNOMED)
  • Experience with data engineering best practices and testing methodologies (data provenance, collaborative development using source control management (Git), code versioning, reproducibility, etc.)

Responsibilities

  • Build, maintain, and extend data transformation pipelines and systems to ingest and harmonize third-party EHR data into Valo's data ecosystems
  • Define Valo's EHR data models and pipelines (Spark, SQL) in a centralized data ecosystem and semi-isolated cloud environments
  • Work closely with data providers and in-house data users to integrate third-party EHR data with Valo's standardized data
  • Maintain and extend data integration (standardization & harmonization) & data quality processes to improve quality, reliability, and FAIRness
  • Ensure conceptual accuracy and generalizability of data: do standardized derived features represent clinical concepts in repeatable ways?
  • Simplify how data scientists access, transform, and use their data
  • Promote consistent data usage patterns, including version management, shared ontologies & data dictionaries
  • Support internal data users both directly and by composing demos, how-tos, and reference documentation
  • Provide technical leadership within the translational data engineering team
  • Simplify how data engineers build, maintain, and extend their data pipelines
  • Advise colleagues on data transformations and database design
  • Provide critical feedback and encourage best practices within the data engineering team
  • Participate in the creation and maintenance of technical documentation

This job is filled or no longer available.