Remote Staff / Senior Staff Data Engineer
Valo Health
Remote - Worldwide
Job highlights
Summary
Join Valo Health as a Staff / Senior Staff Data Engineer to lead the development of complex initiatives transforming real-world data into analysis-ready data products for internal teams.
Requirements
- Bachelor's degree + 8 (Staff) / 10 (Senior Staff) years of experience, MS + 6/8 years, or PhD + 5/7 years, in computer science, information systems, or data science
- 5+ years of experience in a technical software engineering or data engineering role: data ingestion, streaming technologies, troubleshooting data pipelines (e.g., Prefect, Airflow), and implementing CI/CD practices
- Production programming experience in Python and SQL, plus cloud compute and big data tools (e.g., Spark)
- 3+ years of experience in a professional role gathering requirements and understanding the goals of customers and data users
- Experience with EHR/EMR data and medical coding ontologies (e.g., ICD, ATC, LOINC, SNOMED)
- Experience with data engineering best practices and testing methodologies (data provenance, collaborative development using source control management (Git), code versioning, reproducibility, etc.)
Responsibilities
- Build, maintain, and extend data transformation pipelines and systems to ingest and harmonize third-party EHR data into Valo's data ecosystems
- Define Valo's EHR data models and pipelines (Spark, SQL) in a centralized data ecosystem and semi-isolated cloud environments
- Work closely with data providers and in-house data users to integrate third-party EHR data with Valo's standardized data
- Maintain and extend data integration (standardization & harmonization) & data quality processes to improve quality, reliability, and FAIRness
- Ensure conceptual accuracy and generalizability of data: do standardized derived features represent clinical concepts in repeatable ways?
- Simplify how data scientists access, transform, and use their data
- Promote consistent data usage patterns, including version management, shared ontologies & data dictionaries
- Support internal data users both directly and by composing demos, how-tos, and reference documentation
- Provide technical leadership within the translational data engineering team
- Simplify how data engineers build, maintain, and extend their data pipelines
- Advise colleagues on data transformations and database design
- Provide critical feedback and encourage best practices within the data engineering team
- Participate in the creation and maintenance of technical documentation
This job is filled or no longer available