Data Engineering 2, Translational Data Scientist
closed
Valo Health
Summary
Join Valo Health, a technology company revolutionizing drug discovery and development using AI and human-centric data. As a Data Engineer, you'll be a core member of the Translational Data Sciences group, building computational tools and transforming biological data into analysis-ready products. You will contribute to knowledge graph integration, supporting target identification, statistical genetics, and multi-omics modeling. This role involves developing data transformation pipelines, building visualization tools, and working with agile development techniques. The ideal candidate possesses a Bachelor's degree plus 2 years of experience or a recent Master's degree in a relevant field, along with expertise in Python, SQL, and large-scale biology data.
Requirements
- Bachelorβs + 2 years of experience or recent Masterβs degree in computer science, information systems, computational sciences (eg bioinformatics, computational biology), or related fields
- Familiarity with various large-scale biology data types (eg, genomics or transcriptomics)
- Experience in Python and SQL
- Familiarity with releases/versioning of datasets for internal users
- Familiarity with data processing workflows, data platforms (eg, databricks, snowflake), and cloud environments (eg, AWS, GCP)
- Experience with data and software engineering best practices and testing methodologies (data provenance, collaborative development using source control management (git), code versioning, reproducibility, etc)
Responsibilities
- Work on a team developing data transformation pipelines and systems to ingest and harmonize data into Valoβs data ecosystems
- Contribute to the data source backend for our knowledge graphs & knowledge integration
- Build visualization and data extraction tools for knowledge graphs
- Work using agile development techniques
- Learn to provide high quality data through unit tests, validation tests, code reviews
- Be a dynamic and active team member