
Data Engineer

Our Future Health UK
Summary
Join Our Future Health as a Data Engineer (Bioinformatics) and contribute to the UK's largest health research program. You will design, build, and test data pipelines for genomic data, using various technologies and collaborating with a multidisciplinary team. Responsibilities include developing data transformation logic, creating prototypes for complex pipelines, and ensuring data quality and accessibility. You will work closely with researchers to meet their data needs and stay updated on best practices in data engineering. This role requires proficiency in Python, experience with bioinformatics tools, and a strong understanding of data governance and security. Our Future Health offers a competitive salary and benefits package, including a generous pension scheme, holiday allowance, enhanced parental leave, professional development opportunities, and flexible working arrangements.
Requirements
- Experience working in an Agile development team
- Highly proficient in Python
- Understanding of containerisation using Docker and deployment with Kubernetes
- Experience with version control (Git/Github)
- Follow best practices like code review and clean code unit tests
- Understanding of information governance and data security approaches appropriate for sensitive health data following ISO27001
- Detailed knowledge and understanding of genomic data
- Experience using bioinformatics file standards (VCF, BGEN etc) and tools (PLINK, bcftools, QCtools etc)
- Experience in validating and QCing complex genomic datasets
- Experience building and maintaining robust, scalable and efficient pipelines capable of processing very large amounts of data from one or multiple systems
- You know how to create repeatable and reusable products
- Experience with workflow management tools such as Nextflow, WDL/Cromwell, Airflow, Prefect and Dagster
- Good understanding of cloud environments (ideally Azure), distributed computing and scaling workflows and pipelines
- Understanding of common data transformation and storage formats, e.g. Apache Parquet
- Awareness of data standards such as GA4GH ( https://www.ga4gh.org/ ) and FAIR ( https://www.go-fair.org/fair-principles/ )
Responsibilities
- Support the build of data pipelines from data providers to our primary data store and Trusted Research Environment
- Produce logic for data transformation steps as code, which meets the requirements for our end users and builds well-curated, accessible and quality-controlled data for analysis
- Developing prototypes for pipelines for complex transformations drawing on existing workflows developed in industry and academia
- Keep abreast of best practices in data engineering across industry, research and Government and facilitating the adoption of standards
- Providing technical input into the upstream parts of the data pipeline, including the specification and transfer of data from data providers
- Routine ad-hoc data curation activities requiring hands-on development of bespoke ETL cleaning scripts using languages such as Python
- Working with researchers to understand the data requirements and work with them to deliver the data needed for their projects
Preferred Qualifications
Exposure of genotyping and imputation is highly advantageous
Benefits
- Competitive base salary
- Generous Pension Scheme – We invest in your future with employer contributions of up to 12%
- 30 Days Holiday + Bank Holidays – Enjoy a generous holiday allowance with the flexibility to take bank holidays when it suits you
- Enhanced Parental Leave – Supporting you during life’s biggest moments
- Career Growth & Development – £500 per year to spend on Learnerbly, our learning platform, plus regular appraisals and development opportunities
- Cycle to Work Scheme – Save 25-39% on a new bike and accessories through salary sacrifice
- Home & Tech Savings – Get up to 8% off on IKEA and Currys products, spreading the cost over 12 months through salary sacrifice
- ��1,000 Employee Referral Bonus – Know someone amazing? Get rewarded for bringing them on board!
- Wellbeing Support – Access to Mental Health First Aiders, plus 24/7 online GP services and an Employee Assistance Programme for you and your family
- A Great Place to Work – We have a lovely Central London office in Holborn, and offer flexible and remote working arrangements
Share this job:
Similar Remote Jobs
