Big Data Engineer

closed
H1

💵 $100k-$120k
📍 Remote - United States

Summary

Join H1, a company dedicated to making healthcare information accessible globally, as a Big Data Engineer. You will build and enhance scalable data pipelines that transform raw client data into actionable insights, covering data ingestion, enrichment, and integration across various platforms. You will collaborate with a team to ensure data accuracy, scalability, and performance. The ideal candidate has strong data engineering skills, experience with large datasets, and proficiency in Spark, Python, SQL, and AWS. H1 offers a competitive salary, stock options, and a comprehensive benefits package.

Requirements

  • 3+ years of experience in data engineering, specializing in building scalable data pipelines and enrichment processes, with a track record of working with large datasets, including ingestion, transformation, and optimization
  • Proficiency in Spark, Python, and SQL for building scalable data processing pipelines
  • Hands-on experience with Kubernetes for container orchestration and deployment
  • Strong background in AWS, including services such as S3, Lambda, ECS, and RDS for data infrastructure
  • Experience with EMR and Databricks to optimize large-scale data workflows
  • Understanding of LLM usage in production

Responsibilities

  • Develop and enhance processes to enrich raw or partially processed data using established business logic, ensuring it is accurate and ready for product use
  • Build and maintain scalable and reliable data pipelines that support the team's enrichment workflows
  • Integrate enriched data from core platforms (e.g., CT platform) into broader data systems, applying necessary transformations and aligning with business requirements
  • Contribute to code reviews, holding a high bar for quality and aligning with organizational engineering guidelines
  • Follow development workflows, including coding, testing, deployment, and monitoring, to ensure quality and efficiency
  • Work collaboratively with team members and escalate issues appropriately when challenges arise
  • Contribute to the understanding and execution of tasks with a strong focus on accuracy, scalability, and performance

Preferred Qualifications

  • Experience developing and optimizing data workflows, applying business logic for data enrichment, and addressing technical challenges with creative solutions
  • Strong knowledge of building and scaling data infrastructure, including integration with core platforms
  • Experience working with data quality challenges and implementing validation mechanisms
  • Self-motivated with the ability to manage tasks and collaborate effectively within a team
  • Ability to align work with broader organizational goals and contribute to strategic initiatives
  • Proactive in identifying potential risks and helping implement solutions early in the project lifecycle
  • Eager to learn, grow, and contribute to a collaborative, high-performing engineering team

Benefits

  • Full suite of health insurance options, in addition to generous paid time off
  • Pre-planned company-wide wellness holidays
  • Retirement options
  • Health & charitable donation stipends
  • Impactful Business Resource Groups
  • Flexible work hours & the opportunity to work from anywhere
  • The opportunity to work with leading biotech and life sciences companies in an innovative industry with a mission to improve healthcare around the globe
This job is filled or no longer available