Big Data Engineer

H1

💵 $100k-$120k
📍 Remote - United States

Summary

Join H1, a company dedicated to providing accessible healthcare information globally, as a Data Engineer. You will play a crucial role in building and enhancing scalable data pipelines, transforming raw client data into actionable insights. This involves data ingestion, enrichment, and integration across various platforms. You will collaborate with a team, ensuring data accuracy, scalability, and performance. The ideal candidate possesses strong data engineering skills, experience with large datasets, and proficiency in specific technologies. H1 offers a competitive salary, stock options, and a comprehensive benefits package.

Requirements

  • 3+ years of experience in data engineering, specializing in building scalable data pipelines and enrichment processes, with a track record of working with large datasets, including ingestion, transformation, and optimization
  • Proficiency in Spark, Python, and SQL for building scalable data processing pipelines
  • Hands-on experience with Kubernetes for container orchestration and deployment
  • Strong background in AWS, including services such as S3, Lambda, ECS, and RDS for data infrastructure
  • Experience with EMR and Databricks to optimize large-scale data workflows
  • Understanding of LLM usage in production

Responsibilities

  • Develop and enhance processes to enrich raw or partially processed data using established business logic, ensuring it is accurate and ready for product use
  • Build and maintain scalable and reliable data pipelines that support the team's enrichment workflows
  • Integrate enriched data from core platforms (e.g., CT platform) into broader data systems, applying necessary transformations and aligning with business requirements
  • Contribute to code reviews, holding a high bar for quality and aligning with organizational engineering guidelines
  • Follow development workflows, including coding, testing, deployment, and monitoring, to ensure quality and efficiency
  • Work collaboratively with team members and escalate issues appropriately when challenges arise
  • Contribute to the understanding and execution of tasks with a strong focus on accuracy, scalability, and performance

Preferred Qualifications

  • Experience developing and optimizing data workflows, applying business logic for data enrichment, and addressing technical challenges with creative solutions
  • Strong knowledge of building and scaling data infrastructure, including integration with core platforms
  • Experience working with data quality challenges and implementing validation mechanisms
  • Self-motivated with the ability to manage tasks and collaborate effectively within a team
  • Ability to align work with broader organizational goals and contribute to strategic initiatives
  • Proactive in identifying potential risks and helping implement solutions early in the project lifecycle
  • Eager to learn, grow, and contribute to a collaborative, high-performing engineering team

Benefits

  • Full suite of health insurance options, in addition to generous paid time off
  • Pre-planned company-wide wellness holidays
  • Retirement options
  • Health & charitable donation stipends
  • Impactful Business Resource Groups
  • Flexible work hours & the opportunity to work from anywhere
  • The opportunity to work with leading biotech and life sciences companies in an innovative industry with a mission to improve healthcare around the globe
