Senior Data Engineer

Gradient AI

πŸ“Remote - Worldwide

Summary

Join Gradient AI, a revolutionary AI-powered solutions provider for Group Health and P&C insurance, as a Senior Data Engineer. You will design, build, and manage data pipelines for health insurance clients, leveraging your deep understanding of healthcare data. This fully remote role requires expertise in Airflow for ETL pipelines and a strong grasp of healthcare data privacy and security regulations. You will collaborate with data scientists, ensuring data quality and integrity. The ideal candidate possesses a BS in a quantitative discipline and 7+ years of experience with health data. Gradient AI offers a fun, team-oriented culture, generous stock options, unlimited vacation, flexible schedules, and a full benefits package.

Requirements

  • BS in Computer Science, Bioinformatics, or another quantitative discipline with 7+ years working with and interpreting health, medical, and bioinformatics data, including real-world healthcare datasets
  • Subject matter expertise (SME) in health and bioinformatics data, with a strong grasp of the complexities and challenges of processing medical and biological information
  • Proficiency in Python and SQL within a professional environment
  • Hands-on knowledge of big data tools like Apache Spark (PySpark), Databricks, Snowflake, or similar platforms
  • Skilled in using data orchestration frameworks such as Airflow, Dagster, or Prefect
  • Comfortable working within cloud computing environments, preferably AWS, along with Linux systems

Responsibilities

  • Design, build, and implement data systems to support ML and AI models for our health insurance clients, ensuring strict compliance with healthcare data privacy and security regulations (e.g., HIPAA)
  • Develop tools for extracting, processing, and profiling diverse healthcare data sources, including EHRs, medical claims, pharmacy data, and genomic data
  • Collaborate with data scientists to transform large volumes of health-related and bioinformatics data into modeling-ready formats, prioritizing data quality, integrity, and reliability in healthcare applications
  • Build and maintain infrastructure for the extraction, transformation, and loading (ETL) of data from a variety of sources using SQL, AWS, and healthcare-specific big data technologies and analytics platforms
  • Apply health and bioinformatics subject matter expertise to ensure data pipelines meet the unique requirements of health, medical, and bioinformatics data processing - including translating complex medical and biological concepts into actionable data requirements

Preferred Qualifications

  • Knowledge of healthcare data standards (e.g., FHIR, HL7) and a solid understanding of healthcare data privacy and security regulations (such as HIPAA) are highly desirable
  • Ability to work with and visualize health or medical data; exposure to the Insurtech industry is a plus

Benefits

  • A fun, team-oriented startup culture
  • Generous stock options - we all get to own a piece of what we're building
  • Unlimited vacation days
  • Flexible schedule that supports working from home
  • Full benefits package includes medical, dental, vision, 401k, paid paternal leave, and more
  • Ample opportunities to learn and take on new responsibilities
