Senior Data Engineer

closed

Arine

💵 $150k-$170k
📍 Remote - United States

Summary

Join Arine, a rapidly growing healthcare technology company, as a Senior Data Engineer. You will design and build data pipelines for AI/ML solutions, focusing on machine learning feature engineering and workflow automation to improve patient care. This role requires strong software engineering skills in Python and SQL, experience with AWS services, and a deep understanding of data pipelines for AI applications. You will collaborate with data scientists and ML engineers to optimize data preparation and ensure high-quality model inputs. Arine offers a dynamic role with unparalleled learning and growth opportunities, collaborating with experienced professionals in a mission-driven environment. The salary range is $150,000-$170,000/year.

Requirements

  • Experienced Senior Data Engineer with a focus on building data pipelines for machine learning or analytics applications
  • Strong software engineering skills: you write clean, scalable Python and SQL code and are comfortable working with AWS infrastructure
  • Understanding of ML-driven data workflows and their dependencies
  • Strong grasp of data storage, retrieval, and real-time processing pipelines for AI applications
  • Experience with feature engineering best practices
  • Strong attention to detail: you ensure data accuracy and reliability while iterating quickly
  • Ability to identify patterns and edge cases in messy, real-world data
  • Comfortable working with structured and unstructured data (text, transcriptions, clinical notes)
  • Highly collaborative: comfortable working across engineering, AI, and business teams
  • Mission-driven mindset: you care about healthcare impact as much as technical excellence
  • Strong communication and presentation skills to explain technical concepts to non-technical stakeholders
  • 5+ years of experience in data engineering
  • Expertise in Python and SQL
  • Deep experience with AWS services (S3, Lambda, Glue, Step Functions, Batch, Athena)
  • Experience working with healthcare data, particularly medical and pharmacy claims
  • Ability to pass a background check
  • Must live in and be eligible to work in the United States

Responsibilities

  • Design and build robust data pipelines to clean, join, and transform raw healthcare data into high-quality features for AI and machine learning algorithms
  • Collaborate closely with data scientists and ML engineers to optimize data preparation, ensuring high-quality model inputs
  • Process high-volume structured and unstructured data using Snowflake and AWS tools
  • Optimize data transformation pipelines for speed, accuracy, and reliability, ensuring scalability for AI-driven solutions
  • Develop scalable data pipelines for AI performance tracking, clinician efficiency, and product outcome metrics
  • Automate data extraction and reporting to make key AI and business metrics accessible across teams
  • Implement automated validation, testing, and monitoring for pipelines to ensure data integrity and consistency
  • Support the development of tools and processes to detect and flag data anomalies, ensuring consistent model input quality
  • Partner with ML engineers, product engineers, and data scientists to align data solutions with AI goals
  • Advocate for scalable, generalizable solutions that support future AI expansion while delivering immediate impact

Preferred Qualifications

  • Experience working with unstructured healthcare data such as text or transcriptions
  • Familiarity with AWS-based ML infrastructure, including SageMaker Feature Store, and feature engineering tools such as TFDV or Great Expectations
  • Experience with shell scripting and Docker for automation
  • Knowledge of healthcare regulations and compliance, including CMS rules on medication adherence (PDC calculations)

Benefits

  • The salary range for this position is $150,000-$170,000/year
  • Joining Arine offers you a dynamic role and the opportunity to contribute to the company's growth and shape its future
  • You'll have unparalleled learning and growth prospects, collaborating closely with experienced Clinicians, Engineers, Software Architects, and Digital Health Entrepreneurs
This job is filled or no longer available