Data Engineer
Reveal HealthTech
Summary
Join our team as a Data Engineer and play a key role in designing, building, and maintaining our data infrastructure. You will collaborate with cross-functional teams to ensure data availability, integrity, and usability. The ideal candidate is passionate about building efficient data pipelines and working with cloud-based technologies. This is a remote position with US shift timings (6:30 PM - 3:30 AM IST). Responsibilities include developing and optimizing data pipelines, managing databases, integrating data, supporting analytics teams, leveraging AWS services, and monitoring data infrastructure. The role requires strong technical skills, including experience with ETL/ELT pipelines, SQL, cloud platforms, and monitoring tools. Immediate start is preferred.
Requirements
- 0-2 years of experience in a similar data profile
- Strong experience with ETL/ELT pipelines, including tools like Apache Airflow, dbt, or similar
- Proficiency in SQL and database optimization for Postgres and Redis
- Hands-on experience with cloud platforms (AWS preferred) and services like S3 and CloudFront
- Familiarity with ODBC adapters and APIs for integrating third-party systems
- Expertise in data manipulation and pipeline orchestration
- Experience with observability tools such as Datadog, Sentry, and Healthchecks.io
- Strong debugging and troubleshooting skills for data pipelines and integrations
- Excellent problem-solving skills and attention to detail
- Strong communication and collaboration abilities
- Eagerness to learn and adapt to new technologies
- Prior experience in a customer-facing role
Responsibilities
- Design, build, and maintain robust ETL/ELT pipelines to integrate data from multiple sources, including APIs, databases, and third-party integrations like Elation (Snowflake), Apero, and Zoom
- Optimize existing pipelines to improve performance and reliability
- Maintain and optimize PostgreSQL and Redis databases hosted on Aptible
- Implement best practices for database indexing, partitioning, and performance tuning
- Work with third-party integrations such as Elation's ODBC connection to Snowflake, ensuring accurate and efficient data extraction and transformation
- Support analytics teams by ensuring timely and accurate delivery of data for dashboards and reporting
- Leverage AWS services such as S3 and CloudFront for data storage and distribution
- Collaborate with the DevOps team to ensure data infrastructure aligns with application hosting and deployment processes
- Use tools such as Datadog, Sentry, and Sumo Logic to monitor ETL jobs, database performance, and data pipeline health
- Set up alerts and dashboards to proactively identify and resolve issues
- Work closely with development and operations teams to support new data-related requirements
- Document data models, pipeline designs, and operational workflows to ensure knowledge sharing and system maintainability
Preferred Qualifications
- Experience working in a healthcare technology environment
- Familiarity with Snowflake or similar modern data warehouse platforms
- Knowledge of Docker and CI/CD pipelines (e.g., CircleCI)
- Support experience in previous roles
- Familiarity with Ruby
Benefits
- Competitive compensation, benefits, and opportunities for professional growth
- Immediate start with high-visibility projects from day one