Senior Data Engineer

Sunscrapers
Summary
Join Sunscrapers, a technology consultancy empowering finance and healthcare leaders, to build a new holistic data platform for a US-based healthcare company. You will contribute to developing robust data pipelines and scalable architectures using AWS services, Airflow, and dbt. The ideal candidate is well-organized, eager to learn, and thrives in a collaborative environment. Responsibilities include building and orchestrating data flows, integrating third-party systems, modeling datasets, designing data transformations for machine learning, and integrating data with machine learning model training and deployment. The role requires at least 5 years of data engineering experience, a relevant degree, excellent English, AWS expertise, and experience with various tools and technologies. Sunscrapers offers flexible working hours, remote work possibilities, a comfortable office, and various benefits.
Requirements
- At least 5 years of professional experience as a data engineer
- Undergraduate or graduate degree in Computer Science, Engineering, Mathematics, or similar
- Excellent command of spoken and written English, at least C1
- Expertise in the AWS stack
- Experience with infrastructure-as-code tools, like Terraform
- Strong professional experience with Python and SQL
- Experience in building data pipelines with Airflow (MWAA would be a plus)
- Hands-on experience using and managing a data warehouse such as Snowflake or Redshift
- DevOps skills to automate deployment and streamline development
- Creative problem-solving skills
Responsibilities
- Building and orchestrating data flows for fetching, aggregating, and modeling data using batch pipelines
- Integrating third-party systems and external data sources into the data warehouse, as well as reverse ETL into third-party systems
- Modeling datasets and schemas for consistency and easy access
- Designing and implementing data transformations for machine learning models
- Integrating data with machine learning model training, re-training, and deployment
Preferred Qualifications
- Hands-on experience with dbt
- Experience with AWS DMS, or other data migration or CDC services
- Strong understanding of various data modeling techniques like Kimball Star Schema
- Experience with production-grade machine learning projects (data prep, building, training, deployment, re-training), especially using AWS SageMaker
- Great analytical skills and attention to detail - asking questions and proactively searching for answers
- Good understanding of Docker
- Great customer service and troubleshooting skills
Benefits
- Flexible working hours and the option to work remotely
- Comfortable office in a penthouse in central Warsaw equipped with all the necessary tools to conquer the universe (MacBook, external screen, ergonomic chairs)
- Fully equipped kitchen with fruit, hot and cold drinks
- Multisport card and private medical care
- Culture of good feedback: evaluation meetings, mentoring