Senior Data Engineer

League
Summary
Join League, a leading platform technology company in healthcare, as a Data Engineer. Design, build, and optimize scalable data pipelines using various distributed computing platforms. Maximize the effectiveness of microservices within the data engineering ecosystem. Revolutionize the Data Platform’s DevOps process by improving infrastructure as code and CI/CD capabilities. Drive architectural choices and implement data platform tools for efficient data analysis and insight generation. Guide and mentor other data engineers, promoting best practices. Translate business needs into data requirements using agile development practices. Analyze, tune, and support League’s data infrastructure to ensure data quality, performance, integrity, and security. Improve and troubleshoot customer-facing pipelines for both real-time and batch processing. The position offers the option to work from home and requires less than 1% travel to Toronto, Canada.
Requirements
- Experience performing root cause analysis of production issues , performance tuning and optimization
- Experience in healthcare datasets like EMR and Claims and interoperability standards like FHIR
- Experience in coding scripts in python and SQL, as well as microservices in python; Experience in HL7’s FHIR framework, a data framework used for healthcare interoperability
- Experience in implementing deployments, error handling and monitoring for scalable pipelines
- Experience in a cloud Data Stack such as GCP, AWS or Azure; Experience supporting reporting tools as a platform such as Looker, Tableau, or Power BI; Experience ingesting, storing, and analyzing data in SQL, with tools such as Snowflake, BigQuery, MySQL, or PostgreSQL
- Experience designing, developing, and deploying server-less functions for deployment of microservices using Python as a language for tools such as AWS Lambda functions, Azure Functions, or GCP Cloud Functions
- Experience using, developing, and maintaining task orchestration systems
- Requires a Bachelor’s degree or foreign equivalent in Computer Science, Data Science, Software Engineering or related field, and At least four years of experience as a Data Engineer or similar occupation in the Healthcare Tech Industry
Responsibilities
- Design, building, and optimizing scalable stream and batch pipelines both in real time and through gathering large amounts of data for bulk processing/assessment using a variety of distributed computing platforms and tools such as: Airflow/Cloud Composer and Apache Beam/Dataflow
- Maximizing the effectiveness of the microservices that serve critical backend capabilities, including orchestrating the movement and transformation of data within League’s data engineering ecosystem by leveraging services such as Cloud Functions and Cloud Run
- Revolutionizing our Data Platform’s DevOps process, including modifying League’s infrastructure as code approach for the purposes of increasing the ease in which data configurations are edited and distributed, maximizing CI/CD capabilities, and incorporating other software development processes into our data infrastructure strategies for improving platform performance using tools such as Terraform, Github Actions, and Cloud Build
- Driving architectural choices and implementing data platform tools in order to devise seamless and efficient procedures for analyzing data, deriving insights, and translating insights into actionable, data-driven business decisions
- Guiding and mentoring data engineers while educating and encouraging League’s adoption and application of software best practices, which will include designing coding that is functional, efficient, and flexible for continuous modifications as required
- Translating business and customer needs into data and analytics requirements with attention to detail through use of sound agile development practices that clearly define particular points of prioritization to ensure that the goals and requirements of League’s ever-developing business models are met and exceeded
- Analyzing, tuning, troubleshooting, and supporting League’s data infrastructure for the purposes of improving the quality, performance, integrity, and security of the data used in the development and delivery of data products; and Improving, troubleshooting, and administering our customer-facing pipelines, including data ingestion pipelines as well as data sharing pipelines in both real-time and batch processing
Benefits
Option to work from home is available