Principal Data Engineer, Machine Learning

closed
SmithRx Logo

SmithRx

πŸ“Remote - Worldwide

Summary

Join SmithRx, a rapidly growing Health-Tech company, as we transform the pharmacy benefit management sector with a cutting-edge platform. We're seeking an experienced Principal Data Engineer to lead our technology strategy for modern data platforms.

Requirements

  • BS, MS, or PhD in Computer Science, Information Systems, or a related field, with 15+ years of experience in data engineering, data science, or a similar role
  • Strong expertise in data architecture, database design, and optimization, with experience in OLTP, OLAP, NoSQL, and cloud-based data warehouses (e.g., AWS Snowflake, PostgresDB, DymanoDB, etc )
  • Proficiency in programming languages such as Python, SQL, and tools like Spark, PySpark, Airflow, DBT, Snowflake, Cortext, OpenAI, and Terraform
  • Proven experience architecting and designing AI/ML initiatives with a deep understanding of AI/ML algorithms and frameworks. Nice to have - experience in developing and deploying ML models in production
  • Ability to lead cross-functional teams, influence stakeholders, and manage complex projects in a fast-paced environment
  • Strong analytical and problem-solving skills, with the ability to handle evolving requirements and ambiguous challenges
  • Excellent communication and presentation skills, capable of conveying complex technical concepts to both technical and non-technical audiences

Responsibilities

  • Lead the design and development of robust data architectures that support scalable, secure, and efficient data pipelines
  • Architect, develop an enterprise data warehouse (EDW) and tooling that encompasses design patterns to scale and expand through integrations and automation of ETL/ELT pipelines as well as analytic layer to scale reporting and insights
  • Develop strategies across the entire AI/ML project lifecycle. This includes seamless integration with data platforms, spanning from problem definition and data preparation to model deployment and performance monitoring
  • Drive innovation by evaluating and implementing new technologies and tools that enhance our data platform’s capabilities
  • Drive excellence and standardization e.g. Optimize the performance of database systems, ensuring best practices in data security, access control, and compliance
  • Ensure data quality, lineage, and resilience across production environments including monitoring, alerting, and recovery mechanisms to ensure 99% uptime and quick resolution of data pipeline issues
  • Provide technical leadership, mentoring, and guidance to team members, establishing and enforcing best practices in data engineering and data science
  • Influence and Collaborate with cross-functional teams & leadership, including product managers, engineers, data analysts, and business stakeholders

Benefits

  • Highly competitive wellness benefits including Medical, Pharmacy, Dental, Vision, and Life Insurance and AD&D Insurance
  • Flexible Spending Benefits
  • 401(k) Retirement Savings Program
  • Short-term and long-term disability
  • Discretionary Paid Time Off
  • 12 Paid Holidays
  • Wellness Benefits
  • Commuter Benefits
  • Paid Parental Leave benefits
  • Employee Assistance Program (EAP)
  • Well-stocked kitchen in office locations
  • Professional development and training opportunities
This job is filled or no longer available