Remote Lead Data Engineer
StrongDM
π΅ $190k-$230k
πRemote - United States
Please let StrongDM know you found this job on JobsCollider. Thanks! π
Job highlights
Summary
Join StrongDM as a Principal Data Engineer to design and implement data architectures that support diverse use cases, AI/ML to business intelligence (BI).
Requirements
- Strong knowledge of big data processing frameworks and data streaming technologies
- Experience collaborating with AI/ML teams, building data pipelines that feed AI models, and ensuring data readiness for machine learning workflows
- Proven experience in architecting and building data lakes on cloud platforms (AWS, Azure, GCP)
- In-depth knowledge of Apache Iceberg, Apache Parquet, and other open standards for efficient data storage and query optimization
- Expertise in using compute engines such as Apache Spark, Dremio, Presto, or similar, with hands-on experience in optimizing them for business intelligence and AI workloads
- Proven track record of leading large-scale data engineering projects and mentoring teams
- Proficiency in languages such as Python, Java, or Scala, and SQL for querying and managing large datasets
- Previous experience working directly with AI or machine learning teams preferred
- A deep understanding of distributed systems and the challenges of scaling data infrastructure in large, dynamic environments preferred
- Familiarity with modern data warehousing solutions such as Snowflake or Redshift preferred
Responsibilities
- Design and Architect Cloud Data Lakes
- Implement and manage tabular formats like Apache Iceberg, Parquet, and other open standards to efficiently store and process large datasets
- Architect and build large-scale, highly available data platforms that support real-time analytics, reporting, and AI workloads
- Leverage various compute engines (e.g., Apache Spark, Dremio, Presto, Trino) to support complex business intelligence and AI use cases, optimizing performance and cost-efficiency
- Work closely with AI and machine learning teams to design data pipelines that enable AI model training, deployment, and real-time inference
- Establish best practices for data governance, ensuring data quality, security, and compliance with industry regulations
- Provide technical leadership to data engineering teams and mentor junior engineers, fostering a culture of continuous learning and innovation
Benefits
- Medical, dental, and vision insurance (free to employees and dependents)
- 401K, HSA, FSA, short/long-term disability coverage, life insurance
- 6 weeks of combined accrued vacation + sick time
- Volunteer days + standard holidays
- 24 weeks paid parental leave for everyone + 1 month transition time back + childcare stipend for first year
- Generous monthly and annual stipend for internet + home office
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
- π°$160k-$180kπWorldwide
- πWorldwide
- πRomania
- πSpain
- π°$190k-$245kπWorldwide
- πIndia
- πIndia
- πIndia
- πIndia
Please let StrongDM know you found this job on JobsCollider. Thanks! π