Lead Data Engineer

Verdigris Logo

Verdigris

πŸ“Remote - United States

Summary

Join Verdigris, a mission-driven startup focused on sustainable energy intelligence, and help build a modern data platform supporting climate-focused outcomes. As a data engineer, you will collaborate with various teams, design and implement data architecture, and migrate existing data. You will own schema design, architect data storage, build ETL/ELT pipelines, and ensure data quality. The role involves working with large-scale, high-throughput systems and requires strong SQL and Python skills. The team operates remotely with a flexible schedule and values collaboration and high-impact delivery.

Requirements

  • Align with core working hours, 10:00AM PST to 5:00PM PST in either pacific, mountain, or central timezones
  • 5+ years of experience in data engineering with large-scale, high-throughput systems
  • Proven experience designing dimensional models and OLAP schema (fact/dimension tables)
  • Deep understanding of columnar stores and database internals (e.g., ClickHouse, Druid, StarTree, Pinot)
  • Strong SQL skills and proficiency with Python for data pipelines
  • Experience handling updates/inserts/type-2 dimensions for time-series or large-scale event stores

Responsibilities

  • Collaborate with Product Management, Understand use cases and personas, and engineer product to support a strong user experience
  • Own schema design and data modeling for energy metering and building management system (BMS) data
  • Architect and maintain cost-effective and performant next generation data storage (e.g. ClickHouse, StarTree, etc)
  • Lead data architecture decisions, including evaluating and integrating tools in our modern data stack
  • Build and manage robust, scalable ETL/ELT pipelines to ingest, transform, and serve data
  • Ensure performance and efficiency of analytical queries across large datasets
  • Develop and enforce data quality, validation, and governance standards
  • Support real-time IoT analytics and streaming pipelines
  • Owning BI tooling (e.g. Superset, Looker, Tableau, etc)
  • Contribute to building internal data tools for engineers and analysts
  • Collaborate with AI/ML teams to support model training and inference pipelines
  • Work with web and application teams to ensure real-time and batch data access needs are met
  • Manage team projects and coordinate with other technical leads
  • Mentor junior engineers and contribute to technical hiring

Preferred Qualifications

  • Experience with BMS/HVAC or Energy data is a plus
  • Experience with usage of time series and energy data used for diagnostics and efficiency
  • Experience with IoT or sensor data systems
  • Experience working in AWS Cloud
  • Experience with Postgres
  • Proficiency in orchestrating ETL workflows (e.g. Dagster, Airflow, AWS Step Functions, etc.)
  • Familiarity with stream processing tools (e.g., Kafka, Flink, Spark Streaming)
  • Exposure to machine learning feature stores or MLOps tooling
  • Experience with data observability and data cataloging tools
  • Experience managing a team or others

Benefits

  • We operate as a fully remote team with daily virtual standups and a two-week sprint cadence
  • We primarily work from 10:00am PST to 6:00pm PST

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Remote Jobs