📍Brazil
Data Engineer

Obol Labs Inc.
📍Remote - Portugal
Please let Obol Labs Inc. know you found this job on JobsCollider. Thanks! 🙏
Summary
Join DV Labs, a venture-backed, remote-first team, as a Data Engineer to design and build the data platform for their next-generation distributed validators. You will ingest and model Beacon-chain data at multi-TB scale, develop scalable ETL/ELT pipelines, implement columnar schemas, and expose datasets to internal stakeholders. Collaboration with Protocol & DevOps teams is key to surfacing validator health and protocol anomalies. You will own data quality and contribute to open-source tooling. The ideal candidate has 2+ years of experience in data engineering, expertise with ClickHouse and Apache Spark, and a deep understanding of the Ethereum consensus layer.
Requirements
- 2+ years of professional experience in data engineering or high-performance backend roles
- Production expertise with ClickHouse and Apache Spark on multi-terabyte datasets
- Hands-on experience operating MongoDB for semi-structured/operational workloads
- Proficiency in Python (pandas/PySpark) and/or Scala ; solid Git and CI/CD habits ( GitHub Actions/Workflows or similar)
- Deep understanding of the Ethereum consensus layer (Beacon chain architecture, validator lifecycle, slashing conditions, client diversity—Lighthouse, Prysm, Teku, etc.)
- Comfortable working in a remote, asynchronous startup environment with high ownership and autonomy
Responsibilities
- Ingest & model Beacon-chain data — blocks, attestations, sync-committee aggregates, deposits, and slashings—into ClickHouse and MongoDB at multi-TB scale
- Develop scalable ETL/ELT pipelines in Apache Spark (PySpark/Scala) orchestrated via GitHub Workflows and containerized CI/CD
- Implement columnar schemas & partition strategies to achieve sub-second analytical queries and reduce storage footprint
- Expose clean, version-controlled datasets & metrics to internal stakeholders through APIs, dashboards, and notebooks
- Collaborate with Protocol & DevOps teams to surface validator health, slash-risk events, and protocol-level anomalies in real time
- Own data quality, lineage, testing, and documentation across the stack; champion best practices and continuous improvement
- Contribute to open-source tooling around consensus-layer data, distributed-validator monitoring, and Ethereum research
Preferred Qualifications
- Familiarity with Ethereum execution-layer JSON-RPC, MEV-Boost, and block-building economics
- Experience operating distributed systems on Kubernetes , Nomad , or similar orchestrators
- Fluency in Python
- Exposure to data-observability stacks (dbt, Great Expectations, Dagster) and time-series monitoring (Prometheus/Grafana)
- Prior contributions to web3 or other open-source projects
Benefits
- Competitive salary in dollars
- Full remote company - Work from wherever you want
- Possibility to attend to relevant Conferences
- 2 Recharge weeks at the end of the year
- Equipment budget
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
📍France, Spain
📍India
📍India
📍Lithuania
📍Argentina
💰$175k-$210k
📍United States
💰$225k-$255k
📍United States
💰$120k-$180k
📍Worldwide
💰$190k-$210k
📍Worldwide