Summary
Join Dremio as a Senior Software Engineer and contribute to building the future of unified lakehouse platforms for self-service analytics and AI. As part of the Datalake team, you will focus on enhancing our query engine with a particular emphasis on the Iceberg table format and efficient scans of various file formats.
Requirements
- B.S., M.S. or PhD in Computer Science or in a related technical field
- 5+ years of software engineering experience, with a focus on database systems, query execution or related fields
- Strong programming skills in an object oriented language such as Java or C++
- Understanding of database internals, query planning, distributed systems, concurrency control, data replication, and storage systems
- Familiarity with cloud object stores, such as AWS S3, ADLS or GCS
- Experience with Apache Iceberg, Parquet, AVRO and/or Delta
Responsibilities
- Designing and implementing features for Dremioβs query engine with a focus on the Iceberg table format
- Optimizing file scan operations for various file formats, including Parquet, Avro and others
- Collaborating with members of the query planning and query execution teams to ensure seamless integration of features across the code stack
- Working with and contributing to open-source projects like Apache Iceberg, Parquet and Arrow
- Maintaining and enhancing compliance with the Iceberg table format specification
- Conducting performance tuning and benchmarking to enhance query execution speed
- Understanding and reasoning about concurrency and parallelization to deliver scalability and performance in a multithreaded and distributed environment
- Participating in code reviews and providing constructive feedback to peers