Distributed Systems Software Engineer

closed
Voltron Data Logo

Voltron Data

πŸ’΅ $160k-$220k
πŸ“Remote - United States

Summary

Join Voltron Data as a highly motivated Distributed Systems Software Engineer to contribute to their streaming data vision within a composable data ecosystem. You will design and implement components, collaborate with expert engineers, and work with open-source and proprietary technologies. Your work will directly impact the advancement of unified data processing capabilities and set new industry benchmarks. The role offers opportunities to work across various open-source stacks and contribute to the development of a next-generation data system. Voltron Data is a Series A, venture-backed startup committed to building a diverse and inclusive workplace. The company offers a competitive salary and benefits package.

Requirements

Be highly motivated

Responsibilities

  • Design meticulous components
  • Drive executions of designs to completion
  • Collaborate with product engineering experts
  • Work across open-source stacks (e.g., Ibis, Arrow, Substrait) and proprietary implementations
  • Contribute to the advancement of unified data processing capabilities
  • Drive innovation and set new industry benchmarks
  • Spend time learning about Ibis and Apache Arrow
  • Understand stream processing technologies
  • Get exposed to accelerated computing concepts
  • Learn and embrace the software development culture at Voltron Data
  • Actively engage in owning components from design to implementation in a native language like C++, Rust, or similar
  • Contribute to technical discussions and technical design documents
  • Work closely with open-source tools such as Ibis, Arrow, Kafka, DuckDb, Flink, and RAPIDS/cuDF
  • Contribute to bug-fixing efforts and propose areas for improvement
  • Develop a comprehensive set of distributed stateless streaming operators and Arrow-based networking protocols
  • Work with the team to implement an end-to-end system that significantly beats the current cost-efficiency benchmark
  • Leverage columnar memory format, GPU compression, and stream processing techniques to transform an existing proof of concept into a more crafted offering
  • Identify and build reusable components across the accelerated data movement code base
  • Continuously profile and analyze throughput in a distributed system to identify inefficiencies
  • Design solutions to solve inefficiencies
  • Incrementally evolve the streaming operators to support more complex stateful use cases in a unified compute engine
  • Mature the technology to be ready for broader adoption
  • Become one of the company's streaming go-to experts

Preferred Qualifications

  • Have built large-scale distributed systems or networking protocols that handle throughput greater than 10-100GB/s
  • Have worked with batch processing and stream processing systems
  • Be opinionated about data platform evolution and a strong thought leader
  • Have good intuitions about distributed system challenges and failure characteristics
  • Have experience planning capacity and estimations in cloud-native or on-premise data centers
  • Love building at a scale that most companies don’t even imagine
  • Have experience operating large-scale production systems and supporting your customers
  • Have experience with hardware like GPUs, DPUs, FPGAs, and associated software

Benefits

  • Work from Anywhere - Payroll and Benefits in 150+ Countries
  • Unlimited PTO
  • Medical, Dental, and Vision
  • Retirement [USA Only]
  • Home Office Budget
  • Continuing Education Budget
  • Equity awards
This job is filled or no longer available