Summary
Join Voltage Park as a Senior Software Engineer and build automation, tooling, and API-driven systems for AI/ML training, inference, and HPC workloads. You will design and implement systems enabling interaction with thousands of servers, storage clusters, and networks. Collaborate with cross-functional teams to drive infrastructure rollouts and improve resource lifecycle management. This fully remote position (continental US only) requires 8+ years of software or infrastructure engineering experience. No sponsorship is offered. You will work with Linux, Python, containerization, and HPC infrastructure.
Requirements
- 8+ years of professional experience in software engineering, infrastructure engineering, or related fields
- Strong experience with Linux in production environments
- Proficiency in Python or similar object-oriented programming languages
- Familiarity with containerization and orchestration concepts
- Understanding of HPC infrastructure fundamentals, bare-metal provisioning and out-of-band management
- Experience balancing pragmatic shipping with good long-term architecture
- Comfortable with navigating ambiguity
- Strong written and verbal communication skills
Responsibilities
- Design, build and maintain tools, APIs, and automation frameworks to manage physical infrastructure at scale
- Build and extend systems for server lifecycle management
- Implement observability, telemetry, and logging systems that enable visibility and insights into the health of our hardware
- Collaborate with our Network, Infrastructure Operations, Platform Engineering, and Customer Experience teams to define requirements for and build new tools
- Participate in architectural discussions to help define the direction of infrastructure engineering at Voltage Park
- Write clear design documents and technical documentation
Preferred Qualifications
- Experience with bare metal hardware troubleshooting and provisioning, extra points for working with Dell hardware
- Experience with GPU servers, both in bare metal form or under virtualization
- Deep experience with network switches, routers, and firewalls, particularly SONiC switches, Palo Alto firewalls and Juniper Networks as vendors
- Experience with VAST storage systems
Benefits
This is a fully remote position, although candidates must be based in the continental United States
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.