Senior Consultant - High Performance Computing

Burwood Group
Summary
Join Burwood Group as a High-Performance Computing (HPC) Consultant to support cutting-edge research and innovation. Leverage your expertise in managing HPC resources, preferably within a higher education setting, to drive the maturity of HPC offerings and services in the cloud. Collaborate with Cloud, Data, and Development teams to enhance HPC capabilities for large-scale distributed cloud-powered computing. Develop close customer relationships to understand their needs and provide technical solutions. This role involves designing, implementing, and improving proprietary libraries and machine-learning pipelines, identifying and resolving performance bottlenecks, and delivering cost-efficient solutions. The position offers opportunities for professional growth and development within a dynamic and supportive work environment.
Requirements
- You are motivated by solving technical problems and helping clients achieve their technological goals
- You are collaborative in nature and enjoy working closely with customers and colleagues, understanding challenges, and building technical solutions together
- You are a detail-oriented, enthusiastic learner with the ability to work well as part of a team
- You enjoy problem-solving and workflow challenges
- You have excellent presentation skills, whether that be in-person, over-the-phone or e-mail, with both technical and non-technical stakeholders
- Experience in research, academic, or enterprise environments focusing on scientific or engineering applications is highly relevant
- Experience with identifying performance bottlenecks: at a low level, induced by the OS, from software architecture, or in a distributed system
- Demonstrable skills in system administration and the architecture, deployment, and optimization of HPC clusters
- Demonstrated analytical thinking and strong problem-solving abilities with a quantitative focus
- Consistently goal-driven, with a proven ability to meet and exceed performance targets
- Effective in both independent work and collaborative team environments
- Most work is expected to be completed remotely, but you are willing to travel should client project work require on-site activities
- Expert-level Linux/Unix Administration
- Deep knowledge of at least one major workload manager (Slurm highly preferred)
- Hands-on experience with underlying technologies, including high-speed networking and parallel file systems
- Working knowledge of containerization
- Experience working with and configuring GPU environments. TPU is a plus
- Familiarity with CI/CD (GitHub Actions, or Jenkins, etc.)
- Experience with Git/GitHub
- Strong scripting skills working in Python and Bash
Responsibilities
- Supporting clients by designing, implementing, and improving proprietary libraries and machine-learning pipelines
- Leveraging modern computer architecture to improve throughput or reduce costs
- Identifying compute waste and IO bottlenecks and delivering cost-efficient solutions at scale
- Establish relationships with customers, assisting them with technical solutions and solving problems
- Delivering value through project work and exceptional operational support to customers
- Maintaining awareness and leveraging the newest technologies, primarily from Google Cloud Platform, but also AWS
Preferred Qualifications
- Public cloud (Google, Azure, AWS) certifications are a plus
- Hands-on experience supporting research by designing, implementing and improving proprietary libraries and machine-learning pipelines
Benefits
- Generous vacation policy
- 401(k) and Roth 401(k) savings plan, complete with a company match
- Health, dental, and vision insurance
- Life and accident insurance
- Short and Long-Term Disability coverage
- Mental wellness programs
- Flexible spending accounts for pre-tax healthcare and transit/parking expenses
- Learning opportunities in support of career development
- Access to financial planning expertise
Share this job:
Similar Remote Jobs
