Remote GPU Benchmarking Engineer
DRW
π΅ $150k-$250k
πRemote - Worldwide
Please let DRW know you found this job on JobsCollider. Thanks! π
Job highlights
Summary
The job description is for a Lead GPU Benchmarking Engineer role at a DRW portfolio company. The ideal candidate should have extensive hands-on experience with GPU hardware, benchmarking tools, performance analysis, programming, and automation. The role involves designing and executing rigorous testing protocols to assess the reliability of GPUs, leading the development and implementation of comprehensive GPU benchmarking frameworks, and potentially growing into a Chief Technology Officer (CTO) role.
Requirements
- Bachelor's degree in Computer Science, Electrical Engineering, or a related field
- Proven experience in compute benchmarking, stress testing, and performance analysis
- Proficiency with benchmarking tools such as 3DMark, CUDA, OpenCL benchmarks, FurMark, MSI Kombustor, SPECviewperf, Unigine Heaven, and Superposition Benchmark
- Strong understanding of GPU clusters architectures and relevant performance metrics
- Experience with using the driver APIs to get the raw data directly
- Strong programming and scripting skills, including experience with Python, C/C++, Bash, or PowerShell
- Familiarity with cloud computing platforms and environments
- Excellent analytical, problem-solving, and communication skills
Responsibilities
- Develop and implement comprehensive test plans to evaluate GPUs under prolonged heavy workloads using stress testing software
- Monitor key metrics such as frame rates, temperature, peak and average power consumption, Peak Flops, Sustained Flops, cross-node bandwidth, and stability over time
- Benchmark GPUs using industry-standard benchmarking tools to measure and analyze performance
- Provide leadership and mentorship to a team of engineers, fostering a culture of innovation and technical excellence
- Conduct baseline tests on new GPUs to establish initial performance benchmarks
- Track performance metrics over time to detect and analyze any degradation
- Utilize GPU driver APIs to collect low-level telemetry during various operational conditions
- Compare performance metrics across different cluster configurations to identify comparative strengths and weaknesses
- Perform statistical analyses to ensure the validity and reliability of the test results
- Repeat tests to ensure consistency and accuracy of data
- Prepare detailed reports outlining test setups, methodologies, and data-driven conclusions
- Clearly communicate findings, insights, and recommendations to team members and stakeholders
- Configure, deploy, and maintain cloud infrastructure for automation, orchestration, and integration
- Utilize cloud computing resources to create scalable and efficient testing environments
- Optimize cloud platform usage for benchmarking and data analysis tasks
Preferred Qualifications
- Experience with statistical analysis tools and techniques
- Familiarity with Tensor, GPU cluster testing methodologies, and large-scale data analysis
- Demonstrated leadership experience or potential to grow into a Chief Technology Officer (CTO) role
Benefits
- Competitive salary and benefits package
- Opportunity to work with cutting-edge technology and innovative projects
- A collaborative and dynamic work environment
- Career growth and development opportunities
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
- πUnited States
- πUnited States
- π°$120k-$135kπWorldwide
- πPoland
- πUnited States, United Kingdom
- πCanada, United States
- π°$128k-$166kπUnited Kingdom
- πWorldwide
- πUnited States
Please let DRW know you found this job on JobsCollider. Thanks! π