Senior Staff AI/ML & GPU Performance Validation Engineer

AMD

Location

San Jose, CA

Type

Full-time

Posted

5/5/2026

Compensation

USD $175,700.00/Yr. – USD $251,000.00/Yr.

Undergraduate with 5+ Years of Experience
Approval 98.6% · Filings 728 · New hires 184 · Established Sponsor · FY 2025

Job description

The Senior Staff Engineer at AMD will lead test development, performance benchmarking, and validation for AI/ML frameworks, AI models, and AI agent-based systems on GPU platforms. The role requires deep expertise in GPU performance analysis and in AI/ML training and inference frameworks. The engineer will work closely with AI framework, model, compiler, OS, driver, and hardware teams to ensure the performance and correctness of AI systems. The position emphasizes collaboration, innovation, and the ability to mentor others.

Requirements

  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field, or equivalent experience
  • 12+ years of industry experience in test development, performance engineering, or systems validation
  • Deep understanding of GPU and CPU architecture and hardware components
  • Strong experience with AI/ML frameworks such as PyTorch, TensorFlow, JAX, or ONNX Runtime
  • Hands-on experience with AI model training and inference, including large-scale models
  • Proficiency in Python, C/C++, or similar languages
  • Experience building automation frameworks and integrating them into CI/CD pipelines
  • Experience working in virtualization environments such as containers and VMs

Responsibilities

  • Design, develop, and own test and validation frameworks for AI/ML frameworks, AI models, AI agents, and HPC workloads, focusing on GPU performance, correctness, and scalability
  • Lead GPU performance benchmarking for AI/ML training, inference, agent-based, and HPC workloads
  • Develop and maintain performance baselines and regression detection across multiple GPU and CPU platforms
  • Analyze and debug performance, functional, and stability issues spanning hardware, drivers, runtime, frameworks, and models
  • Build automated test pipelines covering bare-metal and virtualized environments
  • Drive CI/CD integration for large-scale AI/ML validation infrastructure
  • Collaborate closely with AI framework, model, compiler, OS, driver, and hardware teams
  • Mentor senior engineers and provide technical leadership across cross-functional teams
  • Influence long-term strategy for AI/ML performance validation and system-level testing

Benefits

  • AMD provides a competitive 'Total Rewards' package that focuses on financial growth, health, and work-life balance.
