JobsAI Inference Performance Engineer - New College Grad 2026
AI Inference Performance Engineer - New College Grad 2026
NVIDIAAI Inference Performance Engineer - New College Grad 2026
NVIDIALocation
Santa Clara, CA
Type
Full-time
Posted
6/4/2026
Compensation
$124,000 - $241,500 per year
Undergraduate with 2+ Years of Experience
Approval 99.2%·Filings 1,781·New hires 873·
👑 Elite Sponsor
·FY 2025Job description
This role focuses on optimizing and benchmarking GenAI inference on NVIDIA's latest accelerators, contributing to industry performance standards across various AI workloads. The team operates at the intersection of GPU performance engineering and public accountability, working with frameworks like TensorRT-LLM, SGLang, and vLLM. The position involves driving benchmark results and collaborating with multiple teams to enhance performance across large-scale models. Candidates will play a key role in shaping next-generation inference benchmarks and establishing performance methodologies.
Requirements
- BS, MS, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or equivalent experience.
- 2+ years of relevant software development experience.
- Strong Python or C++ programming, software design, and software engineering skills.
- Expertise with a deep learning framework such as PyTorch or JAX.
- Proven track record of delivering measurable performance improvements in deep learning inference or high-performance systems.
- Deep understanding of LLM/VLM architectures and inference mechanics.
Responsibilities
- Drive industry benchmark results by owning the end-to-end optimization pipeline.
- Define and optimize cutting-edge workloads by identifying and shaping next-generation inference benchmarks.
- Architect distributed inference by designing and optimizing execution from single-GPU to rack-scale clusters.
- Establish performance methodology by applying roofline analysis and systematic profiling.
- Influence the ecosystem by contributing to open-source projects and partnering with architecture and kernel teams.
- Raise the technical bar for the team and lead cross-functional execution on benchmark timelines.
Benefits
- Employees at NVIDIA are often offered comprehensive, day-one benefits—including medical, dental, and vision coverage with HSA support, life and disability insurance, an Employee Assistance Program, and a 401(k) with auto-enrollment. Many roles also have generous time off and holidays, donation matching (up to $10,000), and a wide menu of extras like FSAs, commuter benefits, legal and identity-theft protection, pet insurance, and wellness discounts. Optional programs can include student-loan and home-purchase support, plus family care resources and expert medical services.
Is this posting expired or inaccurate?
