JobsDeep Learning Architect, LLM Inference - New College Grad 2026
Deep Learning Architect, LLM Inference - New College Grad 2026
NVIDIADeep Learning Architect, LLM Inference - New College Grad 2026
NVIDIALocation
Santa Clara, CA
Type
Full-time
Posted
5/10/2026
Compensation
$124,000 - $241,500 per year
PhD Entry-Level
Approval 99.2%·Filings 1,781·New hires 873·
👑 Elite Sponsor
·FY 2025Job description
The Deep Learning Architect role at NVIDIA focuses on optimizing inference server performance for Large Language Models (LLMs). The Inference Benchmarking team is dedicated to maintaining NVIDIA's leadership in generative AI by characterizing workloads and collaborating with various teams. Candidates will work on deep learning software projects and develop performance benchmarking methodologies. This position requires a strong understanding of GPU hardware and software performance.
Requirements
- Master's or PhD degree in Computer Science, Computer Engineering, or related fields, or equivalent experience.
- Relevant software development experience.
- Detailed knowledge of deep learning inference serving, PyTorch programming, profiling, and compiler optimizations.
- Experience developing client server LLM applications with OpenAI API or MCP and identifying performance bottlenecks.
- Solid understanding of CPU and GPU microarchitecture and performance characteristics.
- Experience with complex software projects like frameworks, compilers, or operating systems.
- Demonstrated proficiency with the latest AI coding agents like Claude Code, Codex, and Cursor.
- Excellent written and verbal communication skills and the ability to work independently and collaboratively in a fast-paced environment.
Responsibilities
- Perform workload characterization of the latest LLMs and inference servers.
- Collaborate with the performance marketing team to create engaging content.
- Work with engineers from AI startups to establish standard benchmarking methodologies.
- Develop a constantly evolving inference performance data results website.
- Invent end-to-end profiling and analysis tools for Generative AI.
- Contribute to deep learning software projects to drive advancements in the field.
- Verify that new GPU product launches produce industry-leading performance.
- Collaborate across the company to guide the direction of inference serving.
- Use the latest coding agents and inference technology to improve team efficiency.
Benefits
- Employees at NVIDIA are often offered comprehensive, day-one benefits—including medical, dental, and vision coverage with HSA support, life and disability insurance, an Employee Assistance Program, and a 401(k) with auto-enrollment. Many roles also have generous time off and holidays, donation matching (up to $10,000), and a wide menu of extras like FSAs, commuter benefits, legal and identity-theft protection, pet insurance, and wellness discounts. Optional programs can include student-loan and home-purchase support, plus family care resources and expert medical services.
Is this posting expired or inaccurate?
