Post your job offer for free on H1BConnect with no upfront cost!

Logo

Hire with Us
NVIDIA Corporation logo

Senior Deep Learning Software Engineer, LLM Performance

NVIDIA Corporation

8/2/2025

US, CA, Santa Clara

Full-time

Salary: $184,000 - $287,500 per year


Job Description

NVIDIA is seeking a Senior Deep Learning Software Engineer passionate about analyzing and improving the performance of LLM inference. Join a team that specializes in developing GPU-accelerated deep learning software and collaborate with the deep learning community to implement the latest algorithms.

Requirements

  • Bachelors, Masters, PhD, or equivalent experience in Computer Engineering, Computer Science, EECS, AI
  • At least 8 years of relevant software development experience
  • Excellent Python/C/C++ programming, software design, and software engineering skills
  • Experience with a DL framework like PyTorch, JAX, TensorFlow

Responsibilities

  • Performance optimization, analysis, and tuning of LLM, VLM, and GenAI models for DL inference, serving, and deployment
  • Scale performance of LLM models across different architectures and NVIDIA accelerators types
  • Contribute features and code to NVIDIA/OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton
  • Work with cross-collaborative teams to develop innovative solutions

Benefits

  • Multiple relocation packages
  • Two weeklong shutdowns (mid-summer and year-end) in the US (in addition to PTO)
  • 8-week parental leave
  • 9 Employee Resource Groups
  • Annual bonus offering
  • Flexible work arrangements
  • Up to 6% 401K matching
Logo

© 2024 H1BConnect. All rights reserved.

Check out our sister site LatamDev for tech jobs in Latin America! 🌎