Post your job offer for free on H1BConnect with no upfront cost!

Logo

Hire with Us
NVIDIA Corporation logo

Principal Deep Learning Software Engineer, LLM Performance

NVIDIA Corporation

7/13/2025

US, CA, Santa Clara

Full-time

Salary: $272,000 - $425,500 per year


Job Description

NVIDIA is seeking a Principal Deep Learning Software Engineer with a focus on performance optimization for deep learning inference models.

Requirements

  • Bachelors, Masters, PhD, or equivalent experience in relevant fields (Computer Engineering, Computer Science, EECS, AI)
  • At least 12 years of relevant software development experience
  • Excellent Python/C/C++ programming, software design and software engineering skills
  • Experience with a DL framework like PyTorch, JAX, TensorFlow

Responsibilities

  • Performance optimization, analysis, and tuning of LLM, VLM and GenAI models for DL inference, serving and deployment
  • Scale performance of LLM models across different architectures and types of NVIDIA accelerators
  • Contribute features and code to NVIDIA/OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton
  • Work with cross-collaborative teams to develop innovative solutions

Benefits

  • Multiple relocation packages
  • Two weeklong shutdowns (mid-summer and year-end) in the US (in addition to PTO)
  • 8-week parental leave
  • 9 Employee Resource Groups
  • Annual bonus offering
  • Flexible work arrangements
  • Up to 6% 401K matching
Logo

© 2024 H1BConnect. All rights reserved.

Check out our sister site LatamDev for tech jobs in Latin America! 🌎