JobsSenior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles
NVIDIASenior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles
NVIDIALocation
Santa Clara, CA
Type
Full-time
Posted
5/10/2026
Compensation
$184,000 - $356,500 per year
Undergraduate with 5+ Years of Experience
Approval 99.2%·Filings 1,781·New hires 873·
👑 Elite Sponsor
·FY 2025Job description
NVIDIA is seeking a high-caliber Deep Learning Engineer to work on cutting-edge multimodal architectures for autonomous vehicles. The role focuses on optimizing state-of-the-art algorithms for real-time robotic execution, ensuring models can operate within strict latency and safety constraints. The engineer will collaborate with various teams to translate innovations into product solutions while leveraging advanced techniques in model optimization and performance scaling. This position is critical in advancing NVIDIA's mission to make self-driving vehicles a reality.
Requirements
- PhD with 4+ years, MS with 6+ years, or BS with 8+ years of relevant experience in Computer Science, Computer Engineering, or a related technical field.
- Expert-level proficiency in PyTorch, JAX, or similar machine learning frameworks.
- Sophisticated proficiency with modern LLM/VLM inference stacks, such as vLLM, TensorRT-LLM and SGLang.
- A proven track record of training, deploying, or optimizing large-scale DL models in production environments.
- Deep familiarity with NVIDIA’s deep learning SDKs, specifically TensorRT and CUDA.
- Strong understanding of GPU architecture, the compilation stack, and the ability to debug end-to-end performance across the hardware/software boundary.
Responsibilities
- Develop SOTA model optimization techniques to boost E2E model performance for production deployments.
- Implement advanced compression techniques to minimize model footprints without compromising safety-critical accuracy.
- Design high-performance optimization strategies for inference, including automated model sharding and efficient attention kernels.
- Conduct deep, layer-by-layer model profiling to identify compute and memory bottlenecks.
- Leverage the PyTorch ecosystem to extract standardized model graph representations and automate deployment pipelines.
- Scale DL model performance across diverse NVIDIA edge architectures.
- Architect the software interface to integrate and interact with large-scale models within a high-performance C++ production environment.
- Partner with research, TensorRT, and Cosmos teams to translate breakthrough innovations into shipping product solutions.
Benefits
- Employees at NVIDIA are often offered comprehensive, day-one benefits—including medical, dental, and vision coverage with HSA support, life and disability insurance, an Employee Assistance Program, and a 401(k) with auto-enrollment. Many roles also have generous time off and holidays, donation matching (up to $10,000), and a wide menu of extras like FSAs, commuter benefits, legal and identity-theft protection, pet insurance, and wellness discounts. Optional programs can include student-loan and home-purchase support, plus family care resources and expert medical services.
Is this posting expired or inaccurate?
