JobsSenior Systems Software Engineer, Accelerated Kubernetes Performance and Scale - DGX Cloud
Senior Systems Software Engineer, Accelerated Kubernetes Performance and Scale - DGX Cloud
NVIDIASenior Systems Software Engineer, Accelerated Kubernetes Performance and Scale - DGX Cloud
NVIDIALocation
Santa Clara, CA, Seattle, WA
Type
Full-time
Posted
6/26/2026
Compensation
$184,000 - $356,500 per year
Undergraduate with 5+ Years of Experience
Approval 99.2%·Filings 1,781·New hires 873·
👑 Elite Sponsor
·FY 2025Job description
NVIDIA is seeking a Senior Systems Software Engineer to join the DGX Cloud organization, focusing on scaling AI infrastructure and optimizing performance for distributed systems. The role involves working with cutting-edge hardware and software to tackle complex challenges in AI workloads. The ideal candidate will have extensive experience with Kubernetes, containers, and systems performance. This position offers the opportunity to make a significant impact in a diverse and supportive environment.
Requirements
- Bachelor's or Master's degree in Engineering or equivalent experience, ideally in Electrical, Computer Engineering, or Computer Science
- 8+ years of experience in computer architecture, networking, storage systems, and accelerator-based platforms
- Expertise in Kubernetes and familiarity with the broader CNCF ecosystem
- Deep experience with large-scale, parallel, distributed accelerator systems and performance optimization of AI workloads
- Experience with performance modeling and benchmarking for large-scale systems
- Proficiency in Golang and/or Python
- Strong familiarity with the NVIDIA software stack across training and inference
- Expertise with at least one major public cloud provider (for example, AWS, Azure, GCP, or OCI)
Responsibilities
- Lead end-to-end performance and scalability analysis across the Kubernetes-based accelerated runtime stack.
- Design and contribute upstream architectural changes to the Kubernetes control plane and related projects.
- Improve container startup and cold-start latency for low-latency inference scaling on Kubernetes.
- Assess, improve, and contribute to open-source projects that enhance Kubernetes for AI workloads.
- Advance scalability and performance of confidential containers on Kubernetes.
- Use large-scale simulation infrastructure to model AI-factory deployments and validate scalability.
- Collaborate with AI researchers, developers, and customers to design automated workload tests.
- Document methods and results clearly and present findings at industry events.
Benefits
- Employees at NVIDIA are often offered comprehensive, day-one benefits—including medical, dental, and vision coverage with HSA support, life and disability insurance, an Employee Assistance Program, and a 401(k) with auto-enrollment. Many roles also have generous time off and holidays, donation matching (up to $10,000), and a wide menu of extras like FSAs, commuter benefits, legal and identity-theft protection, pet insurance, and wellness discounts. Optional programs can include student-loan and home-purchase support, plus family care resources and expert medical services.
Is this posting expired or inaccurate?
