Principal Software Engineer - Compute Infrastructure
NVIDIA
Location
Remote, Santa Clara, CA
Type
Full-time
Posted
5/14/2026
Compensation
$248,000 - $391,000 per year
Undergraduate with 5+ Years of Experience
Elite Sponsor · FY 2025 · Approval 99.2% · Filings 1,781 · New hires 873
Job description
NVIDIA is seeking a highly skilled Principal Software Engineer to join its dynamic team focused on driving efficiency and optimizing the performance of its infrastructure. This role involves leading the architectural vision for a global platform and operationalizing internal AI inference systems. The successful candidate will work in a diverse environment that encourages innovation and collaboration. This position offers the opportunity to work with cutting-edge technology and influence the direction of complex projects.
Requirements
- Bachelor's degree in Engineering, Computer Science, Mathematics, or related field, or equivalent experience.
- 15+ years of proven experience in compute platform engineering, site reliability, or systems architecture with a heavy focus on automation at massive scale.
- Deep expertise in Kubernetes architecture and designing/deploying virtualization architectures, specifically operating VMs inside K8s (KubeVirt, OpenShift).
- In-depth knowledge of hardware technologies (GPUs, high-speed backplane networking) with a track record of mitigating hardware-level failures, silent data corruption, and anomalies in large-scale environments.
- Experience running large global environments spanning bare metal, virtualized infrastructure, and cloud with a unified GitOps posture (ArgoCD or similar).
- Proficiency in programming languages such as Go and/or Python, alongside expert-level infrastructure-as-code development (Terraform, Config Management).
- Strong leadership skills with the ability to influence technical direction across highly autonomous teams without relying on top-down mandates.
Responsibilities
- Lead initiatives to architect and transform the global enterprise compute platform by defining service tiers, SLAs, and automated cluster lifecycles.
- Build the operational foundation for the internal AI inference platform scaling to frontier-class models.
- Collect and review system data for capacity planning to navigate extreme hardware supply constraints.
- Collaborate with highly autonomous NVIDIA engineering teams to drive cultural adoption of standard platforms.
- Evaluate existing application architectures and drive the migration of massive legacy workloads into modern Kubernetes orchestration.
Benefits
- Employees at NVIDIA are often offered comprehensive, day-one benefits—including medical, dental, and vision coverage with HSA support, life and disability insurance, an Employee Assistance Program, and a 401(k) with auto-enrollment. Many roles also have generous time off and holidays, donation matching (up to $10,000), and a wide menu of extras like FSAs, commuter benefits, legal and identity-theft protection, pet insurance, and wellness discounts. Optional programs can include student-loan and home-purchase support, plus family care resources and expert medical services.
