Principal Software Engineer - Compute Infrastructure

NVIDIA

Location

Remote, Santa Clara, CA

Type

Full-time

Posted

5/14/2026

Compensation

$248,000 - $391,000 per year

Undergraduate with 5+ Years of Experience

Approval 99.2% · Filings 1,781 · New hires 873 · 👑 Elite Sponsor · FY 2025

Job description

NVIDIA is seeking a highly skilled Principal Software Engineer to join its team focused on driving efficiency and optimizing the performance of its infrastructure. This role involves leading the architectural vision for a global platform and operationalizing internal AI inference systems. The successful candidate will work in a diverse environment that encourages innovation and collaboration. This position offers the opportunity to work with cutting-edge technology and influence the direction of complex projects.

Requirements

Bachelor's degree in Engineering, Computer Science, Mathematics, or related field, or equivalent experience.
15+ years of proven experience in compute platform engineering, site reliability, or systems architecture with a heavy focus on automation at massive scale.
Deep expertise in Kubernetes architecture and designing/deploying virtualization architectures, specifically operating VMs inside K8s (KubeVirt, OpenShift).
In-depth knowledge of hardware technologies (GPUs, high-speed backplane networking) with a track record of mitigating hardware-level failures, silent data corruption, and anomalies in large-scale environments.
Experience running large global environments spanning bare metal, virtualized infrastructure, and cloud with a unified GitOps posture (ArgoCD or similar).
Proficiency in programming languages such as Go and/or Python, alongside expert-level infrastructure-as-code development (Terraform, Config Management).
Strong leadership skills with the ability to influence technical direction across highly autonomous teams without relying on top-down mandates.

Responsibilities

Lead initiatives to architect and transform the global enterprise compute platform by defining service tiers, SLAs, and automated cluster lifecycles.
Build the operational foundation for the internal AI inference platform scaling to frontier-class models.
Collect and review system data for capacity planning to navigate extreme hardware supply constraints.
Collaborate with highly autonomous NVIDIA engineering teams to drive cultural adoption of standard platforms.
Evaluate existing application architectures and drive the migration of massive legacy workloads into modern Kubernetes orchestration.

Benefits

Employees at NVIDIA are often offered comprehensive, day-one benefits—including medical, dental, and vision coverage with HSA support, life and disability insurance, an Employee Assistance Program, and a 401(k) with auto-enrollment. Many roles also have generous time off and holidays, donation matching (up to $10,000), and a wide menu of extras like FSAs, commuter benefits, legal and identity-theft protection, pet insurance, and wellness discounts. Optional programs can include student-loan and home-purchase support, plus family care resources and expert medical services.
