Senior Systems Software Engineer, Data Center Infrastructure Management - EngOps
NVIDIA
Location
USA (Multiple Locations)
Type
Full-time
Posted
5/10/2026
Compensation
$152,000 - $287,500 per year
Undergraduate with 5+ Years of Experience
Master's with 5+ Years of Experience
Approval 99.2% · Filings 1,781 · New hires 873 · 👑 Elite Sponsor · FY 2025
Job description
NVIDIA is seeking a highly motivated EngOps Engineer to join their advanced infrastructure software team. This role focuses on maintaining high-performance, rack-scale management solutions for datacenter environments. The engineer will work closely with the Infrastructure Service software development team to support deployment and debugging of hardware and Infrastructure Manager. The ideal candidate will have a strong background in cluster management and troubleshooting.
Requirements
- BS or MS in Computer Science, Computer Engineering, Electrical Engineering, or a related field, or equivalent experience.
- 5+ years of hands-on experience in deploying and administering clusters, servers, switches, and related infrastructure.
- Experience with deployment and configuration of operating systems, computer networks, and high-performance applications.
- Proven ability to work effectively with developers and test engineers across different teams and time zones.
- Experience deploying services in Kubernetes.
- Datacenter or computer architecture experience is required.
- Background with hardware management interfaces such as Redfish and IPMI, and with baseboard management controllers (BMCs).
- Experience configuring and debugging complex data center networks.
- Experience developing scripts to automate recovery actions for management controllers and datacenter systems.
Responsibilities
- Take ownership of daily cluster failures and issues, troubleshooting them promptly to maintain optimal cluster availability and performance.
- Manage updates to the site controller management nodes.
- Manage the rollout and rollback of cluster software and firmware updates, ensuring smooth transitions and minimal disruptions.
Benefits
- Employees at NVIDIA are often offered comprehensive, day-one benefits, including medical, dental, and vision coverage with HSA support, life and disability insurance, an Employee Assistance Program, and a 401(k) with auto-enrollment. Many roles also have generous time off and holidays, donation matching (up to $10,000), and a wide menu of extras like FSAs, commuter benefits, legal and identity-theft protection, pet insurance, and wellness discounts. Optional programs can include student-loan and home-purchase support, plus family care resources and expert medical services.
