Site Reliability Engineer - Air Platform Team

NVIDIA

10/15/2025

Durham, NC

Full-time

Salary: $148,000 - $287,500 a year


Job Description

NVIDIA is looking for a highly motivated Site Reliability Engineer (SRE) to join the team behind NVIDIA AIR, the Digital Twin for Data Center Simulation web application.

Requirements

  • BS degree in Computer Science, Software Engineering, or a related field (or equivalent experience)
  • 5+ years of experience in a Site Reliability, DevOps, or Systems Engineering role
  • Strong automation and scripting skills with Ansible, Python, and shell
  • Experience in IaaS environments, including deploying, configuring, and administering Linux-based bare metal servers
  • Deep experience in infrastructure engineering, focused on managing and monitoring a highly available production infrastructure
  • Skilled in observability practices, using Prometheus, Grafana, ELK/EFK, and integrated alerting systems
  • Solid grasp of Linux internals and core networking concepts including NAT, DNS, DHCP, routing, and firewall configuration with iptables or nftables
  • Experience with modern deployment architecture for non-disruptive cloud operations, including blue-green and canary rollouts
  • Proficiency in Kubernetes, Docker, QEMU, and Libvirt

Responsibilities

  • Design, deploy, and manage IaaS platforms with a focus on high availability and performance
  • Automate infrastructure operations using tools like Terraform, Ansible, and Python
  • Focus on efficiency by automating repetitive workflows
  • Develop monitoring and observability tooling to detect and prevent outages using Prometheus, Grafana, ELK, etc.
  • Deploy and troubleshoot non-disruptive cloud operations with an emphasis on secure production infrastructure
  • Manage deployments and upgrades for operating systems, Kubernetes (k8s) clusters, and other orchestration tools
  • Provide day-to-day support for engineering activities with CI/CD tools like Git and Jenkins
  • Implement and enforce best practices around infrastructure security, access control, and operational efficiency

Benefits

  • Employees at NVIDIA are often offered comprehensive, day-one benefits, including medical, dental, and vision coverage with HSA support, life and disability insurance, an Employee Assistance Program, and a 401(k) with auto-enrollment. Many roles also have generous time off and holidays, donation matching (up to $10,000), and a wide menu of extras like FSAs, commuter benefits, legal and identity-theft protection, pet insurance, and wellness discounts. Optional programs can include student-loan and home-purchase support, plus family care resources and expert medical services.

© 2024 H1BConnect. All rights reserved.
