JobsSenior Systems Software Engineer, Observability and Telemetry Platform
Senior Systems Software Engineer, Observability and Telemetry Platform
NVIDIASenior Systems Software Engineer, Observability and Telemetry Platform
NVIDIALocation
remote, Santa Clara, CA
Type
Full-time
Posted
6/25/2026
Compensation
$184,000 - $356,500 per year
Undergraduate with 5+ Years of Experience
Approval 99.2%·Filings 1,781·New hires 873·
👑 Elite Sponsor
·FY 2025Job description
The Senior Systems Software Engineer (SRE) at NVIDIA is responsible for designing, building, and maintaining large-scale production systems with a focus on efficiency and availability. This role requires a deep understanding of various systems, networking, coding, and cloud technologies. The engineer will ensure the reliability and uptime of GPU cloud services while enabling developers to make system changes. The position emphasizes automation, performance tuning, and the continuous improvement of production systems.
Requirements
- BS degree in Computer Science or a related technical field involving coding, or equivalent experience.
- 8+ years of experience with infrastructure automation and distributed systems design.
- 5+ years of experience delivering foundational infrastructure and observability platforms.
- Experience in one or more programming languages such as Python, Go, Perl, or Ruby.
- In-depth knowledge of Linux, networking, and containers.
Responsibilities
- Design, implement, and support operational and reliability aspects of a large-scale observability and telemetry collection platform.
- Engage in and improve the entire lifecycle of services from inception and design through deployment, operation, and refinement.
- Support services before they go live through system design consulting, developing software tools, and capacity management.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through automation and push for changes that improve reliability and velocity.
- Practice sustainable incident response and conduct blameless postmortems.
- Participate in an on-call rotation to support production systems.
Benefits
- Employees at NVIDIA are often offered comprehensive, day-one benefits—including medical, dental, and vision coverage with HSA support, life and disability insurance, an Employee Assistance Program, and a 401(k) with auto-enrollment. Many roles also have generous time off and holidays, donation matching (up to $10,000), and a wide menu of extras like FSAs, commuter benefits, legal and identity-theft protection, pet insurance, and wellness discounts. Optional programs can include student-loan and home-purchase support, plus family care resources and expert medical services.
Is this posting expired or inaccurate?
