JobsSite Reliability Engineer - Data, Cloud & Developer Experience
Site Reliability Engineer - Data, Cloud & Developer Experience
BlackstoneSite Reliability Engineer - Data, Cloud & Developer Experience
BlackstoneLocation
New York 601 Lex, NY
Type
Full-time
Posted
6/7/2026
Compensation
$140,000 - $225,000 per year
Undergraduate with 2+ Years of Experience
Approval 96.3%·Filings 80·New hires 15·
✓ Established Sponsor
·FY 2025Job description
The Site Reliability Engineering team at Blackstone focuses on enhancing the reliability of systems and services to support business needs. This role involves collaboration with development and engineering teams to implement SRE practices and principles. The position emphasizes the selection and maintenance of observability tools while ensuring operational efficiency and service reliability. Additionally, it includes responding to incidents and improving processes through automation and a blameless culture.
Requirements
- 5+ years of professional experience in Infrastructure Engineering, Software Engineering, DevOps Engineering, or Platform Engineering.
- Strong automation script writing skills and the ability to troubleshoot code in languages such as Python, C#, and Typescript.
- Proficiency with public cloud providers, particularly strong experience with AWS and preferred experience with Azure.
- Experience with configuration-as-code, infrastructure management, and CI/CD tooling such as Terraform, Puppet, and Gitlab CI.
- Hands-on experience with Docker and container schedulers including AWS ECS and EKS.
- Excellent troubleshooting skills for Linux and Windows environments, along with networking experience using observability tools like Grafana, Prometheus, and Splunk.
- Strong communication and organizational skills, with a curiosity and motivation to improve systems and processes.
Responsibilities
- Provide technical leadership in understanding and adopting SRE methodologies across the firm.
- Incorporate observability standards into code and deployment pipelines.
- Evolve the SRE standards adopted across all teams.
- Partner with colleagues to improve service reliability and operational efficiency.
- Assist developers and engineers directly and through AI assistants.
- Implement instrumentation and provide performance insights to service owners.
- Ensure monitoring and alerting reflect the reliability of services and enable effective on-call operations.
- Implement strategic observability tools while controlling maintenance and cost overhead.
- Participate in on-call rotations and respond to system incidents to ensure service availability.
- Use automation to manage, maintain, and scale SRE systems with minimal human intervention.
- Foster a blameless team culture while assisting in postmortem discussions and reporting.
Benefits
- Employees at Blackstone are often offered comprehensive and competitive benefits, including robust health and retirement plans, paid time off, and a range of quality-of-life programs such as wellbeing support and family planning resources.
Is this posting expired or inaccurate?
