JobsCloud Site Reliability Engineer (SRE) - Data Management & Analytics Platform
Cloud Site Reliability Engineer (SRE) - Data Management & Analytics Platform
BloombergCloud Site Reliability Engineer (SRE) - Data Management & Analytics Platform
BloombergLocation
Princeton, NJ
Type
Full-time
Posted
5/5/2026
Compensation
$160,000 - $240,000 per year
Undergraduate with 5+ Years of Experience
Approval 99%·Filings 720·New hires 216·
✓ Established Sponsor
·FY 2025Job description
The Cloud Site Reliability Engineer (SRE) role at Bloomberg focuses on building and operating highly reliable, scalable data platforms in the cloud. As part of the Data Management and Analytics Platform (DMAP) SRE team, you will enhance analytics across the organization to improve products and customer engagement. The position emphasizes ensuring the availability, performance, and scalability of critical data pipelines and analytics infrastructure. You will apply automation and reliability best practices to support large-scale distributed systems.
Requirements
- 5+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles
- Strong proficiency in at least one programming or scripting language such as Python or Go
- Experience supporting production systems with a focus on reliability, scalability, and observability
- Hands-on experience operating or designing highly available distributed systems
- A Bachelor's degree in Computer Science, Engineering, Mathematics, or a related field, or equivalent professional experience
Responsibilities
- Design, build, and operate highly available, scalable, and resilient cloud infrastructure supporting large-scale data ingestion and analytics platforms
- Define, implement, and monitor SLIs/SLOs for data systems and services
- Improve observability across data pipelines and platforms through logging, metrics, tracing, and alerting
- Automate infrastructure provisioning and system management using Infrastructure as Code (IaC)
- Lead incident response efforts, perform root cause analysis (RCA), and implement post-incident improvements
- Optimize performance, reliability, and cost efficiency of cloud-based data systems
- Ensure data platform reliability, including batch and streaming pipelines, storage systems, and reporting infrastructure
- Partner with data engineers, software engineers, and stakeholders to improve system reliability and operational maturity
- Strengthen platform security through proactive monitoring, vulnerability management, and cloud security best practices
- Continuously improve CI/CD pipelines and deployment processes for data infrastructure
Benefits
- Bloomberg offers a comprehensive suite of benefits designed to support health, financial stability, and work-life balance.
Is this posting expired or inaccurate?
