Are you passionate about digital media, entertainment, and software services? Do you like big challenges and working within a highly motivated team environment? Are you keen on Observability and Reliability principles?
Candidates for the Site Reliability Engineer role in our AIOps group will be responsible for operational management & application support.
As a Site Reliability Engineer for AIOps, you will:
- Design, integrate, and provide full-stack lifecycle support for applications supporting our AIOps platform
- Own CI/CD initiatives, while working with other technical leads to define and maintain compliance with organizational standards
- Work closely with DevOps teams, customers, and infrastructure partners to identify & understand key system health/performance metrics, and develop monitoring approaches in support of service level objectives
- Participate in incident root-cause analysis and assist in remediation and design efforts to improve reliability and prevent future failure scenarios
- Advance our AIOps offering through measurement & analysis of complex monitoring and alerting patterns, and provide clear guidance to collaborators on which tool/which pattern/which alert ...
NBCUniversal
We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to life through our theme parks and consumer experiences.