A Tri-State-based entertainment organization is seeking a Platform Site Reliability Operations Lead to join its team. The Lead will serve as the primary operational authority for technical incidents and will refine the organization's monitoring tools.
***This is a Hybrid opportunity requiring the qualified professional to work onsite at least 3 days a week.***
Responsibilities:
- Acting as the primary Incident Lead during technical disruptions
- Overseeing the end-to-end incident lifecycle
- Identifying and investigating critical issues in real time
- Developing scalable Incident Management processes
- Tracking and analyzing key metrics for improvement
- Performing other duties, as needed
Qualifications:
- 5+ years of related work experience
- Bachelor’s Degree or equivalent experience
- Deep experience with Observability tools
- Strong analytical and problem-solving skills
- Exceptional communication skills
- Hands-on experience with monitoring tools
- Ability to explain complex incidents clearly
- Detail-oriented with multitasking capability
Desired Qualifications:
- Bachelor’s Degree in a computer-related field
- Experience in Agile frameworks
- Experience in Media & Entertainment



