Engineer 3, Engineering Operations
Comcast
Job Description
Core Responsibilities:
Candidate must have a network domain background.
- Acts as an effective Incident Manager during outages, with the respect of management and engineering fix agents.
- Manages incident bridges and incident communications effectively, driving each incident toward mitigation.
- Identifies observability gaps and works with DevOps on alert configuration and fine-tuning, alert onboarding, and suppression of alert noise.
- Acts as a thought leader, technical expert, and first point of reference for leading practices in reliability engineering.
- Develops strong technical knowledge of the applications and services.
- Partners with Engineering and Deployment peers to drive rigorous root cause investigations and action items that improve system availability and resiliency.
- Collaborates with development teams to understand application changes and identify potential issues that may arise, create implementation and back-out plans, and oversee the implementation during the scheduled maintenance window.
- Analyzes data and metrics, identifies problem areas and provides actionable insight to management.
- Provides input to engineering and vendors on defects and required enhancements. Attains all relevant industry standard technical certifications.
- Performs complex and routine maintenance tests for designated areas of engineering. Identifies and isolates issues. Ensures that all maintenance is properly validated to minimize subscriber impact to (ideally) zero.
- Ability to work in a fast-paced 24x7 technical operations environment.
- Compulsory adherence to the Return to Office policy. Must be able to work variable schedules and days as necessary.
Technical Skill Specification:
- 2-5 years of hands-on experience in Incident and Problem Management
- Familiarity with Site Reliability Engineering (SRE) principles
- Exceptional written and oral communication skills, with the ability to articulate complex emergent situations clearly to all levels of the organization
- Hands-on experience configuring alerts using platforms such as Grafana, Prometheus, and Kibana
- Familiarity with technologies such as IP networking, databases, application architectures, load balancers, APIs, microservices, web services (SOAP XML), and Python
- Experience working in public cloud environments (AWS, Azure) is helpful
- Highly collaborative with a strong work ethic.
- Embraces challenges, displays strong creative flexibility.
- Familiarity with ITIL and/or eTOM frameworks
- Bachelor’s degree or equivalent
- Experience in the telco/cable or video industry would be an added advantage