Configuration / Design/Observability/PowerShell / UI Path Senior Associate - Operate
pwc
Job Description
- Respond effectively to the diverse perspectives, needs, and feelings of others.
- Use a broad range of tools, methodologies and techniques to generate new ideas and solve problems.
- Use critical thinking to break down complex concepts.
- Understand the broader objectives of your project or role and how your work fits into the overall strategy.
- Develop a deeper understanding of the business context and how it is changing.
- Use reflection to develop self awareness, enhance strengths and address development areas.
- Interpret data to inform insights and recommendations.
- Uphold and reinforce professional and technical standards (e.g. refer to specific PwC tax and audit guidance), the Firm's code of conduct, and independence requirements.
# Job Description: SRE Senior Associate (6-9 Years)
**Location: Bangalore and Hyderabad
**Work Schedule:** 24/7 Support (Shift-based) with 5 days’ work from office
**Experience:** 5-9 Years
**Department:** SRE Automation
Key Responsibilities:
-
Strong understanding of SRE practices including SLIs/SLOs, error budgets, service health, and operational KPIs
-
Ability to automate operational tasks using Python, Shell, PowerShell, Go, or similar languages
-
Experience improving alerting systems, reducing noise, and refining observability instrumentation
-
Proficiency with cloud platforms and core services (compute, storage, networking, serverless)
-
Experience executing root-cause analysis and problem management
-
Ability to lead incident response and coordinate cross-team troubleshooting
-
Experience identifying systemic reliability gaps and proposing engineering solutions
-
Ability to design performance tests, validate reliability risks, and assess scalability
-
Strong communication skills for partnering with development, operations, and leadership
-
Leads tuning of monitoring rules, dashboards, and reliability metrics;
-
Leads development of automation to reduce operational toil and manual interventions;
-
Leads incident response actions and service stabilization procedures;
-
Leads post-incident reviews and contributes to long-term fixes;
-
Leads resilience initiatives such as chaos testing and failover drills;
-
Leads capacity forecasting and risk identification;
-
Leads refinement of operational standards, documentation, and runbooks;
-
Leads collaboration with product and engineering teams to embed reliability requirements.
Preferred:
-
AWS Solutions Architect Associate; Azure Administrator; Kubernetes CKA; Terraform Associate; ITIL Foundation, Observability certifications, Scripting and Coding Certifications will be great as well.
-
Familiarity with CMDB, Configuration Repositories, and ITSM platforms (ServiceNow, BMC Remedy, etc.).
-
5 days working from office with shift flexibility to support 24/7 client environment.
-
Collaborative and dynamic cloud-focused team.
-
Opportunities for continuous learning and professional development.