Observability Engineer Lead
equifax
Job Description
Key Accountabilities
-
Maintain a backlog and and prioritise tasks related to Observability
-
Develop and implement the organization's observability strategy
-
Lead and manage a team of observability engineers
-
Focus on the availability, reliability, and performance of systems in production
-
Manage all SDLC environments for the tribe, including cloud platform, shared assets and tools.
-
Manage software deployments and releases for the tribe, with a view to drive down the number of failed changes and warranty issues.
-
Responsible for monitoring, scale, performance and cost optimisation
-
Stewardship and tracking of system SLI, SLO and SLA
-
Solve problems and triage complex distributed architecture service maps. On call for high severity application incidents and improving run books to improve MTTR
-
Lead availability blameless postmortem and own the call to action to remediate recurrences.
What experience you need
-
Tertiary education in Technology or Business or equivalent job experience required
-
5+ years experience with monitoring tools Google/AWS Cloud Monitoring, Appdynamics, DataDog, Splunk , Elastic Search or similar
-
5+ years’ experience in system support, coding or operations.
-
Hands-on experience with Windows/Linux environments
-
Excellent problem-solving and communication skills
-
Provide step-by-step technical help, both written and verbal
-
Familiar with SAFe framework