Observability Engineer Lead
Equifax
Job Description
Key Accountabilities
- Maintain a backlog and prioritise tasks related to observability
- Develop and implement the organisation’s observability strategy
- Lead and manage a team of observability engineers
- Focus on the availability, reliability, and performance of systems in production
- Manage all SDLC environments for the tribe, including cloud platform, shared assets and tools
- Manage software deployments and releases for the tribe, with a view to driving down the number of failed changes and warranty issues
- Responsible for monitoring, scale, performance and cost optimisation
- Stewardship and tracking of system SLIs, SLOs and SLAs
- Solve problems and triage complex distributed architecture service maps. Be on call for high-severity application incidents and improve runbooks to reduce MTTR
- Lead blameless availability postmortems and own the calls to action to prevent recurrences
What experience you need
- Tertiary education in Technology or Business, or equivalent job experience, required
- 8+ years’ experience with monitoring tools such as Google/AWS Cloud Monitoring, AppDynamics, Datadog, Splunk, Elasticsearch or similar
- 8+ years’ experience in system support, coding or operations
- 8+ years’ experience in system administration, including automation and orchestration of Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)
- Excellent problem-solving and communication skills
- Ability to provide step-by-step technical help, both written and verbal
- Familiarity with the SAFe framework
What could set you apart
- You take a system problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
- Experience managing infrastructure as code via tools such as Terraform or CloudFormation
- Passion for automation with a desire to eliminate toil whenever possible
- You’ve built software or maintained systems in a highly secure, regulated or compliant industry
- Experience with, and passion for, working within a DevOps culture and as part of a team
- Proficiency with continuous integration and continuous delivery tooling and practices