Site Reliability Engineer - Intermediate
equifax
Job Description
What you’ll do
-
Work in a DevSecOps environment responsible for the building and running of large-scale, massively distributed, fault-tolerant systems.
-
Work closely with development and operations teams to build highly available, cost effective systems with extremely high uptime metrics.
-
Work with cloud operations team to resolve trouble tickets, develop and run scripts, and troubleshoot
-
Create new tools and scripts designed for auto-remediation of incidents and establishing end-to-end monitoring and alerting on all critical aspects
-
Build infrastructure as code (IAC) patterns that meet security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and programming with cloud SDK).
-
Participate in a team of first responders in a 24/7 environment, follow the sun operating model for incident and problem management
What experience you need
-
-
BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required
-
2-5 years of experience in software engineering, systems administration, database administration, and networking.
-
1+ years of experience developing and/or administering software in public cloud
-
Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives.
-
Experience in languages such as Python, Bash, Java, Go JavaScript and/or node.js
-
Demonstrable cross-functional knowledge with systems, storage, networking, security and databases
-
System administration skills, including automation and orchestration of Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)
-
Proficiency with continuous integration and continuous delivery tooling and practices
-
Cloud Certification Strongly Preferred
-
What could set you apart
-
You have experience designing, analyzing and troubleshooting large-scale distributed systems.
-
You take a system problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
-
You have experience managing Infrastructure as code via tools such as Terraform or CloudFormation
-
You are passionate for automation with a desire to eliminate toil whenever possible
-
You’ve built software or maintained systems in a highly secure, regulated or compliant industry
-
You thrive in and have experience and passion for working within a DevOps culture and as part of a team