SRE Engineer
FIS
Job Description
What you will be doing
- Design, implement, and maintain highly available and scalable cloud-based infrastructure solutions.
- Develop and deploy comprehensive monitoring, alerting, and incident response mechanisms to proactively safeguard system health.
- Champion SRE best practices, driving improvements in reliability, efficiency, and automation across our engineering teams.
- Troubleshoot and resolve complex software and system issues across various technology layers.
- Collaborate closely with development teams to streamline release processes, optimize performance, and preempt potential bottlenecks.
- Participate in on-call rotation for incident response.
What you bring
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 2+ years of experience in an SRE role or similar DevOps-focused position.
- Solid foundation in Linux system administration and networking concepts.
- Proficient in a scripting language (Python, Bash, etc.) and experience with automation tools (Ansible, Terraform, Jenkins, etc.).
- Hands-on experience with cloud technologies (AWS, Azure, or GCP preferred).
- Expertise in monitoring and observability platforms (e.g., Prometheus, Grafana, Datadog, etc.).
- Strong analytical and problem-solving skills with a keen eye for detail.
- Excellent communication and collaboration abilities to work effectively in a cross-functional team.