SRE / Reliability Engineer
infogain
Job Description
ROLES & RESPONSIBILITIES
-
Education:
-
Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
-
Certifications in Dynatrace (e.g., Dynatrace Certified Professional or similar) are a plus.
-
-
Experience:
-
8+ years of experience in application performance monitoring (APM), systems engineering, or site reliability engineering (SRE).
-
2+ years of hands-on experience implementing and managing Dynatrace in an enterprise environment, with a focus on full-stack monitoring and performance optimization.
-
Experience in monitoring distributed applications, microservices, containers, and Large Enterprise ecosystems .
-
Familiarity with cloud environments (AWS, Azure)
-
-
Technical Expertise:
-
Strong knowledge of Dynatrace platform capabilities (e.g., AI-driven insights, Distributed Tracing, PurePath, Real User Monitoring, Log Monitoring).
-
Experience with cloud-native technologies like Kubernetes, Docker, and container orchestration tools.
-
Proficiency with scripting and automation tools (e.g., Python, Bash).
-
Familiarity with monitoring best practices, such as defining SLOs, SLIs, and implementing monitoring as code.
-
Experience integrating Dynatrace with third-party tools like ITSM (ServiceNow), ticketing systems, and CI/CD tools.
-
-
Soft Skills:
-
Strong analytical skills with the ability to identify performance bottlenecks and recommend optimization strategies.
-
Excellent troubleshooting skills, with a focus on proactive monitoring and performance improvement.
-
Ability to collaborate effectively with cross-functional teams and communicate technical concepts to both technical and non-technical stakeholders.
-
Strong written and verbal communication skills.
-