DevOps Engineer
griddynamics
Job Description
-
Design, implement, and maintain scalable log ingestion and observability platforms.
-
Deploy and manage ClickHouse for high-performance log storage and analytics.
-
Build and maintain dashboards and monitoring solutions using Grafana.
-
Implement distributed tracing and telemetry using OpenTelemetry (OTEL).
-
Configure and optimize log ingestion, processing, and retention pipelines.
-
Develop and manage automated alerting solutions using Everbridge or similar enterprise alerting platforms.
-
Manage multi-tenant observability environments with appropriate access controls.
-
Automate log retention policies, archival, and lifecycle management.
-
Collaborate with engineering and security teams to improve system reliability and operational efficiency.
-
Perform production support, incident response, troubleshooting, and root cause analysis.
Qualifications
-
3–5 years of experience in DevOps, SRE, or Platform Engineering.
-
Hands-on experience with:
-
ClickHouse
-
Grafana
-
OpenTelemetry (OTEL)
-
Log ingestion and observability pipelines
-
Monitoring and alerting platforms
-
-
Experience with enterprise alerting tools such as Everbridge or equivalent.
-
Strong understanding of multi-tenant environments and observability architecture.
-
Experience with retention policy automation and data lifecycle management.
-
Good knowledge of Linux, networking, and cloud infrastructure.
-
Scripting experience using Bash, Python, or similar.
-
Willingness to work night shifts.
-
Ability to work in a hybrid model (3 days from the Bangalore office).
-
Immediate to 30 days' notice period preferred.
-