DevOps Engineer

zappyhire

Kochi | 3 Years Exp | Posted 25d ago

Job Description

Key Responsibilities:

Support Function Requirements: This role is a critical support function, providing the initial line of defense for the Customer Success and SDLC teams. Effective participation in the on-call rotation necessitates punctuality, impartiality, and professionalism, ensuring solutions are delivered to the business with integrity and dependability. Accountability, Ownership, and Punctuality are non-negotiable requirements for this position.

Infrastructure & Cloud Management

● Design, deploy, and maintain AWS cloud infrastructure including EKS, EC2, RDS, Lambda, S3, Route53, and IAM
● Manage and optimize Kubernetes environments with focus on EKS cluster health, scaling, and troubleshooting
● Maintain configuration management standards across environments
● Ensure infrastructure security through access controls, network policies, and compliance with security best practices

Environment Health & Developer Support

● Maintain health and stability of development, staging, and production environments
● Provide swift, responsive support to developers and testers to unblock workflows and resolve environment issues
● Proactively identify environment cracks, configuration drifts, and anomalies before they impact teams
● Leverage Datadog to monitor environment health, detect drift, and maintain observability across all systems

CI/CD & Deployment

● Build and maintain CI/CD pipelines using GitHub Actions and AWS CodePipeline
● Implement and manage GitOps-based delivery workflows using Argo CD
● Support containerization strategies and maintain container orchestration systems
● Ensure reliable, repeatable deployments across all environments

Monitoring & Incident Response

● Implement comprehensive monitoring using Datadog for infrastructure, applications, and services
● Create actionable dashboards, alerts, and metrics for proactive system management
● Participate in 24/7 on-call rotation providing support to internal teams and external stakeholders
● Methodically troubleshoot and resolve issues across the full technology stack
● Conduct root cause analysis and implement preventive measures

Database & Application Support

● Provide operational support for MongoDB, Neo4j (Graph), and PostgreSQL databases
● Troubleshoot application issues across Python and Node.js environments
● Support database performance tuning and optimization efforts

Documentation & Process Improvement

● Document infrastructure components, runbooks, and operational procedures
● Develop automation scripts using Python and shell scripting to streamline operations
● Identify process gaps and proactively flag potential issues before they escalate
● Contribute to continuous improvement of DevOps practices and tooling

Required Qualifications:

● 3+ years of experience in DevOps, SRE, or similar infrastructure roles
● Strong expertise in AWS services (EKS, EC2, RDS, Lambda, S3, Route53, IAM)
● Hands-on experience with Kubernetes orchestration, particularly EKS cluster management
● Proficiency with CI/CD tools including GitHub Actions, AWS CodePipeline, and Argo CD
● Experience with containerization technologies (Docker) and microservices architectures
● Proven ability to identify configuration drifts and environment issues through monitoring tools, preferably Datadog
● Working knowledge of Python and shell scripting for automation
● Familiarity with database technologies including MongoDB, PostgreSQL, and Graph databases
● Strong understanding of infrastructure security principles
● Excellent documentation and communication skills

Preferred Qualifications:

● AWS certifications (Solutions Architect, DevOps Engineer, or SysOps Administrator)
● Experience with Neo4j or other graph databases
● Background in GitOps workflows and advanced deployment strategies
● Knowledge of serverless architecture patterns
● Experience with distributed systems troubleshooting
● Familiarity with compliance frameworks and security auditing

What We're Looking For:

● Methodical problem-solver who approaches issues systematically and sees them through to resolution
● Responsive team supporter who provides swift assistance to developers and testers
● Proactive communicator who flags potential issues early and keeps stakeholders informed
● Multitasker comfortable managing multiple priorities and context-switching when needed
● Detail-oriented with a keen eye for spotting environment anomalies and configuration drift
● Team player ready to support colleagues and contribute to on-call rotations
● Continuous learner eager to stay current with evolving cloud technologies and practices
● A proactive AI enthusiast dedicated to driving processes and operational enhancements through AI-powered solutions

What We Offer:

● Collaborative and innovative work environment
● Opportunities to work with modern cloud-native technologies
● Professional development and continuous learning opportunities
● Competitive compensation and benefits package
