Sr AWS DevOps Engineer - 66304

keka

Hyderabad, India | 5 Years Exp | Posted 45d ago

Job Description

  • Advanced Linux & OS Troubleshooting: Perform in-depth analysis of system issues, including disk I/O bottlenecks, kernel panics, service failures, and networking issues, using tools such as strace, journalctl, netstat, iostat, and lsof.
  • DR Solution Design & Build: Architect, provision, and configure a comprehensive Disaster Recovery solution on AWS, including compute, networking, storage, and failover components aligned to defined RPO/RTO objectives.
  • Infrastructure Enhancements: Identify and implement additional infrastructure improvements to strengthen the reliability, scalability, and security posture of the DR environment, including advanced networking using Transit Gateway, VPC Peering, and traffic segmentation.
  • CI/CD Pipeline Development: Design and maintain Jenkins-based CI/CD pipelines that use CloudFormation to automate infrastructure provisioning, application deployments, and environment configuration across dev, staging, and production environments.
  • Risk Remediation & Security Hardening: Assess and remediate identified risks across IAM roles and policies, Secrets Manager configurations, encryption standards, WAF rules, and third-party dependencies to meet security and compliance requirements.
  • Disaster Recovery Testing & Validation: Execute and support DR testing activities, including RPO/RTO validation, cross-region replication verification, Multi-AZ failover simulations, and Route 53 DNS failover testing.
  • Technical & DR Documentation: Author and maintain technical documentation, including DR Operational Readiness Documents (ORDs), runbooks, and architecture diagrams, using provided templates and guidance.
  • Monitoring & Observability: Configure and maintain AWS CloudWatch dashboards, alarms, and log groups to provide full visibility into DR infrastructure health, incidents, and automated recovery actions.
  • Deployment Support & Warranty: Provide hands-on deployment support during go-live and deliver post-deployment warranty coverage to ensure environment stability and rapid resolution of any issues.
  • GitLab to GitHub Migration: Support the potential migration of source code repositories from GitLab to GitHub, including pipeline reconfiguration, branch strategy alignment, and post-migration validation, as bandwidth permits.
  • Scripting & Automation: Develop and maintain Shell and Python scripts, with observability and guardrails built in, to automate routine operational tasks, DR workflows, risk-remediation actions, and event-driven processes.
