Site Reliability Engineer
coforge
Job Description
Job Description:
We at Coforge are hiring a Site Reliability Engineer with the following skillset:
- Design, implement, and manage scalable and secure cloud-based infrastructure using AWS Cloud Services.
- Develop and maintain Infrastructure as Code (IaC) using Python CDK and Terraform for provisioning and managing cloud resources.
- Deploy and orchestrate containerized applications using Kubernetes and Docker.
- Write and maintain automation scripts using Python (Mandatory) and Shell Scripting to streamline system operations and deployments.
- Implement and manage CI/CD pipelines using Jenkins to ensure efficient, automated software delivery.
- Collaborate with development and operations teams to ensure seamless integration with version control systems like Git.
- Monitor system health and performance using tools like Prometheus and Grafana, and troubleshoot any issues to ensure reliability.
- Maintain Linux-based environments, ensuring optimal performance and security.
- Ensure cloud infrastructure aligns with best practices following the AWS Well-Architected Framework.
- Manage messaging queues and streaming platforms such as RabbitMQ and Kafka for distributed systems.
- Implement caching mechanisms using Redis for enhancing application performance.
- Work with databases such as Oracle DB to ensure data integrity and high availability.