Software Development Engineer IV DevOps
Meesho
Job Description
What you will do
- Develop reusable infrastructure code and testing frameworks for infrastructure.
- Develop tools and frameworks to allow Meesho engineers to provision and manage infrastructure access controls.
- Design and develop solutions for cloud security, secrets management, and key rotation.
- Design a centralised logging and metrics platform that can handle Meesho’s scale.
- Take on new infrastructure requirements and develop infrastructure code.
- Work with service teams to help them onboard to the container platform.
- Scale the Meesho platform to handle millions of requests concurrently.
- Drive solutions to reduce MTTR and MTTD, enabling High Availability and Disaster Recovery.
What you will need
- Bachelor's / Master's degree in Computer Science.
- 8-12 years of in-depth, hands-on professional experience in the DevOps / Systems Engineering domain.
- Strong proficiency in systems, Linux, open source, infrastructure engineering, and DevOps fundamentals.
- Hands-on experience with data processing frameworks (e.g. Spark, Databricks).
- Familiarity with Big Data technologies.
- Experience with DataOps concepts and tools (e.g. Airflow, Zeppelin).
- Exceptional design and architectural skills.
- Experience in building large-scale distributed systems.
- Experience with scalable transactional systems (B2C).
- Expertise in designing and implementing solutions on public cloud (AWS) for security, resilience, etc.
- Expertise in security hardening of cloud infrastructure, applications, and web servers against known and unknown vulnerabilities.
- Understanding of compliance and security.
- Ability to assess business needs and requirements to ensure appropriate approaches.
- Expertise in capacity planning and design, cost and effort estimation, and cost optimisation.
- Ability to deliver the best operations tooling and practices, including CI/CD.
- In-depth understanding of SDLC.
- Ability to write infrastructure as code for public or private clouds.
- Ability to implement modern cloud integration architecture.
- Knowledge of configuration and infrastructure management tools (Chef/Puppet/Ansible and Terraform) or any CI tool.
- Knowledge of a scripting language: Python, Shell, or Go (proficiency in any one).
- Knowledge of container services like Docker, Kubernetes, EKS/GKE, etc.
- Ability to architect and implement end-to-end monitoring of solutions in the cloud.
- Ability to design for failover, high availability, MTTR, MTTD, RTO, RPO and so on.
- Ability to define and report on business and process metrics.
- Ability to balance governance, ownership and freedom against reliability.
- Ability to develop and motivate individual contributors on the team.