Infrastructure Monitoring Specialist

HSBC

Pune NM Years Exp Posted 242d ago

Job Description

We are currently seeking an experienced professional to join our team in the role of Infrastructure Monitoring Specialist.

 In this role, you will:

  • Collaborate closely with software and operations teams to improve end-to-end monitoring and alerting for production services.
  • Deliver lasting, preventative improvements that cross the divide between development and operations teams.
  • Coordinate our response to service-impacting incidents.
  • Routinely modify configurations or systems in a way that produces lasting improvements from a one-time effort.
  • Apply your expertise and experience to assist with architecting the next generation of services.
  • Assist with support escalation in high-impact incidents, coordinating SMEs and vendors as required.
  • Represent ITID “outwards” to manage the quality of service delivered.
  • Understand & analyze changes in technology & process across the Group / regions that would impact development & support of builds & tools.
  • Collaborate with regional teams and global function as required. Ensure understanding of practices within regions and drive standardization amongst regions.
  • Communicate project updates / progress, action plans / issues on a timely basis.
  • Organize & lead meetings with regional teams for development or support of deliverables.
  • Manage escalations.
  • Proactively identify problem situations and resolve them to maximize customer satisfaction.

Requirements

To be successful in this role, you should meet the following requirements:

  • Good communication skills to collaborate with Global and regional stakeholders
  • Strong fundamentals in distributed systems and networking
  • Experience programming in at least one of the following languages: Bash scripting, Python, JavaScript, Java, etc.
  • Experience programming with APIs.
  • Experience with DevOps tools such as Puppet, Ansible, Tanium, and Git.
  • Experience with monitoring solutions (Patrol, TrueSight, BHOM, AppDynamics, open-source tools) to create best-of-breed production monitoring, incident detection, and response solutions.
  • Develop and maintain tools used in problem investigation and remediation.
  • DevOps ("We build it / We support it"): participate in regular follow-the-sun on-call rotas to ensure adequate out-of-hours cover for the services.
  • Participate in the design and engineering of auto-healing solutions. 
