Senior Engineer - SRE
Job Description
Role Description
- Site reliability engineers create a bridge between development and operations by applying a software engineering mindset to system administration topics.
- As an SRE at Deutsche Bank, you will play a pivotal role in ensuring the reliability, scalability, and performance of our systems.
- You will collaborate closely with feature and cross-functional teams to design, build, and maintain robust and efficient systems, applying cutting-edge technologies and best practices.
What we’ll offer you
As part of our flexible scheme, here are just some of the benefits that you’ll enjoy:
- Best-in-class leave policy
- Gender-neutral parental leave
- 100% reimbursement under childcare assistance benefit (gender-neutral)
- Sponsorship for industry-relevant certifications and education
- Employee Assistance Program for you and your family members
- Comprehensive Hospitalization Insurance for you and your dependents
- Accident and term life insurance
- Complimentary health screening for ages 35 and above
Your key responsibilities
- Proven experience leading and scaling Production/SRE teams in a high-growth environment.
- Maintain services once they are live by measuring and monitoring availability, latency, and the overall system health.
- Identify, design, develop, and deploy tools and processes to monitor, maintain, and report site performance and availability.
- Automate repetitive tasks using Ansible, shell scripts, and Java; monitor server health using Python and shell scripts; implement Business Continuity/Disaster Recovery plans for end-to-end application support processes.
- Conduct builds and configuration using release management tools, including Bitbucket and TeamCity; use release management and incident tracking tools, including ServiceNow, to track incidents, work items, and their progress.
- Use SQL Server and Oracle databases, Linux, Java, and OpenShift to analyze issues and resolve incidents; set up and maintain monitoring of Non-Functional Requirements (NFRs) to track the overall quality, availability, response time, security, and reliability of applications using Geneos, Prometheus, and Grafana.
- Develop routines to deploy CIs to the target environments.
- Provide release deployments on non-Production Management controlled environments.
- Capture build and deployment notes, and develop Software Product Deployment & Operating Instructions.
- Provide Level 3 support for technical infrastructure components (e.g. databases, middleware and user interfaces).
- Perform problem and root cause analysis for application production incidents and deliver the necessary resolution pack (i.e. hotfixes, patches).
- Provide L3 Support and remediation on any issues pertaining to the above applications by providing detailed code analysis of applications’ production platform. Remediate incidents and outages pertaining to the platform.
- Conduct regularly scheduled Problem Management meetings with IT Product Managers (ITPMs), infrastructure groups, problem managers and incident managers to track progress and highlight issues.
Your skills and experience
- Experience required: 9 to 12 years
- Hands-on experience in UNIX and scripting (Shell, Perl)
- Hands-on experience in various communication protocols (AS2, HTTPS, File Transfer Protocol Secure (FTPS), RFCs, SNC, MQ, etc.)
- Hands-on experience with web server (Apache) implementation and configuration
- Hands-on experience with application server (WebLogic) implementation and configuration
- Hands-on experience with OpenShift Fabric, Tomcat, and WildFly configuration
- Hands-on experience with Geneos, Control-M, Airflow, and GCP landing zone configuration
- Hands-on experience with TeamCity, Jenkins, uDeploy, and CI/CD pipeline setup
- Hands-on experience in Oracle PL/SQL
- Good understanding of Core Java
- Hands-on knowledge of handling industry-standard financial transaction file formats
- Hands-on knowledge of various compression and encryption techniques, such as SSL, and of Secure Shell (SSH) authentication
- Excellent communication and influencing skills.