Job Description
About the Role:
Job Title:
Senior Engineer - SRE, AVP
Location:
Pune, India
Corporate Title:
AVP
Role Description
Site reliability engineers create a bridge between development and operations by applying a software engineering mindset to system administration topics. As an SRE at Deutsche Bank, you will play a pivotal role in ensuring the reliability, scalability, and performance of our systems. You will collaborate closely with feature and cross-functional teams to design, build, and maintain robust and efficient systems, applying cutting-edge technologies and best practices.
What we'll offer you
100% reimbursement under childcare assistance benefit (gender neutral)
Sponsorship for industry-relevant certifications and education
Accident and term life insurance
Your key responsibilities
Proven experience leading and scaling Production/SRE teams in a high-growth environment.
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
Identify, design, develop, and deploy tools and processes to monitor, maintain, and report on site performance and availability.
Streamline repetitive tasks through automation using Ansible, shell scripts, and Java.
Monitor server health using Python and shell scripts.
Implement Business Continuity/Disaster Recovery plans for end-to-end application support processes.
Conduct build and configuration using release management tools, including Bitbucket and TeamCity.
Use release management and incident tracking tools, including ServiceNow, to track incidents and work items and their progress.
Leverage SQL Server and Oracle databases, Linux OS, Java, and OpenShift to analyse issues and resolve incidents.
Set up and maintain monitoring of Non-Functional Requirements (NFRs) to track the overall quality, availability, response time, security, and reliability of applications using Geneos, Prometheus, and Grafana.
Develop routines to deploy CIs to the target environments.
Provide release deployments on non-Production Management controlled environments.
Capture build and deployment notes, and develop Software Product Deployment & Operating Instructions.
Provide Level 3 support for technical infrastructure components (e.g. databases, middleware, and user interfaces).
Perform problem and root cause analysis for application production incidents and deliver the necessary resolution pack (i.e. hotfixes, patches).
Provide L3 support and remediation on any issues pertaining to the above applications by providing detailed code analysis of the applications' production platform.
Remediate incidents and outages pertaining to the platform.
Conduct regularly scheduled Problem Management meetings with IT Product Managers (ITPMs), infrastructure groups, problem managers, and incident managers to track progress and highlight issues.
Your skills and experience
Experience required: 9 to 12 years
Hands-on experience in UNIX and scripting (Shell, Perl)
Hands-on experience with various communication protocols (AS2, HTTPS, File Transfer Protocol Secure (FTPS), RFCs, SNC, MQ, etc.)
Hands-on experience with web server (Apache) implementation and configuration
Hands-on experience with application server (WebLogic) implementation and configuration
Hands-on experience with OpenShift Fabric, Tomcat, and WildFly configuration
Hands-on experience with Geneos, Control-M, Airflow, and GCP landing zone configuration
Hands-on experience with TeamCity, Jenkins, uDeploy, and CI/CD pipeline setup
Hands-on experience in Oracle PL/SQL
Good understanding of Core Java
Hands-on knowledge of handling industry-standard financial transaction file formats
Hands-on knowledge of compression and encryption techniques such as SSL, and of Secure Shell (SSH) authentication
Excellent communication and influencing skills
Education/Qualifications
Degree from an accredited college or university with a concentration in Engineering or Computer Science
How we'll support you