So, what s the role all about
We are building a critical function focused on proactively monitoring data processing pipelines for each of our customers, with the goal of identifying and addressing any data gaps before they impact the customer experience.
As a key member of this team, you will operate in a fast-paced, high-impact environment, responsible for monitoring, automating, and supporting a mission-critical data ingestion module. Your work will directly contribute to ensuring data reliability, operational excellence, and customer trust.
How will you make an impact
You will team up with highly talented engineers and architects, working on cutting-edge technology in a fast-paced environment. Your responsibilities will include:
- Daily monitoring of systems to ensure customer data is processed correctly.
- Ensuring timely data processing (T+1 or T+5 for weekly arriving data).
- Reviewing data ingestion and alert generation using automated scripts.
- Executing manual monitoring tasks for new use cases until automation is implemented.
- Identifying data gaps and raising cases to appropriate internal teams.
- Performing trend analysis and communicating with stakeholders.
- Automating tasks using Python, Perl, and PowerShell scripting.
- Managing change processes and deployment plans.
- Creating and maintaining a technical knowledge base.
- Providing hands-on support for AWS/Azure environments.
- Collaborating with Support, R&D, and Operations teams to meet customer-specific data processing needs.
- Solving complex customer data problems across multiple critical applications.
- Building internal tools and utilities to optimize operations.
- Driving customer communication during critical events.
- Providing on-call support and working in a 24 7 shift environment.
Have you got what it takes
- 3 4 years of relevant experience.
- Strong hands-on experience in managing Application Support (2-tier/3-tier apps).
- Excellent problem-solving, analytical, and communication skills.
- Experience handling complex application performance issues.
- Proficiency in managing containerized/cloud-based applications (AWS services like EC2, S3, VPN).
- Experience in Monitoring/Operations or Infrastructure Operations teams.
- Strong troubleshooting skills.
- OS-level knowledge (Windows or Linux).
- Database skills (SQL, Oracle, Postgres, Cassandra).
- Middleware experience (Tomcat, WebLogic, WebSphere).
- Ability to identify root causes of performance issues and mitigate bottlenecks.
- Exposure to scripting languages (Ansible, Perl, Python, Ruby, Shell, PowerShell).
- Familiarity with tools like OpsGenie, Nagios, Rundeck.
- Experience with Kubernetes and cloud/application-level security.
- Background in Banking & Financial domain is a plus.
- Experience working in Agile/Sprint development models.