We seek an experienced IT professional to join us as a Senior Site Reliability Engineer, working in the Product Reliability Engineering function who will:
- Perform day-to-day site reliability engineering functions including Maintenance and incident resolution for all applications, products, and services - including debit, prepaid and risk lines of business.
- Perform ongoing/Proactive analysis of various authorization, Api and UI based applications to detect potential problems and actively engage & facilitate the discussion to find the best possible solution.
- Work under direct supervision to ensure on-time delivery of projects, and production support plans for upgrades, enhancements, and deployments.
- Work closely with service partners such as product development, engineering teams to seamlessly implement the innovative solutions to improve the reliability, scalability, and efficiency.
- Assist in automating the routine tasks and processes to improve overall efficiency and reduce human errors.
- Actively participate in troubleshooting activities and SWAT calls and drive investigation towards swift resolution.
- Build comprehensive and robust documentation repositories that can facilitate knowledge transfer among L1/L2 peers.
- Assist the team with implementing GenAI and machine learning trends to continuously optimize the application reliability and efficiency.
- Work with observability team to design and implement the modern Visa observability solutions such as Anomaly detection, operations intelligent platform (OIP), Fault Isolation tool (FIT) across all Verifi products.
- Participate in on-call roster to support business including off-hours.
- Self-motivated, and have excellent interpersonal and communication skills.
This is a hybrid position. Expectation of days in office will be confirmed by your Hiring Manager.
- 3-6 years of relevant work experience and a Bachelors degree
Preferred Qualifications:-
- 3 or more years of work experience with a Bachelor s Degree or more than 2 years of work experience with an Advanced Degree (e.g. Masters, MBA, JD, MD)
- 3 years of experience working with Java or .NET based business critical transaction processing, API or UI based applications.
- Experience with one or more programming languages such as Python, Java, .NET, C#, PowerShell, Bash scripting.
- Experience with writing queries and working with MSSQL and MySQL databases.
- Basic understanding of CI/CD pipelines and tools like Jenkins, chef etc.
- Strong understanding of networking concepts, protocols, and architecture.
- Basic understanding of ITIL concepts & processes such as incident/change/problem management, call triaging, escalation procedures and such.
- Basic understanding of Middleware components such as Kafka, Hazelcast, Qlik etc.
- Basic understanding of container orchestration systems, such as Kubernetes.
- Basic understanding of monitoring, logging and tracing tools such as Splunk, Prometheus, Grafana, riverbed etc., for troubleshooting and performance tuning.
- Basic understanding of AI frameworks and libraries to further enhance the application resiliency and day to day operational tasks.
- Prior experience with building tools to automate production support activities that enable efficiency and productivity of all operations groups.
- Prior experience working in shift model in 15*7 environments.
- Candidate should be comfortable communicating with technical and non-technical peer groups, including Account Management, Client Services, and other technical platform and application support groups.
- Strong work ethic, self-starter, ability to work in fast-paced, team-oriented environment.