Job Description
Role Overview:
As a Lead BizOps Engineer at Mastercard, you will be responsible for driving the customer experience strategy forward by consistently innovating and problem-solving. You will lead a team of engineers managing a high-availability infrastructure environment using AWS Cloud, Kubernetes, Linux, CI/CD, and other open-source technologies. Your role will involve supporting daily operations, analyzing ITSM activities, assessing security vulnerabilities, and providing technical leadership to mentor junior team members.

Key Responsibilities:
- Lead a team of engineers managing a high-availability infrastructure environment using AWS Cloud, Kubernetes, Linux, CI/CD, and other open-source technologies.
- Support daily operations with a focus on triage and root cause analysis, understanding the business impact of products, and practicing sustainable incident response with blameless postmortems.
- Take a holistic approach to problem-solving during production events to optimize mean time to recovery.
- Provide technical leadership and mentor junior team members on best practices and processes.
- Analyze ITSM activities and provide feedback to infrastructure and development teams on operational gaps or resiliency concerns.
- Assess the security of existing and proposed systems, recommending and implementing plans to resolve vulnerabilities.
- Establish and recommend processes, policies, and standards for system use and services, and innovate on new methodologies to improve operations.
- Understand business processes and production applications to troubleshoot issues and recommend short- and long-term resolutions.
- Coordinate with internal groups to resolve recurrent problems, alerts, and escalated issues, ensuring clear communication.
- Exhibit a sense of urgency to resolve issues and ensure SLAs and operational standards are met.
- Clear communication skills, both spoken and written, with the ability to read and interpret complex information, talk with customers, and listen well.

Qualifications:
- BSc/MSc IT/Comp, BCA, MCA, BE, BTech.
- 6+ years of Dev, SRE, Automation, and/or system administration experience with large-scale cloud-native microservices platforms.
- 3+ years of hands-on experience managing and monitoring complex API-focused environments, preferably in AWS and Kubernetes.
- Strong communication and leadership skills to foster team collaboration and problem-solving.
- Experience with infrastructure automation and scripting using Python and/or Bash.
- Experience with Infrastructure-as-Code using Terraform, CloudFormation, Packer, etc.
- Strong hands-on experience with monitoring tools such as Splunk, Dynatrace, Prometheus, Grafana, ELK stack, etc., to build observability for large-scale microservices deployments.
- Excellent problem-solving, triaging, and debugging skills in large-scale distributed systems.
- Experience managing cloud infrastructure and operations in strict security, compliance, and regulatory environments.
- Experience with CI/CD frameworks and Pipeline-as-Code such as Jenkins, Spinnaker, GitLab, Argo, Artifactory, etc.
- Proven ability to work effectively across teams and functions to influence the design, operations, and deployment of highly available software.
- AWS Solutions Architect certification preferred.