Lead Site Reliability Engineer

10 - 14 years

35 - 40 Lacs

Posted: 3 days ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

Lead Site Reliability Engineer
==
Lead Site Reliability Engineer (Storage)

Pune, India

Our Purpose

We work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team, one that makes better decisions, drives innovation and delivers better business results.

Role Summary

We're seeking a Lead Site Reliability Engineer to advance our SRE capabilities across enterprise storage platforms, with a focus on Software Defined Storage (Ceph). This role involves managing and leading software-defined Ceph storage (Object, Block, File) efforts, building automation and monitoring solutions, improving infrastructure availability, and collaborating across global teams.

Key Responsibilities

- Lead day-to-day operations of Mastercard's enterprise storage platforms.
- Represent the storage team in project meetings, offering technical support and guidance.
- Collaborate with internal teams to understand monitoring and automation needs.
- Design and implement storage solutions using tools like Ansible, Bitbucket, Chef, and Jenkins.
- Administer and maintain enterprise monitoring tools in a multi-tier storage environment.
- Troubleshoot issues across networking, Linux/Unix systems, and applications.
- Maintain documentation for all solutions and processes.
- Lead and mentor a team of engineers and drive cross-training efforts.
- Participate in disaster recovery planning and yearly audits.
- Continuously learn and integrate emerging technologies.
- Lead vulnerability management, patching and compliance efforts.

About You

- Proven experience resolving complex availability issues through automation and monitoring.
- Self-starter with minimal need for supervision.
- Comfortable working with geographically distributed teams.
- Strong expertise in UNIX/Red Hat Linux, Ceph storage, and networking/security.
- Proficient in scripting languages like Python and Bash.
- Hands-on experience with Grafana, Prometheus, HAProxy, and Pacemaker.
- Ability to identify and automate repetitive tasks.
- Strong analytical and problem-solving skills.
- Excellent communication, documentation, and time management abilities.
- Experience leading workstreams and mentoring technical talent.
- Familiarity with ITSM processes, incident/change management, and vendor coordination.
- Willingness to provide 3rd-line out-of-hours operational support.

Corporate Security Responsibility

- Abide by Mastercard's security policies and practices.
- Ensure the confidentiality and integrity of the information being accessed.
- Report any suspected information security violation or breach.
- Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.




Mastercard

IT Services and IT Consulting

Purchase, NY
