Software Development Lead

12 - 17 years

9 - 13 Lacs

Posted: 18 hours ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description


 About The Role  

Project Role
Software Development Lead

Project Role Description
Develop and configure software systems either end-to-end or for a specific stage of product lifecycle. Apply knowledge of technologies, applications, methodologies, processes and tools to support a client, project or entity.
Must have skills
Site Reliability Engineering

Good to have skills
DevOps, AWS Administration

Minimum 15 year(s) of experience is required

Educational Qualification
BTECH

Summary
As the manager of the Site Reliability Engineering (SRE) team, you will lead a high-impact team focused on building and scaling automation, observability, and incident response to improve the reliability, stability, and performance of our cloud-based services. You will play a critical role in shaping the reliability strategy for our SaaS platforms, driving innovation in incident response, and ensuring our systems are resilient, performant, and aligned with customer expectations. This role requires a strong technical foundation, a passion for operational excellence, and the ability to lead cross-functional collaboration across engineering and operations teams.

Roles and responsibilities
12+ years of relevant experience in SRE, DevOps, or infrastructure engineering, with 4+ years of experience in a technical leadership or management role.
People management experience within software development, managing a high-performing team with strong SRE capabilities around observability, reliability, and incident response.
Ability to roll out enterprise-level programs (e.g., SLOs, incident response, observability standards) across a variety of product and engineering teams.
Recruiting, interviewing, and hiring top engineering talent to fill team needs that are aligned with a broader talent strategy.
Development of engineers' career planning and skills growth.
Identify areas for engineers to build more knowledge and create opportunities for them to exercise these new engineering and soft skills in practice.
Work closely with leadership located in other geographies on joint efforts to drive the Site Reliability Engineering journey.
Deep understanding of SRE principles, including SLOs and SLIs.
Strong knowledge of cloud platforms (AWS, Azure, OCI) and infrastructure-as-code tools (e.g., Terraform).
Expertise implementing and running observability and monitoring tools (e.g., Datadog, Dynatrace, ELK).
Lead incident response processes, including coordination, root cause analysis (RCA), and long-term mitigation.
Experience managing teams in a 24/7 production environment.
Proficiency in software development automation (e.g., Python, Go, Shell).
Excellent communication and collaboration skills across technical and non-technical stakeholders.
Foster a culture of continuous improvement, blameless postmortems, and proactive monitoring.
Balance innovation with operational excellence.
Drive alignment across engineering, product, and operations on service health and customer impact.
Practitioner of agile practices, able to play lead roles such as Scrum Master or Product Owner. Agile role certifications a plus, including value stream mapping practices to identify and eliminate waste in software delivery processes.
Display empathy towards engineers and their friction, and work with them to develop common solutions.

Technical experience & Professional attributes
Lead and mentor a team of Site Reliability Engineers aligned to value streams and agile teams.
Define and implement SRE best practices, including incident management, blameless postmortems, and error budgeting.
Drive the adoption of observability standards across the enterprise using tools like Datadog and CloudWatch.
Collaborate with engineering teams to design scalable, fault-tolerant systems with insightful observability.
Partner with Customer Success and Product teams to map and monitor key user journeys.
Manage on-call rotations and ensure effective incident response and root cause analysis.
Contribute to the evolution of our Cloud Platform by standardizing monitoring, alerting, and deployment practices.
Support training and enablement efforts through internal platforms.

Education qualifications
Bachelor's degree in Computer Science, Information Systems, or a related field; or an equivalent combination of education and experience. Master's degree is a plus.

Additional Information
Be part of the larger Site Reliability and Cloud Engineering organization.
Be an influential people leader of a new site. This includes working with the site leader and senior leadership in coordinating site-level activities and other functions as the site grows.
Manage a team of varying seniority and skills around Site Reliability Engineering practices.
You will be working with a Trusted Tax Technology Leader, committed to delivering reliable and innovative solutions.

Accenture

Professional Services

Dublin
