Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home

Jobs

Home
>
Jobs in Noida
>
HCLTech
>
Site Reliability Engineer

Site Reliability Engineer

HCLTech

8 years

0 Lacs

Noida Uttar Pradesh India

Posted:7 hours ago| Platform:

Apply

Skills Required

reliability engineering development automate leadership vision strategy drive scalability training automation collaboration architecture network design monitoring analysis scripting code provisioning management onboarding devops tooling support testing planning analyze metrics containerization kubernetes openshift gcp azure aws programming certifications datadog

Work Mode

On-site

Job Type

Full Time

Job Description

Job Title: Site Reliability Engineer (SRE) - LEAD

Department:

Job Summary:

Site Reliability Engineer (SRE)

Key Responsibilities:

Strategic Leadership & Governance

Define and evolve the SRE CoE vision, strategy, and roadmap.
Establish enterprise-wide SRE standards, frameworks, and maturity models.
Drive adoption of SRE principles across product and platform teams.

Enablement

Act as a subject matter expert and advisor to engineering teams on reliability, scalability, and performance.
Conduct workshops, training sessions, and knowledge-sharing forums.
Promote a culture of observability, automation, and continuous improvement.

Collaboration & Mentorship

Partner with engineering, product, and operations leaders to align reliability goals with business outcomes.
Mentor SREs and engineers across teams, fostering a community of practice.
Lead cross-functional reliability reviews and architecture assessments.
Collaborate with development, operations, and network teams.
Align infrastructure reliability with application SLOs/SLIs.
Advocate for best practices in system architecture and operations.

Infrastructure & Reliability

Design, implement, and maintain scalable, reliable infrastructure.
Ensure high availability and disaster recovery strategies.
Improve reliability for legacy and hybrid (cloud/on-prem) systems.

Monitoring & Incident Management

Develop and maintain monitoring, alerting, and incident response systems.
Conduct root cause analysis and post-mortems.
Participate in on-call rotations and respond to production issues.

Automation & Efficiency

Automate repetitive tasks using scripting and tooling.
Lead Infrastructure-as-Code (IaC) and automation for provisioning and scaling.
Create sustainable systems through automation and continuous improvement.
Evaluate and recommend tools for monitoring, alerting, incident management, and chaos engineering.
Build reusable automation frameworks and templates for onboarding teams to SRE practices.
Collaborate with DevOps and platform teams to integrate reliability tooling into CI/CD pipeline
Support rigorous testing and release procedures.

Performance & Capacity

Lead capacity planning, system upgrades, and OS patching.
Gather and analyze system/application metrics for performance tuning.

Containerization & Cloud

Support Kubernetes and container platforms in hybrid environments.
Work with OpenShift, GCP, Azure and AWS for cloud-integrated services.

Required Qualifications:

Bachelor’s degree in computer science, Engineering, or a related field (or equivalent experience).
8+ years of experience in SRE.
Proficiency in at least one programming/scripting language (e.g., Python).
Experience with cloud platforms (AWS, GCP, Azure).

Preferred Qualifications:

Experience in setting up or leading a CoE or similar strategic function.
Certifications in cloud, DevOps, or SRE-related domains.
Experience with chaos engineering and resilience testing.
Experience with observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
Experience with incident management and SLO/SLI/SLA frameworks.

More Jobs at HCLTech

Golang Developer

Pune, Bengaluru, Noida

6 - 11 yrs

INR 16 - 31 Lacs

Aws Cloud Engineer

Chennai, Bengaluru, Hyderabad

5 - 10 yrs

INR 15 - 30 Lacs

React.js Full stack Developer

Bengaluru

1 - 3 yrs

INR 5 - 11 Lacs

Mega Walk-In Drive For Mortgage Underwriter

Bengaluru

2 - 7 yrs

INR 3 - 8 Lacs

HCL Tech Hiring For Tosca Automation Testing

Chennai

7 - 12 yrs

INR 5 - 15 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.