SRE Production Support (Full time with MNC)

7 - 12 years

15 - 27 Lacs

Posted: 1 day ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

Job Summary:

We are seeking a highly skilled and motivated Lead Infra - SRE to join our Infrastructure Management team. The ideal candidate will possess a strong foundation in Site Reliability Engineering (SRE) principles and practices, with a focus on ensuring the reliability, availability, and performance of our infrastructure. This role requires a proactive approach to problem-solving and a commitment to continuous improvement in our systems and processes.

Responsibilities:

  • Design, implement, and maintain scalable and reliable infrastructure solutions that meet the needs of our applications and services.
  • Monitor system performance and reliability, identifying and resolving issues proactively to minimize downtime.
  • Collaborate with development teams to integrate SRE practices into the software development lifecycle, ensuring that reliability is a key consideration from the outset.
  • Develop and maintain automation tools and scripts to streamline operations and improve efficiency.
  • Implement and manage incident response processes, ensuring timely resolution of incidents and effective communication with stakeholders.
  • Conduct post-incident reviews to identify root causes and implement preventive measures.
  • Stay current with industry trends and best practices in SRE and infrastructure management, recommending improvements to our processes and technologies.
  • Mentor and guide junior team members, fostering a culture of learning and collaboration within the team.

Mandatory Skills:

  • Strong knowledge of Site Reliability Engineering (SRE) principles and practices.
  • Proficiency in cloud platforms (e.g., AWS, Azure, GCP) and container orchestration technologies (e.g., Kubernetes).
  • Experience with infrastructure as code (IaC) tools such as Terraform or CloudFormation.
  • Solid understanding of monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Strong scripting skills in languages such as Python, Bash, or Go.
  • Excellent problem-solving skills and the ability to work under pressure.
  • Strong communication and collaboration skills, with the ability to work effectively in a team environment.

Preferred Skills:

  • Experience with configuration management tools (e.g., Ansible, Puppet, Chef).
  • Familiarity with CI/CD pipelines and DevOps practices.
  • Knowledge of security best practices in infrastructure management.
  • Experience with database management and optimization.
  • Understanding of networking concepts and protocols.

Qualifications:

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • Relevant certifications in SRE, cloud technologies, or infrastructure management are a plus.

If you are passionate about infrastructure management and have a strong SRE background, we encourage you to apply and join our dynamic team!

Tekgence

Information Technology and Services

Tech City
