Team Lead - Operations

5 - 7 years

10 - 13 Lacs

Posted:None| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Roles and Responsibilities:

Operations Team Lead

Key Responsibilities:

  • Lead the operations team in managing and maintaining production and staging servers, ensuring optimal uptime, performance, and security.
  • Monitor system health, logs, and metrics using tools like

    Nagios, Prometheus, Grafana, or ELK stack

    , and proactively identify and resolve issues before they impact services.
  • Drive root cause analysis and resolution for incidents and service disruptions, including performance bottlenecks, software bugs, and configuration issues.
  • Ensure secure configuration and operation of infrastructure by enforcing best practices in

    firewalling, access control, patch management, and vulnerability mitigation

    .
  • Implement and refine standard operating procedures (SOPs) for system maintenance, monitoring, deployments, and escalation management.
  • Serve as the escalation point for technical issues, customer concerns, and production incidents, coordinating timely resolution and communication.
  • Collaborate with internal engineering, QA, and support teams to streamline operations, improve system resilience, and enhance customer satisfaction.
  • Plan and manage regular system updates, security audits, and failover testing to ensure business continuity.
  • Provide mentorship, training, and performance oversight for team members, promoting a culture of accountability and continuous improvement.
  • Maintain documentation for infrastructure architecture, operational processes, and incident reports.

Desired Candidate Profile:

  • Bachelors degree in Computer Science, Information Technology, or a related field.
  • 5+ years of experience in IT operations, system administration, or DevOps, including team leadership or supervisory roles.
  • Strong hands-on experience with

    Linux/Unix server environments

    , networking concepts, and infrastructure tools.
  • Proficient in

    troubleshooting, performance tuning, log analysis

    , and system debugging.
  • Knowledge of monitoring and alerting systems, automation/scripting (e.g., Bash, Python), and infrastructure-as-code tools is a plus.
  • Familiarity with customer support practices, incident communication, and service-level management.
  • Experience working in environments with high availability, disaster recovery, and security compliance requirements.
  • Strong interpersonal and communication skills, with the ability to manage stakeholders, lead teams, and handle customer-facing situations with professionalism.
  • Certifications such as

    RHCE, AWS SysOps, or CompTIA Security+

    are advantageous.

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You

kalyani nagar, pune, maharashtra

Hyderabad, Telangana, India