Posted:1 week ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

You are an experienced and proactive DevOps Lead with over 5 years of hands-on experience in DevOps, Site Reliability Engineering, or Systems Administration. In this role, you will spearhead infrastructure and DevOps initiatives, guiding a team of DevOps engineers, architecting scalable infrastructure, and driving continuous improvement in CI/CD, monitoring, and security practices. Your strategic involvement will align infrastructure reliability with business needs and foster collaboration between development and IT operations teams. Your responsibilities include leadership and strategy, where you will lead and mentor the DevOps team, define and implement best practices, and collaborate with engineering and product teams for scalable solutions. You will take ownership of system architecture, providing guidance on infrastructure design and performance optimization. In terms of infrastructure and system management, you will oversee provisioning, configuration, and maintenance of software, hardware, and networks across cloud and on-prem environments. You will manage cross-platform environments including Linux, Windows, and macOS, driving improvements in infrastructure automation and development environments. Additionally, you will oversee the configuration and performance of web servers (e.g., Apache, Nginx) and databases (e.g., MySQL, MongoDB). For DevOps and CI/CD, you will architect and manage robust CI/CD pipelines using tools like Jenkins, GitHub Actions, or similar. You will lead initiatives around infrastructure as code, containerization (Docker), and hybrid/multi-cloud deployments (AWS preferred), while establishing standards for version control, deployment workflows, and release management. Your role also involves guiding teams in adopting scalable, repeatable deployment strategies. In monitoring, reliability, and performance, you will set up and manage monitoring, alerting, and observability systems (e.g., Uptime Kuma, Prometheus, Grafana). You will lead incident response, root cause analysis, and efforts to improve overall system reliability. Furthermore, you will define and track key metrics for uptime, system health, and performance. Regarding security and compliance, you will own and enforce infrastructure security policies, oversee backup strategies, disaster recovery planning, and compliance initiatives, and ensure secure DevOps practices in collaboration with other teams. Preferred qualifications and skills for this role include 5+ years of relevant experience, proven leadership or mentoring experience, deep understanding of Linux-based systems and cloud platforms (especially AWS), strong expertise in CI/CD pipelines, Git workflows, and infrastructure as code, proficiency in managing web servers and backend stacks (PHP, Node.js, etc.), hands-on experience with containerization and orchestration tools (Docker, Kubernetes is a plus), experience with monitoring and logging frameworks, excellent problem-solving skills, a collaborative mindset, and awareness of emerging DevOps trends and technologies with the ability to recommend and implement forward-looking solutions.,

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You

Hyderabad, Telangana, India

Bengaluru, Karnataka

hyderabad, telangana