Operation Engineer (SRE)

5 - 7 years

16 - 25 Lacs

Posted: 1 week ago | Platform: Naukri

Work Mode

Hybrid

Job Type

Full Time

Job Description

  • Strong Kubernetes expertise, including cluster management, scaling, and experience with Helm or the Kubernetes API.
  • Advanced knowledge of cloud-native tools (e.g., Prometheus, Istio, Terraform) and the ability to design and implement secure, scalable, and reliable infrastructure.
  • Proven contributions to automation, scalability, security best practices (IAM, network policies, secret management, container security, vulnerability management), and disaster recovery.
  • Experience designing resilient systems with redundancy and infrastructure optimization.
  • Strategic problem-solving skills, with a demonstrated application of SRE principles. This should be evident in the responsibilities and impact the candidate describes from previous roles.
  • Working knowledge of networking and system monitoring, plus some programming experience.

Essential Competencies:

* Collaborative Problem-Solving: Actively engage with team members to diagnose and resolve technical issues.

* Adaptive Learning: Demonstrate the ability to rapidly acquire knowledge and skills in unfamiliar areas to address emerging challenges.

* Proactive System Enhancement: Independently identify and present improvements for our roadmap that will optimize existing technological infrastructures.

* Proficiency in Linux Administration: Extensive experience with Debian systems.

* Kubernetes Expertise: Skilled in Kubernetes orchestration, preferably with Kops experience.

* Docker and AWS Proficiency: Solid background in utilizing Docker in AWS environments.

* Networking Acumen: Strong understanding and experience in network technologies and protocols.

* Alerting and Monitoring Skills: Competence in developing and managing monitoring and alerting systems.

* Programming Skills: Proficiency in one or more relevant programming languages such as Python or Go.

Desirable Additional Skills:

* Experience with Unified Logging: Familiarity with Graylog or similar platforms for log management.

* Distributed Systems Knowledge: Understanding of the complexities and management of distributed computing environments.

* Vulnerability Management: Experience in vulnerability scanning and monitoring.

* Prometheus for System Monitoring: Proficiency in using Prometheus for monitoring system performance.

* CI/CD Expertise: Experience with Continuous Integration and Continuous Deployment systems, such as CircleCI and ArgoCD.

* GitOps and Infrastructure as Code (IaC): Familiarity with GitOps principles and IaC practices.
