9 - 13 years

0 Lacs

Posted: 2 days ago | Platform: Shine

Work Mode

On-site

Job Type

Full Time

Job Description

As an SRE Engineer, you will design, implement, and maintain scalable and reliable systems, and automate operational processes to reduce manual intervention. You will proactively monitor system performance to ensure reliability and availability, and collaborate closely with development, operations, and product teams so that new features and services are integrated and deployed smoothly. You will also develop and maintain clear documentation for system configurations, processes, and troubleshooting guidelines, participate in on-call rotations, conduct root cause analysis for timely incident resolution, and drive initiatives that enhance system reliability, performance, scalability, and efficiency.

Key Responsibilities:
- Design, implement, and maintain scalable and reliable systems
- Automate operational processes to reduce manual intervention
- Proactively monitor system performance for optimal reliability and availability
- Collaborate with development, operations, and product teams for smooth integration and deployment
- Develop and maintain clear documentation for system configurations and troubleshooting guidelines
- Participate in on-call rotations for quick incident response and root cause analysis
- Drive initiatives to enhance system reliability, performance, scalability, and efficiency

Qualifications:
- Proven experience as an SRE or in a similar role, with strong skills in system administration, networking, and software development
- Proficiency in scripting languages such as Python, Go, or Bash
- In-depth knowledge of the AWS, Azure, and GCP cloud platforms
- Strong experience with Docker and Kubernetes for containerization
- Expertise in observability tools such as Prometheus, Grafana, and the ELK stack for system monitoring
- Excellent analytical and problem-solving skills for resolving technical challenges
- Strong communication and teamwork abilities for cross-functional collaboration
- Experience with Terraform, Ansible, or similar tools for managing infrastructure
- Familiarity with Jenkins and GitLab CI/CD for automation and deployment
- Understanding of security principles, best practices, and compliance standards in a cloud-native environment

In this role, your primary skills will include:
- Site Reliability Engineering (SRE)
- Python
- Cloud (AWS, Azure, GCP)
- Docker / Kubernetes
- Grafana

Your secondary skills will include:
- Jenkins
- GitLab
