7 - 12 years
32 - 45 Lacs
                                Posted:3 days ago|
                                Platform:
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                                
                            
Hybrid
Full Time
 As a Cloud Site Reliability Engineer at our company, you will play a critical role in ensuring the robustness,  
 performance, and security of our cloud-based systems. Your focus will be on maintaining and improving our   cloud infrastructure with a special emphasis on cloud security and observability. You will work closely with   development teams to architect, deploy, and optimize systems that are not only reliable but also resilient and  secure.    Handle SRE operational duties including responding to pull requests and ensuring smooth continuous  integration and delivery processes.  Maintain and fine-tune applications for optimal performance, ensuring they meet specified  requirements.   Explore and experiment with new technologies through Proof-of-concepts to enhance existing  functionalities or discover new opportunities.   Automate deployment, configuration, and operational processes to improve efficiency and  accuracy.   Collaborate with development teams to guide system architecture and design, focusing on reliability,  efficiency, and scalability.   Implement and manage observability tools such as Grafana, Prometheus, and New Relic to ensure all  critical services are monitored effectively.    Develop custom reliability tools and frameworks for use by engineering teams.   Participate in an on-call rotation for critical systems, lead incident responses, and conduct thorough   post-mortem analyses.   Drive system and process efficiencies including capacity planning, configuration management,   performance tuning, monitoring, and root cause analysis.   Act as a consultant within the organization for best practices in infrastructure management and assist   teams in effective infrastructure utilization.    Experience with state machines such as AWS Step Functions or Azure Logic Apps.   Deep knowledge in telemetry and observability; experience with Prometheus, OpenTelemetry, or   DynaTrace is highly desirable.    Proficiency in Kubernetes with CKA/CKAD certification being advantageous.   Expertise in Terraform, with experience in setting up pipelines for multi-environment deployments.   Good programming skills in high-level languages, with a preference for Python. Go, or any other   compiled languages is an advantage.  familiarity with Observability tools like Grafana, Prometheus, and New Relic.   Strong project management and organizational skills.   An open mindset with the ability to quickly adapt to new technologies and learning practices.   About Cloud Native Engineering   The Cloud Native Engineering Practice is an organization of engineers who work with our production services   throughout their entire life cycle, from design and architecture, through implementation, deployment, and   sustaining operation.SREs delivers important system properties: reliability, performance, efficiency, and   scalability, for the products and platforms that our customers use every day.   SREs work in high-performance squads with expertise on large scale system reliability and in-depth   understanding of critical business components architecture, as well as dedicated engineering teams building   comprehensive tools, platform and infrastructure.     
 
    
 
                Xebia It Architects
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
 
        Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
 
            
         
                        Practice Golang coding challenges to boost your skills
Start Practicing Golang Now 
    hyderabad, gurugram, bengaluru
32.5 - 45.0 Lacs P.A.
ludhiana, chandigarh, pune
14.0 - 18.0 Lacs P.A.
hyderabad, chennai, bengaluru
Experience: Not specified
3.75 - 7.5 Lacs P.A.
ahmedabad
10.8 - 15.0 Lacs P.A.
0.5 - 0.6 Lacs P.A.
noida
20.0 - 30.0 Lacs P.A.
hyderabad, chennai, bengaluru
2.0 - 6.0 Lacs P.A.
pune, mysuru, bengaluru
12.0 - 18.0 Lacs P.A.
0.5 - 0.6 Lacs P.A.
8.0 - 12.0 Lacs P.A.