DevOps/SRE Engineer - Python/Terraform

7 - 12 years

32 - 45 Lacs

Posted: 3 days ago | Platform: Naukri


Work Mode

Hybrid

Job Type

Full Time

Job Description

As a Cloud Site Reliability Engineer at our company, you will play a critical role in ensuring the robustness, performance, and security of our cloud-based systems. Your focus will be on maintaining and improving our cloud infrastructure, with a special emphasis on cloud security and observability. You will work closely with development teams to architect, deploy, and optimize systems that are not only reliable but also resilient and secure.

Responsibilities

- Handle SRE operational duties, including responding to pull requests and ensuring smooth continuous integration and delivery processes.
- Maintain and fine-tune applications for optimal performance, ensuring they meet specified requirements.
- Explore and experiment with new technologies through proofs of concept to enhance existing functionality or discover new opportunities.
- Automate deployment, configuration, and operational processes to improve efficiency and accuracy.
- Collaborate with development teams to guide system architecture and design, focusing on reliability, efficiency, and scalability.
- Implement and manage observability tools such as Grafana, Prometheus, and New Relic to ensure all critical services are monitored effectively.
- Develop custom reliability tools and frameworks for use by engineering teams.
- Participate in an on-call rotation for critical systems, lead incident responses, and conduct thorough post-mortem analyses.
- Drive system and process efficiencies, including capacity planning, configuration management, performance tuning, monitoring, and root cause analysis.
- Act as a consultant within the organization on best practices in infrastructure management, and assist teams in using infrastructure effectively.

Qualifications

- Experience with state machines such as AWS Step Functions or Azure Logic Apps.
- Deep knowledge of telemetry and observability; experience with Prometheus, OpenTelemetry, or Dynatrace is highly desirable.
- Proficiency in Kubernetes; CKA/CKAD certification is an advantage.
- Expertise in Terraform, with experience in setting up pipelines for multi-environment deployments.
- Good programming skills in a high-level language, preferably Python; experience with Go or another compiled language is an advantage.
- Familiarity with observability tools such as Grafana, Prometheus, and New Relic.
- Strong project management and organizational skills.
- An open mindset, with the ability to adapt quickly to new technologies and learning practices.

About Cloud Native Engineering

The Cloud Native Engineering Practice is an organization of engineers who work with our production services throughout their entire life cycle, from design and architecture through implementation, deployment, and sustained operation. SREs deliver important system properties (reliability, performance, efficiency, and scalability) for the products and platforms that our customers use every day. SREs work in high-performance squads with expertise in large-scale system reliability and an in-depth understanding of the architecture of critical business components, as well as in dedicated engineering teams that build comprehensive tools, platforms, and infrastructure.


Xebia IT Architects

IT Services and IT Consulting

Atlanta, Georgia
