Senior Site Reliability Development Engineer

8 - 13 years

16.0 - 30.0 Lacs P.A.

Bengaluru

Posted:2 weeks ago| Platform: Naukri logo

Apply Now

Skills Required

GolangJavaGCPPythonKubernetesDockerGithubMySQL

Work Mode

Work from Office

Job Type

Full Time

Job Description

Location: Bengaluru Experience: 8+ Years Our stack includes the following key technologies (most are open source and related to the Cloud Native Computing Foundation CNCF): Python, Java/Kotlin, Golang Kubernetes, Docker, Helm, Crossplane Github Actions, ArgoCD Cassandra, Postgres, MySQL, Redis Google Cloud Platform (GCP) Job Title: Site Reliability Development Engineer We are seeking a skilled and hands-on Site Reliability Development Engineer to join our team. You will play a key role in building and maintaining highly reliable systems, ensuring seamless customer experiences through robust and efficient solutions. Key Responsibilities: 1. System Reliability and Performance: Assist in designing and implementing solutions for system reliability and performance. Support the adoption of technologies and methodologies to enhance system resilience and scalability. Contribute to projects, ensuring alignment with organizational goals. 2. Hands-On Engineering and Innovation: Engage in hands-on coding, contributing to critical system components, observability as code, and automation tools. This will comprise roughly 75% of your time. Experiment with new approaches to improve system reliability, performance, and efficiency. Stay updated with SRE practices, integrating new ideas and technologies into our systems. 3. Team Collaboration and Development: Collaborate with team members, fostering a culture of excellence and continuous learning. Share expertise and insights to elevate the technical capabilities of the team. Demonstrate best practices in coding, system design, and operational excellence. 4. Incident Management and Prevention: Participate in incident management processes, ensuring rapid and effective resolution. Analyze system performance and incidents to identify trends and areas for improvement. Assist in developing and implementing strategies to prevent system failures and enhance overall reliability. Promote a blameless culture of continuous learning and growth. What makes you successful: Experience as a Software Development Engineer, Site Reliability Engineer, or in a similar role. Familiarity with SRE tools, technologies, and practices including Kubernetes and GitOps-driven pipelines. Strong problem-solving skills and the ability to design and implement systems. Proficiency in programming and automation with a hands-on approach. Good communication and collaboration skills, with the ability to work effectively in a team. Preferred bonus skills: GitHub Actions, Helm, Argo CD, Crossplane, Datadog, Google Cloud Platform, Go, Python, Functional Programming, Test Driven Design, Observability Driven Design, Pair Programming.

RecommendedJobs for You