Senior Site Reliability Engineer

7 - 9 years

10 - 12 Lacs

Posted: 2 months ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? If yes, come join our team and develop your career. The Senior Site Reliability Engineer will analyze chronic and major issues, evaluate products and their services, make recommendations to improve service outcomes, design solutions in partnership with product, engineering, and architecture teams, and build, test, and operationalize tools and applications to improve customer experience and reduce costs. Additionally, the Senior Site Reliability Engineer will provide oversight and coaching to engineers and serve as an escalation point for our global command center engineers.

About the Role:

In this opportunity as Senior Site Reliability Engineer, you will be responsible for:

Operational Excellence: Drive the implementation of best practices for reliability, scalability, and performance across our systems and services. Establish and monitor key metrics to ensure uptime, availability, and response times meet or exceed SLAs. Lead the work to drive efficiencies and reduce service operations risks. Lead the research of new capabilities, testing new solutions and recommending and implementing new technologies to improve customer experience and reduce costs.

System Architecture: Collaborate with cross-functional teams to design, build, and maintain scalable and resilient architectures for our cloud-based infrastructure and applications. Identify opportunities for optimization and efficiency improvements. Solve intractable problems and devise solutions to improve the products and services we offer our customers.

DevOps Practices: Promote and implement DevOps principles and practices to streamline software delivery, automate infrastructure provisioning, and improve deployment processes. Collaborate with development teams to integrate SRE practices into the software development lifecycle.
Automation and Tooling: Champion the use of automation and tooling to streamline operational workflows, increase efficiency, and reduce manual toil. Drive the development of monitoring, alerting, and automation solutions to proactively identify and remediate issues.

Continuous Improvement: Promote a culture of continuous improvement by fostering innovation, experimentation, and learning within the team. Encourage knowledge sharing and professional development to enhance technical skills and expertise.

About You:

You're a fit for the role of Senior Site Reliability Engineer if:

You have 7+ years of experience with cloud technologies, services, use of their APIs, and configuration tools (e.g., AWS, Azure, GCP).
You have strong problem-solving and analytical skills, with a proactive approach to identifying and resolving complex technical issues.
You are proficient in DevOps practices and methodologies, with hands-on experience in CI/CD pipelines, configuration management, and infrastructure as code.
You use AI/ML tools to help improve service and reduce costs, and have worked with AI-Operations solutions.
You are familiar with programming languages such as Python, Java, and C#.
You have designed and supported scalable systems and services.
You are able to demonstrate ownership of accountabilities.
You are proficient with networking, Windows, Linux, containers, PostgreSQL, or related infrastructure services at scale.
You can automate tasks to improve service operations and support.
You use configuration management tools to manage configuration at scale.
You apply the scientific method to system components to identify improvements.
You are proficient in observability tools such as Datadog or New Relic.
You are proficient in data analysis from sources such as SQL, S3, Athena, etc.

Thomson Reuters

Information Services

Toronto
