Site Reliability Engineering Manager, Cloud Efficiency and Performance

4 - 8 years

0 Lacs

Posted:2 weeks ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Site Reliability Engineering Manager at ThousandEyes, you will lead the Efficiency and Performance SRE team based in Bangalore. Your responsibilities will include developing and maintaining Resource Intelligence initiatives such as cloud cost reporting, management, efficiency indicators, and vendor engagement. Additionally, you will work on improving resource- and performance-engineering tooling, protocols, compliance automation, and participate in the global follow-the-sun SRE on-call rotation. Your expertise in areas like AWS IAM, cost and resource tooling, capacity planning, and Infrastructure as Code using Terraform will be crucial for this role. Leading and inspiring a talented SRE team will be a key part of your role, where you will foster a culture of innovation, collaboration, and excellence. You will be responsible for driving the strategic vision for managing cloud-based infrastructure resourcing and reporting systems. Clear communication with leadership, stakeholders, vendors, infrastructure, product, and production engineering teams will be essential to ensure transparency and accountability of resource utilization. Staying current with industry best practices, tooling, and automation will be important to enhance the platform and systems. You will drive operational excellence in operations and security processes while mentoring and developing engineering talent within the Efficiency and Performance SRE team. Your role will also involve collaborating with multiple teams to execute shared goals and formulate technical strategies and roadmaps. To qualify for this role, you should have experience leading a team of 4+ engineers and a total of 4+ years of experience in building and supporting mission-critical services with a focus on automation, availability, and performance. Experience in public cloud environments, incident response processes, and technical leadership is required. You should be able to provide a strong technical vision for your team, balance tactical needs with strategic growth, and have worked on large-scale distributed systems with multi-tiered architecture.,

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Cisco logo
Cisco

Software Development

San Jose CA