Posted: 2 days ago
Platform:
Hybrid
Full Time
Lead and mentor a team of SRE engineers, fostering a culture of reliability, efficiency, and continuous improvement.
Develop and execute SRE strategies to enhance the reliability, availability, and performance of our systems and services.
Design and implement observability and monitoring solutions using tools like New Relic, Azure Application Insights, AWS X-Ray, and other relevant technologies.
Establish and maintain alerting systems to proactively identify and address potential issues before they impact our customers.
Collaborate with cross-functional teams, including development, operations, and security, to build and maintain resilient and scalable systems.
Drive initiatives for automating operational processes, reducing manual interventions, and enhancing system performance.
Provide leadership in incident response, ensuring swift resolution of issues and effective post-incident reviews to prevent recurrence.
Stay current with industry trends and advancements in site reliability engineering, applying best practices to continually improve our operations.
Promote a data-driven approach to decision-making, leveraging observability data to identify opportunities for optimization and innovation.
Qualifications:
5-10 years of experience in site reliability engineering, infrastructure management, or a related field.
Proven experience in leading and mentoring engineering teams, with a focus on reliability and performance.
Strong strategic thinking skills, with the ability to develop and execute SRE strategies aligned with business goals.
Expertise in observability and monitoring tools such as New Relic, Azure Application Insights, AWS X-Ray, or similar platforms.
Experience with cloud platforms, particularly AWS and Azure, and a strong understanding of their services and capabilities.
Hands-on experience with alerting systems, incident management, and performance optimization.
Strong scripting and automation skills (e.g., Python, Bash, PowerShell) to automate and streamline operations.
Excellent problem-solving skills with a proactive and analytical approach to identifying and resolving issues.
Effective communication skills, with the ability to convey technical concepts to both technical and non-technical audiences.
Relevant certifications (such as AWS Certified Solutions Architect, Azure Administrator Associate, or similar) are a plus but not required.
Apex One