Jobs

Interviews
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home

Jobs

Home
>
Jobs in Pune
>
Talkmetakeme Software Solutions
>
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Talkmetakeme Software Solutions

7 - 10 years

15 - 22 Lacs

Pune

Posted:1 month ago| Platform:

Apply

Skills Required

ITOM DevOps Site Reliability Engineering Dynatrace Proficiency in monitoring networking fundamentals Servicenow Cloud Services Alerting Container Orchestration Bash Kubernetes and Docker scripting Terraform Capacity Planning Ansible CI/CD Splunk Linux/Unix Python

Work Mode

Work from Office

Job Type

Full Time

Job Description

The Role We are seeking a Site Reliability Engineer (SRE) to join our dynamic team responsible for the operational management of critical applications. This role involves leveraging tools like Dynatrace, Splunk for monitoring to ensure system reliability, performance, and scalability. The ideal candidate will have a strong background in SRE practices, automation, and a passion for improving system operations. Key Responsibilities Application Reliability: Ensure the reliability, availability, and performance of over 100 applications through proactive monitoring and incident management. Monitoring & Observability: Implement and maintain observability solutions using Dynatrace, creating dashboards and alerts to monitor system health and performance. IT Operations Management: Utilize ServiceNow ITOM for configuration management, incident response, and change management processes. Automation & Tooling: Develop automation scripts and tools to reduce manual tasks, improve deployment processes, and enhance system scalability. Incident Management: Lead the response to system incidents, perform root cause analysis, and implement preventive measures to avoid recurrence. Collaboration: Work closely with development, QA, and infrastructure teams to integrate reliability into the software development lifecycle. Capacity Planning: Analyze system performance data to forecast capacity needs and ensure systems can handle future growth. Key Requirements Experience: 5+ years in Site Reliability Engineering, DevOps, or related roles within large-scale enterprise environments. Technical Skills: Proficiency in monitoring tools like Dynatrace, ITOM platforms like ServiceNow, and scripting languages such as Python or Bash. Automation: Experience with infrastructure-as-code tools (e.g., Terraform, Ansible) and CI/CD pipelines. Operating Systems: Strong knowledge of Linux/Unix systems and networking fundamentals. Experience with Container Orchestration including Kubernetes and Docker Design and own Technical Solutions for broad or complex requirements with insightful and strategic approaches Prior experience deploying Cloud Services, Monitoring, Alerting, and Handling Escalations Experience supporting a High-Availability applications including SaaS environment. Charting new DevOps practices and a well-defined roadmap. Problem-Solving: Demonstrated ability to troubleshoot complex system issues and implement effective solutions. Communication: Excellent verbal and written communication skills, with the ability to collaborate across teams. Preferred Qualifications Certifications: Relevant certifications in SRE, DevOps, or cloud platforms (e.g., AWS, Azure, GCP). Cloud Experience: Familiarity with cloud-native architectures and services. Agile Methodologies: Experience working in Agile/Scrum environments. Why Join Us? Innovative Environment: Be part of a team that embraces innovation and continuous improvement. Career Growth: Opportunities for professional development and career advancement. Flexible Work: Hybrid work model supporting work-life balance. Impact: Play a crucial role in maintaining the reliability of services that impact millions of customers.

Mock Interview

Practice Video Interview with JobPe AI

Start Itom Interview Now

My Connections Talkmetakeme Software Solutions

Download Chrome Extension (See your connection in the Talkmetakeme Software Solutions )

Download Now

Talkmetakeme Software Solutions

talkmetakeme.com

Software Development

Tech City

50-100 Employees

11 Jobs

Key People

Alice Johnson

CEO
Bob Smith

CTO
Charlie Brown

Head of Marketing

Login to

Please Verify Your Phone or Email

Confirm Action

Search

Profile

Upskill and Grow with AI

Site Reliability Engineer (SRE)

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

More Jobs at Talkmetakeme Software Solutions

Mock Interview

Download Chrome Extension (See your connection in the Talkmetakeme Software Solutions )

RecommendedJobs for You

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

Search

Profile

Upskill and Grow with AI

Personal Settings

Site Reliability Engineer (SRE)

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

More Jobs at Talkmetakeme Software Solutions