Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in pune
>
Apex One
>
SRE (Site Reliability Engineer)

SRE (Site Reliability Engineer)

Apex One

3 - 7 years

13 - 17 Lacs

pune

Posted:3 months ago| Platform:

Apply

Skills Required

devops software development cycle python computer systems aws kubernetes c++ golang site reliability engineering docker ansible mesos java git gcp oops jenkins prometheus sre microsoft azure javascript ruby ceph splunk terraform nfs yarn

Work Mode

Work from Office

Job Type

Full Time

Job Description

Job OverviewWe are looking for a detail-oriented and experienced Site Reliability Engineer to join our team. The Site Reliability Engineer will be responsible for creating and implementing scalable software solutions in order to meet system and application performance goals. You will also be responsible for troubleshooting system errors and resolving any relevant issues.

Roles And Responsibilities

System Monitoring and Incident Response: for implementing monitoring solutions to track system health,performance, and availability. They proactively monitor systems, identify issues, and respond to incidentspromptly, working to minimize downtime and mitigate impacts.Post-Incident Analysis: Led incident response efforts, coordinated with cross-functional teams, andconducted post-incident analysis to identify root causes and implement preventive measures.Continuous Improvement and Reliability Engineering: SREs drive continuous improvement efforts byidentifying areas for enhancement, implementing best practices, and fostering a culture of reliabilityengineering. They participate in post-mortems, conduct blameless retrospectives, and drive initiatives toimprove system reliability, stability, and maintainability.Collaboration and Knowledge Sharing: SREs collaborate closely with software engineers, operations teams,and other stakeholders to ensure smooth coordination and effective communication. They share knowledge,provide technical guidance, and contribute to the development of a strong engineering culture.Support and maintain configuration management for various applications and systemsImplement comprehensive service monitoring, including dashboards, metrics, and alertsDefine, measure, and meet key service level objectives, such as uptime, performance, incidents, and chronicproblemsPartner with application and business stakeholders to ensure high quality product development and releaseCollaborate with the development team to enhance system reliability and performance.

Qualifications

Bachelors degree in Information Technology, Computer Science, or related field.Strong knowledge of software development processes and procedures.Strong problem-solving abilities.Excellent understanding of computer systems, servers, and network systems.Ability to work under pressure and manage multiple tasks simultaneously.Strong communication and interpersonal skills.Strong knowledge of coding languages like Python, Java, Go, etc.Ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C/C++,Ruby, and JavaScriptExperience with distributed storage technologies such as NFS, HDFS, Ceph, and Amazon S3, as well as dynamicresource management frameworks (Apache Mesos, Kubernetes,Yarn)Experience with cloud computing platforms such as AWS, Azure, or Google CloudExperience with DevOps tools such as Git, Jenkins, Ansible, Terraform, Docker, etc.Experience with monitoring tools such as Splunk, PrometheusSkills: problem solving,post-incident analysis,aws,monitoring tools,cloud computing,key service level objectives,reliability engineering,configuration management,devops practices,coding languages,monitoring tools (splunk, prometheus),continuous improvement,site reliability engineering,service monitoring,incident response,reliability,software development processes,system monitoring,splunk,devops tools (git, jenkins, ansible, terraform, docker),kubernetes,cloud computing (aws, azure, google cloud),devops,ansible,programming (python, java, go, c/c++, ruby, javascript),prometheus,cloud infrastructure,monitoring servicesKeywordscloud computing,splunk,prometheus,software development processes,system monitoring,devops tools,git,jenkins,ansible,terraform,docker,python,java,go,c/c++,ruby,javascript,Site Reliability Engineering*Mandatory Key Skillscloud computing,splunk,prometheus,software development processes,system monitoring,devops tools,git,jenkins,ansible,terraform,docker,python,java,go,c/c++,ruby,javascript,Site Reliability Engineering*

More Jobs at Apex One

Backend Developer(Nodejs)

Hyderabad, Bengaluru

4 - 8 yrs

INR 15 - 25 Lacs

Customer Care Advisor (UK Shift-Hybrid)

Pune

1 - 3 yrs

INR 5 - 9 Lacs

Engineering Manager(GenAI,JAVA, AI/ML, AWS,Saas is Must)

Ahmedabad, Gujarat, India

Experience: Not specified

Salary: Not disclosed

Product Marketing Manager(Saas is must)

Vadodara, Gujarat, India

Experience: Not specified

Salary: Not disclosed

Engineering Manager(GenAI,JAVA, AI/ML, AWS,Saas is Must)

Vadodara, Gujarat, India

Experience: Not specified

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Apex One

Technology Solutions

Tech City

Login to

Please Verify Your Phone or Email

Confirm Action

SRE (Site Reliability Engineer)