Site Reliability Operator (24x7 role): Hyderabad

4 - 8 years

8 - 16 Lacs

Posted: 1 month ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

NOTE: Interview mode: face to face at Hyderabad.

SaaS Engineering (Oracle Cloud Service Center) is looking for an ideal candidate who can handle:

• Technical resolution of service issues.
• Automation of day-to-day operations work.
• Troubleshooting: have a deep understanding of SaaS services and their dependencies in order to respond quickly and efficiently to major incidents and minimize service disruptions when they occur.
• Identifying the processes that become bottlenecks in operations management and resolving them through process improvement and automation.
• Staying informed of new technologies and innovating.

Oracle Cloud Service Center

You will be joining the OCSC (Oracle Cloud Service Center) as an SRO (Site Reliability Operator). Your role will be to help Oracle ensure the availability of cloud services 24x7x365. You will leverage excellence in communication, technical/business analysis, problem solving, and attention to detail to methodically resolve issues.

We are responsible for preventing customer-impacting events. When incidents do impact customers, we are accountable for solving those issues quickly while working to ensure they do not happen again.

We offer unique opportunities for smart, hands-on engineers with the expertise and passion to solve difficult problems in distributed, highly available services and virtualized/containerized infrastructure. As a member of the Oracle Cloud Service Center team, you will be surrounded by helpful individuals representing some of the brightest and most innovative minds in the industry. You will be part of an organization that prides itself on providing training, empowerment, and career progression.

OCSC Mission: Safeguard all SaaS services by preventing problems and, when problems do occur, solving them quickly and completely. The Oracle Cloud Service Center monitors and responds to service events that impact our SaaS customers' ability to use their services.

Responsibilities

Your Role/Opportunity: An opportunity for a Site Reliability Operator (SRO) who will ensure the availability and resiliency of our cloud services 24x7x365. The ideal candidate will have a pulse on Oracle's SaaS services and be accountable for the troubleshooting and resolution of service issues. Additionally, you will have the opportunity to create future automation and tooling that will allow us to continuously improve our service. Your role in driving improvements in availability, effort, and velocity will delight our customers while reducing the cost of operations.

What You'll Do:

• Technical resolution of service issues.
• Automation of day-to-day operations work.
• Troubleshooting: have a deep understanding of our services and their dependencies in order to respond quickly and efficiently to major incidents and minimize service disruptions when they occur.
• Identify the processes that become bottlenecks in operations management and resolve them through process improvement and automation.
• Stay informed of new technologies and innovate.
• Ownership: understand internal team processes and ensure compliance with them.
• Administer production servers/services and test system health.
• Offer mitigation paths to accelerate system recovery.
• Work with system monitoring and alerting tools to identify the source of trouble.
• Execute defined SOPs to avoid or reduce the duration of event impact.
• Undertake Incident Command training and gain experience working on an on-call rotation.

Our ideal candidate:

• BS degree in CS, EE, or equivalent.
• 4+ years of work experience supporting production services.
• Strong knowledge of Unix/Linux/Windows operating systems.
• Application support experience in a SaaS (software as a service) environment.
• Strong knowledge of SQL (writing queries, joins, views, hints, etc.); able to execute, debug, and test SQL programs.
• Excellent working and troubleshooting experience with application middleware (Tomcat, WebLogic Server).
• Knowledge of Oracle Database administration.
• Understanding of API calls and web services (e.g., RESTful, SOAP).
• Exposure to DevSecOps tools.
• Demonstrable experience in one or more scripting/programming languages: Python, Java, shell.
• Strong communication and analytical skills.
• Understanding of virtualization and containerized solutions (Docker, Kubernetes) and cloud services.
• Able to work as part of a shift in a 24x7x365 operations team.
• Understanding of monitoring and log management tools and dashboards (Prometheus, Grafana, Kibana, or equivalents such as AppD or Splunk).
• Excellent problem-solving skills.
• Technical background with an ability to troubleshoot issues impacting large-scale service architectures and application stacks.
• Handles complex problems with a positive "can do" attitude.
• Team player, able to work with others at all skill levels.

Oracle

Information Technology

Redwood City

135,000 Employees

5543 Jobs

Key People

• Safra Catz, CEO
• Larry Ellison, Co-Founder & CTO
