5.0 - 9.0 years
0 Lacs
Karnataka
On-site
As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability and uptime of critical services for our client's team. Your primary responsibilities will revolve around Kubernetes administration, CentOS server management, Java application support, incident handling, and change management. The ideal candidate for this role should have a solid background in ArgoCD for Kubernetes management, Linux proficiency, basic scripting skills, and familiarity with modern monitoring, alerting, and automation tools. We are seeking a self-motivated individual with strong communication skills, both verbal and written, who can work effectively both independently and collaboratively. Your daily tasks will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will be responsible for conducting routine system and application maintenance tasks following standard operating procedures to prevent and resolve issues promptly. Additionally, you will be in charge of responding to and managing incidents, facilitating post-mortem meetings, conducting root cause analysis, and ensuring timely issue resolution. Furthermore, you will monitor production systems, applications, and overall performance, utilizing tools to detect abnormal behaviors in software and collect relevant information for developers to understand and address the underlying causes. Security checks, policy and procedure documentation, script/code writing for tool and service development, post-mortem learning, and administration work on tools like JIRA and New Relic are also part of your responsibilities. In terms of technical skills, you should have at least 5 years of experience in a SaaS and Cloud environment. 
Proficiency in Kubernetes cluster administration, Linux scripting, database systems (MySQL, DB2), Linux (CentOS / RHEL) administration, change management procedures, on-call responsibilities, deployment management using Jenkins, monitoring tools (e.g., New Relic, Splunk, Nagios), log aggregation tools (e.g., Splunk, Loki, Grafana), and scripting knowledge in at least one language is essential. Experience with API programming and integrating tools such as Jira, Slack, and xMatters/PagerDuty will be advantageous for this role.
Posted 1 month ago
5.0 - 9.0 years
0 Lacs
Karnataka
On-site
You will be joining our client's team as a Site Reliability Engineer, where your main responsibility will be ensuring the reliability and uptime of critical services. This will involve a strong focus on Kubernetes administration, CentOS servers, Java application support, incident management, and change management. The ideal candidate for this role will have strong experience with ArgoCD for Kubernetes management, Linux skills, basic scripting knowledge, and familiarity with modern monitoring, alerting, and automation tools. We are looking for someone who is self-motivated, possesses excellent communication skills (both oral and written), and can work both independently and collaboratively. Your main tasks will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will also be responsible for conducting routine tasks for system and application maintenance, following SOPs to correct and prevent issues. In addition, you will respond to and manage running incidents, conduct post-mortem meetings, perform root cause analysis, and ensure timely resolution. Furthermore, you will be monitoring production systems, applications, and overall performance, using tools to detect abnormal behaviors in the software and collect information to help developers understand the root causes of problems. Security checks, running meetings with business partners, writing and maintaining policy and procedure documents, writing scripts or code as necessary to develop tools and services, and learning from post-mortems to prevent new incidents are also part of your responsibilities. 
Technical skills required for this role include 5+ years of experience working in a SaaS and Cloud environment, administration of Kubernetes clusters with ArgoCD, Linux scripting for automation, experience with database systems like MySQL and DB2, Linux administration skills, understanding of change management procedures, on-call responsibilities, experience with managing deployments using Jenkins, and familiarity with monitoring tools like New Relic, Splunk, and Nagios. Additionally, experience with log aggregation tools like Splunk, Loki, or Grafana, strong scripting knowledge in at least one language, and experience with API programming and integrating tools such as Jira, Slack, and xMatters/PagerDuty are preferred. This is an exciting opportunity for a motivated individual with the right skill set to make a significant impact on our client's team.
Posted 1 month ago
5.0 - 9.0 years
0 Lacs
Karnataka
On-site
You will be joining our client's team as a Site Reliability Engineer, where your main responsibility will be to ensure the reliability and uptime of critical services. Your focus will include Kubernetes administration, CentOS servers, Java application support, incident management, and change management. The ideal candidate for this role will have strong experience with ArgoCD for Kubernetes management, Linux skills, basic scripting knowledge, and familiarity with modern monitoring, alerting, and automation tools. We are looking for a self-motivated individual with excellent communication skills, both oral and written, who can work effectively both independently and collaboratively. Your responsibilities will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will be conducting routine tasks for system and application maintenance and following SOPs to correct or prevent issues. Responding to and managing running incidents, including post-mortem meetings, root cause analysis, and timely resolution will also be part of your responsibilities. Additionally, you will be monitoring production systems, applications, and overall performance, using tools to detect abnormal behaviors in the software and collecting information to help developers understand the issues. Security checks, running meetings with business partners, writing and maintaining policy and procedure documents, writing scripts or code as necessary, and learning from post-mortems to prevent new incidents are also key aspects of the role. 
Technical skills required for this position include:
- 5+ years of experience in a SaaS and Cloud environment
- Administration of Kubernetes clusters, including management of applications using ArgoCD
- Linux scripting to automate routine tasks and improve operational efficiency
- Experience with database systems like MySQL and DB2
- Experience as a Linux (CentOS / RHEL) administrator
- Understanding of change management procedures and enforcement of safe and compliant changes to production environments
- Knowledge of on-call responsibilities and maintaining on-call management tools
- Experience with managing deployments using Jenkins
- Prior experience with monitoring tools like New Relic, Splunk, and Nagios
- Experience with log aggregation tools such as Splunk, Loki, or Grafana
- Strong scripting knowledge in one of Python, Ruby, Bash, Java, or GoLang
- Experience with API programming and integrating tools like Jira, Slack, xMatters, or PagerDuty

If you are a dedicated professional who thrives in a high-pressure environment and enjoys working on critical services, this opportunity could be a great fit for you.
Posted 1 month ago