73 Kubernetes Administration Jobs

JobPe aggregates job listings for easy access; you submit your application directly on the original job portal.

6.0 - 10.0 years

9 - 13 Lacs

mumbai, navi mumbai

Work from Office

As a member of the Support organization, your focus is to deliver post-sales support and solutions to the Oracle customer base while serving as an advocate for customer needs. This involves resolving post-sales non-technical customer inquiries via phone and electronic means, as well as technical questions regarding the use of and troubleshooting for our Electronic Support Services. A primary point of contact for customers, you are responsible for facilitating customer relationships with Support and providing advice and assistance to internal Oracle employees on diverse customer situations and escalated issues.
As a Senior Systems Engineer, you will interface with the customer's IT staff on a regular basis. Either at the client's site or from a remote location, you will be responsible for resolution of moderately complex technical problems related to the installation, recommended maintenance, use, and repair/workarounds for Oracle products. You should be highly experienced in some Oracle products and several platforms that are being supported. You will be expected to work with only general guidance from management while advising management on progress/status. Job duties are varied and complex, utilizing independent judgment. May have project lead role. Because of substantial customer interfacing, a demonstrated ability to work with customers on an independent basis with exceptional communication skills, while consistently achieving the highest levels of customer satisfaction, is essential. A Bachelor's degree in Computer Science, Engineering or equivalent experience is preferred, with five years of related experience. Experience with Oracle's core products, applications, and tools is important.
Career Level - IC3
Responsibilities: ACS provides industry-leading expertise with the highest customer satisfaction to support the organization's business every step of the way.
Required Technical and Professional Expertise:
- Minimum 7 years of experience in Linux/Solaris server administration.
- In-depth knowledge of Supercluster, Exadata, and M8-8 server administration, patching, performance tuning, and related troubleshooting methodologies.
- PDOM/LDOM configuration; ZFS patching and configuration.
- Oracle PCA patching and administration.
- Project delivery for the Oracle Engineered Systems landscape (Exadata, ZDLRA, etc.), including the hardware landscape.
- In-depth knowledge of Supercluster Exadata features, capacity planning, tuning, and troubleshooting of any system components.
- Oracle Kubernetes administration.
- Ready and experienced to work in a 24x7 environment.
- Implement, enforce, and adhere to the Disaster Recovery plan for platforms.
- Participate in customer-driven projects.
- Perform implementation of Engineering Systems, management and support lifecycle management, monitoring, support, disaster recovery, compliance, and standards.
- Enable the customer to use the Engineering Systems appliance and leverage its features.
Added Advantage: Scripting knowledge.
Detailed Description and Job Requirements:
As a member of the Support organization, your focus is to deliver post-sales support and solutions to the Oracle customer base while serving as an advocate for customer needs. This involves resolving post-sales non-technical customer inquiries via phone and electronic means, as well as technical questions regarding the use of and troubleshooting for our Electronic Support Services. A primary point of contact for customers, you are responsible for facilitating customer relationships with Support and providing advice and assistance to internal Oracle employees on diverse customer situations and escalated issues.
As an Advanced Support Engineer, you will interface with the customer's IT staff on a regular basis. Either at the client's site or from a remote location, you will be responsible for resolution of moderately complex technical problems related to the installation, recommended maintenance, use, and repair/workarounds for Oracle products. You should be highly experienced in some Oracle products and several platforms that are being supported. You will be expected to work with only general guidance from management while advising management on progress/status. Job duties are varied and complex, utilizing independent judgment. May have project lead role. Because of substantial customer interfacing, a demonstrated ability to work with customers on an independent basis with exceptional communication skills, while consistently achieving the highest levels of customer satisfaction, is essential.

Posted 1 day ago


6.0 - 10.0 years

6 - 11 Lacs

mumbai, navi mumbai

Work from Office

As a member of the Support organization, your focus is to deliver post-sales support and solutions to the Oracle customer base while serving as an advocate for customer needs. This involves resolving post-sales non-technical customer inquiries via phone and electronic means, as well as technical questions regarding the use of and troubleshooting for our Electronic Support Services. A primary point of contact for customers, you are responsible for facilitating customer relationships with Support and providing advice and assistance to internal Oracle employees on diverse customer situations and escalated issues.
As a Senior Systems Engineer, you will interface with the customer's IT staff on a regular basis. Either at the client's site or from a remote location, you will be responsible for resolution of moderately complex technical problems related to the installation, recommended maintenance, use, and repair/workarounds for Oracle products. You should be highly experienced in some Oracle products and several platforms that are being supported. You will be expected to work with only general guidance from management while advising management on progress/status. Job duties are varied and complex, utilizing independent judgment. May have project lead role. Because of substantial customer interfacing, a demonstrated ability to work with customers on an independent basis with exceptional communication skills, while consistently achieving the highest levels of customer satisfaction, is essential. A Bachelor's degree in Computer Science, Engineering or equivalent experience is preferred, with five years of related experience. Experience with Oracle's core products, applications, and tools is important.
Career Level - IC3
Responsibilities: ACS provides industry-leading expertise with the highest customer satisfaction to support the organization's business every step of the way.
Required Technical and Professional Expertise:
- Minimum 7 years of experience in Linux/Solaris server administration.
- In-depth knowledge of Supercluster, Exadata, and M8-8 server administration, patching, performance tuning, and related troubleshooting methodologies.
- PDOM/LDOM configuration; ZFS patching and configuration.
- Oracle PCA patching and administration.
- Project delivery for the Oracle Engineered Systems landscape (Exadata, ZDLRA, etc.), including the hardware landscape.
- In-depth knowledge of Supercluster Exadata features, capacity planning, tuning, and troubleshooting of any system components.
- Oracle Kubernetes administration.
- Ready and experienced to work in a 24x7 environment.
- Implement, enforce, and adhere to the Disaster Recovery plan for platforms.
- Ensure Engineering System standards and controls are maintained.
- Provide 3rd and 4th level Oracle Engineering Systems support for complex issues.
- Participate in customer-driven projects.
- Perform implementation of Engineering Systems, management and support lifecycle management, monitoring, support, disaster recovery, compliance, and standards.
- Enable the customer to use the Engineering Systems appliance and leverage its features.
Added Advantage: Scripting knowledge.
Detailed Description and Job Requirements:
As a member of the Support organization, your focus is to deliver post-sales support and solutions to the Oracle customer base while serving as an advocate for customer needs. This involves resolving post-sales non-technical customer inquiries via phone and electronic means, as well as technical questions regarding the use of and troubleshooting for our Electronic Support Services. A primary point of contact for customers, you are responsible for facilitating customer relationships with Support and providing advice and assistance to internal Oracle employees on diverse customer situations and escalated issues.
As an Advanced Support Engineer, you will interface with the customer's IT staff on a regular basis. Either at the client's site or from a remote location, you will be responsible for resolution of moderately complex technical problems related to the installation, recommended maintenance, use, and repair/workarounds for Oracle products. You should be highly experienced in some Oracle products and several platforms that are being supported. You will be expected to work with only general guidance from management while advising management on progress/status. Job duties are varied and complex, utilizing independent judgment. May have project lead role. Because of substantial customer interfacing, a demonstrated ability to work with customers on an independent basis with exceptional communication skills, while consistently achieving the highest levels of customer satisfaction, is essential.

Posted 1 day ago


6.0 - 11.0 years

6 - 11 Lacs

mumbai, navi mumbai

Work from Office

As a member of the Support organization, your focus is to deliver post-sales support and solutions to the Oracle customer base while serving as an advocate for customer needs. This involves resolving post-sales non-technical customer inquiries via phone and electronic means, as well as technical questions regarding the use of and troubleshooting for our Electronic Support Services. A primary point of contact for customers, you are responsible for facilitating customer relationships with Support and providing advice and assistance to internal Oracle employees on diverse customer situations and escalated issues.
As a Senior Systems Engineer, you will interface with the customer's IT staff on a regular basis. Either at the client's site or from a remote location, you will be responsible for resolution of moderately complex technical problems related to the installation, recommended maintenance, use, and repair/workarounds for Oracle products. You should be highly experienced in some Oracle products and several platforms that are being supported. You will be expected to work with only general guidance from management while advising management on progress/status. Job duties are varied and complex, utilizing independent judgment. May have project lead role. Because of substantial customer interfacing, a demonstrated ability to work with customers on an independent basis with exceptional communication skills, while consistently achieving the highest levels of customer satisfaction, is essential. A Bachelor's degree in Computer Science, Engineering or equivalent experience is preferred, with five years of related experience. Experience with Oracle's core products, applications, and tools is important.
Responsibilities: ACS provides industry-leading expertise with the highest customer satisfaction to support the organization's business every step of the way.
Required Technical and Professional Expertise:
- Minimum 7 years of experience in Linux/Solaris server administration.
- In-depth knowledge of Supercluster, Exadata, and M8-8 server administration, patching, performance tuning, and related troubleshooting methodologies.
- PDOM/LDOM configuration; ZFS patching and configuration.
- Oracle PCA patching and administration.
- Project delivery for the Oracle Engineered Systems landscape (Exadata, ZDLRA, etc.), including the hardware landscape.
- In-depth knowledge of Supercluster Exadata features, capacity planning, tuning, and troubleshooting of any system components.
- Oracle Kubernetes administration.
- Ready and experienced to work in a 24x7 environment.
- Implement, enforce, and adhere to the Disaster Recovery plan for platforms.
- Ensure Engineering System standards and controls are maintained.
- Provide 3rd and 4th level Oracle Engineering Systems support for complex issues.
- Participate in customer-driven projects.
- Perform implementation of Engineering Systems, management and support lifecycle management, monitoring, support, disaster recovery, compliance, and standards.
- Enable the customer to use the Engineering Systems appliance and leverage its features.
Added Advantage: Scripting knowledge.
Detailed Description and Job Requirements:
As a member of the Support organization, your focus is to deliver post-sales support and solutions to the Oracle customer base while serving as an advocate for customer needs. This involves resolving post-sales non-technical customer inquiries via phone and electronic means, as well as technical questions regarding the use of and troubleshooting for our Electronic Support Services. A primary point of contact for customers, you are responsible for facilitating customer relationships with Support and providing advice and assistance to internal Oracle employees on diverse customer situations and escalated issues.
As an Advanced Support Engineer, you will interface with the customer's IT staff on a regular basis. Either at the client's site or from a remote location, you will be responsible for resolution of moderately complex technical problems related to the installation, recommended maintenance, use, and repair/workarounds for Oracle products. You should be highly experienced in some Oracle products and several platforms that are being supported. You will be expected to work with only general guidance from management while advising management on progress/status. Job duties are varied and complex, utilizing independent judgment. May have project lead role. Because of substantial customer interfacing, a demonstrated ability to work with customers on an independent basis with exceptional communication skills, while consistently achieving the highest levels of customer satisfaction, is essential.

Posted 1 day ago


6.0 - 10.0 years

9 - 13 Lacs

mumbai, navi mumbai

Work from Office

Responsibilities: ACS provides industry-leading expertise with the highest customer satisfaction to support the organization's business every step of the way.
Required Technical and Professional Expertise:
- Minimum 7 years of experience in Linux/Solaris server administration.
- In-depth knowledge of Supercluster, Exadata, and M8-8 server administration, patching, performance tuning, and related troubleshooting methodologies.
- PDOM/LDOM configuration; ZFS patching and configuration.
- Oracle PCA patching and administration.
- Project delivery for the Oracle Engineered Systems landscape (Exadata, ZDLRA, etc.), including the hardware landscape.
- In-depth knowledge of Supercluster Exadata features, capacity planning, tuning, and troubleshooting of any system components.
- Oracle Kubernetes administration.
- Ready and experienced to work in a 24x7 environment.
- Implement, enforce, and adhere to the Disaster Recovery plan for platforms.
- Participate in customer-driven projects.
- Perform implementation of Engineering Systems, management and support lifecycle management, monitoring, support, disaster recovery, compliance, and standards.
- Enable the customer to use the Engineering Systems appliance and leverage its features.
Added Advantage: Scripting knowledge.
Detailed Description and Job Requirements:
As a member of the Support organization, your focus is to deliver post-sales support and solutions to the Oracle customer base while serving as an advocate for customer needs. This involves resolving post-sales non-technical customer inquiries via phone and electronic means, as well as technical questions regarding the use of and troubleshooting for our Electronic Support Services. A primary point of contact for customers, you are responsible for facilitating customer relationships with Support and providing advice and assistance to internal Oracle employees on diverse customer situations and escalated issues.
As an Advanced Support Engineer, you will interface with the customer's IT staff on a regular basis. Either at the client's site or from a remote location, you will be responsible for resolution of moderately complex technical problems related to the installation, recommended maintenance, use, and repair/workarounds for Oracle products. You should be highly experienced in some Oracle products and several platforms that are being supported. You will be expected to work with only general guidance from management while advising management on progress/status. Job duties are varied and complex, utilizing independent judgment. May have project lead role. Because of substantial customer interfacing, a demonstrated ability to work with customers on an independent basis with exceptional communication skills, while consistently achieving the highest levels of customer satisfaction, is essential.

Posted 1 day ago


8.0 - 12.0 years

0 Lacs

karnataka

On-site

Role Overview: As a subject matter expert in technology, you will play a crucial role in modernizing the Cloud platform for Retail banking and integrated products at Finastra. Your primary focus will be on leveraging modern cloud technologies such as Azure, Azure DevOps, Containers, Kubernetes, and Service Mesh to improve monitoring, logging, developer efficiency, and continuous integration and deployments. Additionally, you will mentor junior team members to help them grow as technologists and be responsible for designing and managing DevOps infrastructure services at scale.
Key Responsibilities:
- Design and manage highly scalable, reliable, and fault-tolerant CI/CD pipeline infrastructure and networking for the SaaS-based Retail Banking Solution at Finastra.
- Enhance the monitoring and security posture of infrastructure and applications by implementing protective measures efficiently to achieve better ROI and TCO.
- Collaborate with Dev Engineering teams to automate application, infrastructure, and network processes and meet long-term business needs.
- Research and evaluate parallel products; define and govern application/infrastructure baselines.
- Communicate and collaborate effectively across distributed teams in a global environment.
- Implement toolsets to facilitate developers' use of Containers, Kubernetes, and Service Mesh.
- Develop tools for developers, operations, and release teams to utilize Kubernetes and Service Mesh seamlessly.
- Ensure platform security and monitoring using tools like Prometheus/Grafana and implement best practices.
- Deliver zero-defect and highly resilient code, exceeding availability and defect SLAs.
- Present technical solutions, capabilities, considerations, and features in business terms.
- Convert user stories into detailed development tasks.
- Communicate status, issues, and risks precisely and in a timely manner.
Qualifications Required:
- 8 to 10 years of hands-on experience in SaaS/IaaS with expertise in DevOps techniques and continuous integration solutions using Ansible, Bash, Docker, Git, Maven.
- Proficiency in Load Balancing, Rate Limiting, Traffic Shaping, and managing connectivity between Applications and Networks.
- Deep knowledge of Linux, container technologies (e.g., Docker), Terraform, Kubernetes administration, and cluster orchestrators/schedulers (Kubernetes, Mesos).
- Strong scripting and programming fundamentals in languages and frameworks like Spring, Python, Java, Ruby, etc.
- Understanding of distributed system fundamentals, interactive application development paradigm, memory management, performance optimization, database interactions, network programming, and more.
- Working knowledge of Oracle, DB2, PostgreSQL, or MongoDB databases.
- Experience with microservices architecture, RESTful services, CI/CD, and at least one cloud service provider or Kubernetes platform such as Azure AKS, AWS EKS, GCP GKE, OpenShift, Rancher, etc.
- Familiarity with Kubernetes controllers/operators, Docker, CKA or CKAD certification, and operational experience in deploying and managing Kubernetes.
- AZ-400 certification is a plus.

Posted 4 days ago


5.0 - 8.0 years

4 - 8 Lacs

bengaluru

Hybrid

Hi,
Required mandatory skills: Kubernetes administration with Rancher
CTC: 12 LPA
Location: Bangalore (onsite)
Mode: Hybrid (3 days WFO)
Type: C2H on Insightek payroll
If interested, send your CV to suniths@insightekgc.com
Role & responsibilities:
- Should have a minimum of 5-8 years of working experience in Docker/K8s and be a certified Kubernetes administrator.
- Sound knowledge of the MapR file system and cluster management.
- Configuring and troubleshooting issues with MapR.
- Preferably from a DevOps Engineering background.
- Must have experience in large cluster administration.
- Monitor system events to ensure health, maximum system availability, and service quality.
- Manage the container platform ecosystem (installation, upgrade, patching, monitoring).
- Troubleshoot complex technical issues and assist the development team as necessary to resolve issues related to K8s.
- Hands-on experience with Rancher.
- Experience performing system and application patching.
- Provide on-call support.
- Answer user queries and service requests.
Preferred candidate profile
Docker:
- Design, build, manage, and operate the infrastructure and configuration of all platform environments with a focus on automation and infrastructure as code.
- Design, build, manage, and operate the infrastructure-as-a-service layer (hosted and cloud-based platforms) that supports the different platform services.
- Develop a log analytics solution, based on open-source solutions, to provide logging-as-a-service to hosted applications and speed up the debugging process.
- Evaluate performance trends and expected changes in demand and capacity, and establish the appropriate scalability plans.
- Identify and troubleshoot availability and performance issues at multiple layers of deployment: hardware, operating environment, network, and application.
- Recommend and maintain technology-related policies and procedures.
- Identify and suggest opportunities to improve efficiency and functionality.
- Implement data security and protection.
Kubernetes:
- Administering service status on the Kubernetes Master and Minions.
- Administering Pods, Docker images, services, and replication controllers.
- Scaling/descaling Pods.
- Administering configuration changes in the Kubernetes files.
- Microservices deployment.
- Troubleshooting issues encountered during deployment.
- Provide support to the L1 and L2 teams in creating RCAs, resolving issues, and completing daily assigned activities, including training.
- Provide on-call support.
- Good knowledge of shell scripting.
- Conforming to client compliance requirements and expectations.
- Istio or Service Mesh knowledge would be a plus.

Posted 4 days ago


6.0 - 11.0 years

5 - 15 Lacs

hyderabad, pune, bengaluru

Work from Office

Kubernetes Platform Admin
Location: Open
Experience: 6+ years
Required Skills & Experience:
- 7-8 years of Linux and containerization experience.
- 4+ years of hands-on experience with Kubernetes (on-premises).
- Strong expertise in kubeadm, Helm, and Kubernetes internals.
- Experience with bare-metal provisioning, VM infrastructure, and storage solutions.
- Practical knowledge of Prometheus, Grafana, and OpenTelemetry for observability.
- Proficiency with ingress controllers (NGINX / Traefik), cert-manager, and Kubernetes networking.
- Solid understanding of Kubernetes security (RBAC, PSPs, secrets management).
- Hands-on with Docker and CI/CD pipelines.
- Strong Linux system administration and troubleshooting skills.

Posted 1 week ago


7.0 - 12.0 years

8 - 12 Lacs

hyderabad

Work from Office

Position overview: The Staff Systems Engineer will be responsible for providing support to e2open data center engineering functions across multiple geographical locations to achieve Hosting Operations Service Level Objectives. The successful candidate must have excellent large-scale infrastructure system design and implementation experience, possess strong analytical, troubleshooting, and problem-solving skills, and be able to work independently with e2open's cross-functional teams across multiple geographical locations.
Key Responsibilities:
- Manage and support e2open's Kubernetes container infrastructure platform across development, staging, performance testing, QA, and production environments.
- Manage and perform scheduled maintenance procedures to ensure that e2open's hosting services are secure, reliable, and performant.
- Collaborate with counterparts from Datacenter Operations, Information Security, and Product Development to design, develop, and implement solutions that meet security and compliance requirements and SLOs.
- Design, deploy, and manage Kubernetes clusters using VMware vSphere Supervisor and Tanzu Kubernetes Grid Service.
- Develop best practices and procedures to deploy, monitor, and administer applications in a Kubernetes environment using CI/CD pipelines.
- Develop best practices and procedures to configure and maintain the Kubernetes platform using GitOps principles.
- Develop capacity management strategies to maintain the scalability, reliability, and performance of e2open's application hosting services.
- Develop and maintain documentation for system configurations, procedures, and policies.
- Manage and lead complex technical implementation and migration projects against aggressive timelines.
- Work with internal and external teams to troubleshoot customer-impacting issues.
- Onboard and/or assist with onboarding containerized in-house applications into Kubernetes.
Requirements:
- Diploma, Bachelor's Degree, or equivalent in Computer Science, Engineering, or a related field.
- 7+ years of Kubernetes administration, design, and implementation experience, preferably on VMware vSphere Supervisor clusters.
- Experience building and operating large-scale IT infrastructure that runs high-performance distributed systems in production.
- Experience building and operating production Kubernetes clusters in a primarily private but also occasionally public cloud infrastructure.
- Proficiency with Linux, CI/CD tools (ArgoCD, FluxCD), Configuration Management, and Infrastructure as Code (GitOps).
- Strong engineering skills with scripting and programming languages such as Bash, Python, and Go.
- Sound working knowledge of Server Virtualization, Containerization, Disaster Recovery, Automation, Software Defined Datacenter, Enterprise Storage, Operating Systems, Software Defined Networking, Information Security, and Cloud Infrastructure.
- Excellent communication and interpersonal skills.
- Strong analytical and problem-solving skills.
- Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.
- Ability to work independently and as part of a team.
- CKA, CKAD, or CKS certification a plus.

Posted 1 week ago


2.0 - 5.0 years

9 - 14 Lacs

bengaluru

Work from Office

Required skills:
- Must be an immediate joiner.
- Must have experience in CI/CD pipelines to automate software delivery, including building, testing, and deploying applications.
- Must have exposure to managing infrastructure using IaC tools like Terraform or CloudFormation (Terraform preferred) to enable reproducible and scalable environments.
- Able to manage configuration files and tools like Ansible to ensure consistency and maintainability across different environments.
- Must be able to deploy, configure, and manage cloud infrastructure services like GCP to support application deployment and scalability.
- Able to implement and maintain monitoring and logging solutions using Dynatrace.
- Must have familiarity with containerization technologies like Docker and containerd, and container orchestration platforms such as Kubernetes.
- Networking and Security: Understanding of networking concepts, security best practices, and experience in securing cloud infrastructure and applications.
- Release Management: Coordinate and automate the release process, manage version control, and handle environment-specific configurations.
- Must know BTP cloud / Kyma.

Posted 1 week ago


1.0 - 5.0 years

0 Lacs

tamil nadu

On-site

This role is responsible for designing and developing repeatable on-premise and hybrid cloud architectures by combining best practices in infrastructure as code, orchestration, tooling, CI/CD, GitOps, etc. You will utilize these architectures from development to production environments, create reference implementations, and continuously enhance these architectures to meet the increasing demands of both internal and external customers/stakeholders.
As the potential candidate for this role, you will serve as the technical lead for any infrastructure and platform requirements of the project. Your responsibilities include driving engagement, motivating the team towards success, continuously challenging yourself for improvement, adhering to small and consistent increments that provide value and impact to the business, maintaining Agile and growth mindsets, and striving for excellence.
The technical requirements for this role are as follows:
- At least one (1) year of Kubernetes administration experience; Certified Kubernetes Administrator (CKA) and/or Certified Kubernetes Application Developer (CKAD) credentials are a plus but not required
- Proficiency in Linux, preferably with RHCE v7/v8 certification
- Experience in automation with Ansible and Terraform
- Knowledge of observability tools such as Prometheus and Grafana
- Familiarity with cloud platforms like AWS and/or Azure
- Experience in CI/CD pipelines as code using Jenkins
- Proficiency in scripting/programming languages including Bash, Groovy, Python, Ruby, and/or Go
In addition, the technical competencies required for this role include:
- System Administration (GNU/Linux, *nix): Core
- On-demand Infrastructure/Cloud Computing, Storage, and Infrastructure (SaaS, PaaS, IaaS): Advanced
- Virtualization, Containerization, and Orchestration (Docker, Docker Swarm, Kubernetes): Advanced
- Continuous Integration/Deployment (CI/CD) and Automation (Jenkins, Ansible): Advanced
- Project Management: Advanced
- Infrastructure/service monitoring and log aggregation design and implementation (AppDynamics, ELK, Grafana, Prometheus, etc.): Core
- Distributed data processing frameworks (Hadoop, Spark, etc.), big data platforms (EMR, HDInsight, etc.): Entry
- NoSQL and RDBMS Design and Administration (MongoDB, PostgreSQL, Elasticsearch): Core
- Change Management Coordination: Advanced
- Software Development: Core
- DevOps Process Design and Implementation: Advanced
About Standard Chartered: Standard Chartered is an international bank that aims to make a positive difference for clients, communities, and employees. With a history spanning over 170 years, the bank values challenging the status quo, embracing challenges, and seeking opportunities for growth and improvement. The bank's purpose is to drive commerce and prosperity through its unique diversity, guided by the brand promise to be here for good.
As an employee at Standard Chartered, you can expect:
- Core bank funding for retirement savings, medical and life insurance, with flexible and voluntary benefits available in some locations
- Various time-off options including annual leave, parental/maternity leave (20 weeks), sabbatical (up to 12 months), and volunteering leave (3 days)
- Flexible working arrangements based on home and office locations with adaptable working patterns
- Proactive wellbeing support through digital platforms, development courses, Employee Assistance Programme, sick leave, mental health first-aiders, and self-help toolkits
- A continuous learning culture that supports growth, reskilling, and upskilling through physical, virtual, and digital learning opportunities
- An inclusive, values-driven environment that celebrates diversity and respects individuals' potential across teams, functions, and geographies
If you are seeking a purpose-driven career in a values-driven organization that fosters growth and inclusivity, Standard Chartered welcomes your unique talents and contributions.

Posted 2 weeks ago

Apply

7.0 - 10.0 years

20 - 35 Lacs

pune

Hybrid

We are looking for a Site Reliability Engineer who will look after one of the newest propositions in the GfK portfolio supporting our next-generation platform - GfKnewron. If you are excited about cloud engineering (GCP), passionate about all things Kubernetes and understand the importance of automation, then the challenge is for you. You will: Work in partnership with our software engineering teams to build and operate the next-generation infrastructure to support the GfK product portfolio. Work extensively with GCP. Collaborate with the product development squads to enable them to test and deploy software rapidly whilst ensuring the highest standards of reliability and security. Automate the build and deployment of infrastructure using tools such as Terraform, Docker, Kubernetes & other orchestration technologies in a hybrid-cloud environment. Influence architectural decisions with a focus on security, scalability, cost and high performance. Set up and maintain monitoring, metrics and reporting systems for fine-grained observability and actionable alerting, and create and maintain appropriate backup, restore and redundancy solutions for business-critical data. Qualifications: You have 7+ years of relevant experience as an SRE/DevOps Engineer. Have a background in either Systems Administration or Software Engineering. Strong experience with major public Cloud Providers (ideally GCP, but this is not a must-have). Strong experience with Docker and Kubernetes. Strong experience with IaC (Terraform). Strong understanding of GitOps concepts and tools (ideally Flux). Excellent knowledge of technical architecture and modern design patterns, including micro-services, serverless functions, NoSQL, RESTful APIs, etc. Ability to set up and support CI/CD pipelines and tooling using GitLab. Proficiency in a high-level programming language such as Python, Ruby or Go. Experience with monitoring, log aggregation and alerting tooling (GCP Logging, Prometheus, Grafana). 
Additional Information: Exciting work environment that brings people together. Use the latest digital technologies. Ongoing training to support your development. Opportunities for personal and professional growth. Great compensation and bonus scheme linked to individual performance and company results. Flexible working hours and home office.

Posted 2 weeks ago

Apply

3.0 - 6.0 years

4 - 7 Lacs

pune

Work from Office

About the Company Gruve is an innovative Software Services startup dedicated to empowering Enterprise Customers in managing their Data Life Cycle. We specialize in Cyber Security, Customer Experience, Infrastructure, and advanced technologies such as Machine Learning and Artificial Intelligence. Our mission is to assist our customers in their business strategies utilizing their data to make more intelligent decisions. As a well-funded early-stage startup, Gruve offers a dynamic environment with strong customer and partner networks. Why Gruve At Gruve, we foster a culture of innovation, collaboration, and continuous learning. We are committed to building a diverse and inclusive workplace where everyone can thrive and contribute their best work. If you’re passionate about technology and eager to make an impact, we’d love to hear from you. Gruve is an equal opportunity employer. We welcome applicants from all backgrounds and thank all who apply; however, only those selected for an interview will be contacted. Position Description We are seeking an experienced Kubernetes Data Center Administrator to manage and maintain multiple infrastructure systems running Kubernetes across our data centers. The ideal candidate will be responsible for creating, managing, and debugging Kubernetes clusters and services, while ensuring operational excellence through collaboration with IT teams. This role demands deep technical expertise in Kubernetes, virtualization, and data center operations, along with strong experience in ITSM platforms and compliance management. Key Responsibilities Design, deploy, and maintain multiple Kubernetes clusters across data center environments. 
Manage and troubleshoot Kubernetes services including: MinIO (object storage), Prometheus (monitoring), Istio (service mesh), and MongoDB and PostgreSQL (databases). Collaborate with IT teams to support operational needs including: change management; patch and software update cycles; data protection and disaster recovery planning; DCIM (Data Center Infrastructure Management) systems; and compliance audits and reporting. Diagnose and resolve complex Kubernetes configuration issues. Modify platform components and scripts to improve reliability and performance. Administer and integrate multiple ITSM platforms for asset management, change management, incident management, and problem management. Maintain detailed documentation of Kubernetes environments and operational procedures. Ensure systems meet regulatory and organizational compliance standards. Qualifications: 8-10 years of experience in Kubernetes administration and virtualization technologies. Proven experience managing production-grade Kubernetes clusters and services. Strong understanding of data center operations and infrastructure systems. Hands-on experience with ITSM platforms (e.g., Jira Service Management). Proficiency in scripting (e.g., Bash, Python) and automation tools. Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana). Experience with disaster recovery planning and compliance audits. At least one CNCF Kubernetes certification (e.g., CKA, CKS, CKAD). Experience with container security and policy enforcement preferred. Familiarity with GitOps workflows and tools like ArgoCD or Flux preferred. Knowledge of infrastructure-as-code tools (e.g., Terraform, Ansible) preferred. Equal Employment Opportunity: We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. 
Work Environment: This position may require working in a fast-paced environment. On-site presence in Pune is required.

Posted 2 weeks ago

Apply

3.0 - 7.0 years

0 Lacs

pune, maharashtra

On-site

You have hands-on experience working in AWS Cloud environments. You are proficient in implementing best practices for AWS workload security and operational efficiency. You have a proven track record of deploying greenfield AWS Cloud environments, which includes AWS account management, configuring transit gateways, and setting up user access policies/rules. Additionally, you have experience in on-prem to AWS Cloud migrations. Your expertise extends to working in AWS SecOps, DevOps, and FinOps domains. You possess the ability to self-learn and independently execute tasks once requirements are clear. Ideally, you are AWS Certified. You have the capability to integrate cloud infrastructure and automation processes with CI/CD pipelines to streamline deployment and configuration management. In terms of security and compliance, you are skilled in implementing cloud security best practices, including IAM policies, key management, and ensuring compliance with industry standards such as HIPAA, GDPR, or SOC 2. You also have a strong background in writing, managing, and deploying Infrastructure as Code using Terraform. Furthermore, you excel in Kubernetes administration, encompassing managing clusters, deploying containerized applications, monitoring, scaling, and securing workloads in a production environment. This position is based in Pune, Noida, and Bangalore.

Posted 2 weeks ago

Apply

3.0 - 6.0 years

5 - 8 Lacs

mumbai, hyderabad

Work from Office

Immediately available candidates only. Location: Navi Mumbai. About SettleMint: SettleMint India was formed in 2019, with headquarters in Delhi, India. The India team focuses on client deliverables and the development of high-performance low-code Blockchain. We operate from Delhi, along with certain project locations. We are currently seeking to hire a Kubernetes Administrator at our client site in Navi Mumbai to further strengthen our software engineering & delivery team. If you are self-driven, client-focused and results-oriented, we would like to welcome you to our team. Roles and Responsibilities: Design, develop, and deploy scalable and secure Kubernetes-based infrastructure. Collaborate with the development team/vendors to assess and optimise application performance within Kubernetes. Automate deployment, scaling, and management of containerised applications. Develop scripts for automating routine tasks around deployments and monitoring. Resolve technical issues related to the Kubernetes infrastructure. Ensure the high availability of applications and services in the Kubernetes environment. Monitor and review the system logs and detect issues in the Kubernetes cluster. Work closely with the DevOps team to implement continuous integration and delivery processes. Stay updated with new trends and best practices in container orchestration. Develop and maintain documentation for the Kubernetes infrastructure. Conduct regular security audits to ensure the safety of the infrastructure. Help develop and maintain automated processes, tools, and documentation in support of Docker. Securely manage Kubernetes clusters on at least one of the cloud providers (AWS, Azure or GCP). Deploy ReplicaSets, DaemonSets, StatefulSets, CronJobs, Jobs, etc., along with Ingress controllers (Nginx, Istio, etc.) and cloud-native load balancers. Requirements: Certifications such as CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer). 
3-5 years of experience. Experience in creating and managing production-scale Kubernetes clusters. Deep understanding of Kubernetes networking. Experience in setting up monitoring and alerting for Kubernetes clusters using open-source monitoring tools like Grafana and Prometheus. Should be flexible to work in rotating shifts as per the requirement.

Posted 3 weeks ago

Apply

1.0 - 4.0 years

4 - 8 Lacs

hyderabad

Work from Office

Project Role : Technology Support Engineer Project Role Description : Resolve incidents and problems across multiple business system components and ensure operational stability. Create and implement Requests for Change (RFC) and update knowledge base articles to support effective troubleshooting. Collaborate with vendors and help service management teams with issue analysis and resolution. Must have skills : Docker Kubernetes Administration Good to have skills : NA Minimum 5 year(s) of experience is required Educational Qualification : 15 years full time education Summary : As a Technology Support Engineer, you will engage in resolving incidents and problems across various business system components, ensuring operational stability. Your typical day will involve collaborating with different teams, implementing Requests for Change, and updating knowledge base articles to enhance troubleshooting effectiveness. You will also work closely with vendors to assist service management teams in analyzing and resolving issues, contributing to a seamless operational environment. Roles & Responsibilities: - Expected to be an SME. - Collaborate and manage the team to perform. - Responsible for team decisions. - Engage with multiple teams and contribute on key decisions. - Provide solutions to problems for their immediate team and across multiple teams. - Facilitate training sessions for junior team members to enhance their skills. - Monitor and evaluate team performance to ensure alignment with operational goals. Professional & Technical Skills: - Must Have Skills: Proficiency in Docker Kubernetes Administration. - Strong understanding of container orchestration and management. - Experience with cloud platforms and services. - Familiarity with continuous integration and continuous deployment practices. - Knowledge of networking concepts and troubleshooting techniques. 
Additional Information: - The candidate should have a minimum of 5 years of experience in Docker Kubernetes Administration. - This position is based at our Hyderabad office. - 15 years of full-time education is required.

Posted 3 weeks ago

Apply

15.0 - 20.0 years

5 - 10 Lacs

bengaluru

Work from Office

About The Role Project Role : DevOps Engineer Project Role Description : Responsible for building and setting up new development tools and infrastructure utilizing knowledge in continuous integration, delivery, and deployment (CI/CD), Cloud technologies, Container Orchestration and Security. Build and test end-to-end CI/CD pipelines, ensuring that systems are safe against security threats. Must have skills : Microsoft Azure DevOps, DevOps, Docker Kubernetes Administration, knowledge of CI/CD pipeline Good to have skills : NA Minimum 5 year(s) of experience is required Educational Qualification : 15 years full time education Summary : As a DevOps Engineer, you will be responsible for building and setting up new development tools and infrastructure. A typical day involves utilizing your knowledge in continuous integration, delivery, and deployment, as well as cloud technologies and container orchestration. You will work on building and testing end-to-end CI/CD pipelines, ensuring that systems are secure against potential threats while collaborating with various teams to enhance operational efficiency and effectiveness. Roles & Responsibilities: - Expected to be an SME. - Collaborate and manage the team to perform. - Responsible for team decisions. - Engage with multiple teams and contribute on key decisions. - Provide solutions to problems for their immediate team and across multiple teams. - Mentor junior team members to enhance their skills and knowledge. - Continuously evaluate and improve existing processes and tools to optimize performance. 
Professional & Technical Skills: - Must Have Skills: Proficiency in Microsoft Azure DevOps, DevOps, Docker Kubernetes Administration. - Strong understanding of continuous integration and continuous deployment methodologies. - Experience with cloud service providers, particularly Microsoft Azure. - Familiarity with containerization technologies and orchestration tools. - Knowledge of security best practices in software development and deployment. Additional Information: - The candidate should have a minimum of 5 years of experience in Microsoft Azure DevOps. - This position is based at our Bengaluru office. - 15 years of full-time education is required.

Posted 3 weeks ago

Apply

8.0 - 12.0 years

0 Lacs

karnataka

On-site

DISH Network Technologies India Pvt. Ltd, a technology subsidiary of EchoStar Corporation, is a pioneering organization driving innovation and value for its customers through cutting-edge technology solutions. Our diverse product portfolio includes Boost Mobile, DISH TV, Sling TV, OnTech, Hughes, and Hughesnet, offering a wide range of services from consumer wireless to global satellite connectivity solutions. As one of EchoStar's largest development centers outside the U.S., our facilities in India are at the forefront of technological convergence. Our talented engineering team is dedicated to catalyzing innovation in multimedia network and communications development. Join our Technology teams that challenge the status quo and redefine industry capabilities. Through research, technology innovation, and solution engineering, you will be instrumental in shaping the products and platforms of tomorrow, connecting consumers with innovative solutions. In the role of System Reliability & Performance, you will be responsible for designing, implementing, and maintaining monitoring solutions for various platforms like webMethods, GemFire, AWS services, and Kubernetes clusters. Your tasks will include automation of operational tasks, incident response, and system provisioning. Additionally, you will participate in on-call rotations, conduct performance tuning, and ensure platform reliability. Your expertise will be key in managing and optimizing platforms like webMethods, GemFire, AWS Cloud, and Rancher Kubernetes. Collaborating closely with development teams, you will champion SRE best practices and ensure system documentation and maintenance. As a mentor, you will contribute to a culture of continuous learning and improvement. To excel in this role, you should hold a Bachelor's degree in Computer Science or a related field and have at least 8 years of experience in SRE, DevOps, or technical operations. 
Proficiency in webMethods, GemFire, AWS cloud services, Kubernetes administration, and scripting languages like Python and Java is essential. Experience with CI/CD pipelines, monitoring tools, and networking concepts is required. Nice-to-have skills include familiarity with other integration platforms, distributed databases, AWS certifications, Kubernetes certifications, and chaos engineering principles. An agile mindset and effective problem-solving abilities are crucial for success in this fast-paced, collaborative environment. At DISH Network Technologies India Pvt. Ltd, we offer a range of benefits including insurance, financial programs, mental wellbeing support, employee stock purchase program, professional development reimbursement, and team outings. Join us in driving technological innovation and redefining the future of connectivity.

Posted 1 month ago

Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability and uptime of critical services for our client's team. Your primary responsibilities will revolve around Kubernetes administration, CentOS server management, Java application support, incident handling, and change management. The ideal candidate for this role should have a solid background in ArgoCD for Kubernetes management, Linux proficiency, basic scripting skills, and familiarity with modern monitoring, alerting, and automation tools. We are seeking a self-motivated individual with strong communication skills, both verbal and written, who can work effectively both independently and collaboratively. Your daily tasks will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will be responsible for conducting routine system and application maintenance tasks following standard operating procedures to prevent and resolve issues promptly. Additionally, you will be in charge of responding to and managing incidents, facilitating post-mortem meetings, conducting root cause analysis, and ensuring timely issue resolution. Furthermore, you will monitor production systems, applications, and overall performance, utilizing tools to detect abnormal behaviors in software and collect relevant information for developers to understand and address the underlying causes. Security checks, policy and procedure documentation, script/code writing for tool and service development, post-mortem learning, and administration work on tools like JIRA and New Relic are also part of your responsibilities. In terms of technical skills, you should have at least 5 years of experience in a SaaS and Cloud environment. 
Proficiency in Kubernetes cluster administration, Linux scripting, database systems (MySQL, DB2), Linux (CentOS / RHEL) administration, change management procedures, on-call responsibilities, deployment management using Jenkins, monitoring tools (e.g., New Relic, Splunk, Nagios), log aggregation tools (e.g., Splunk, Loki, Grafana), and scripting knowledge in at least one language is essential. Experience with API programming and integrating tools such as Jira, Slack, xMatters/PagerDuty will be advantageous for this role.

Posted 1 month ago

Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

You will be joining our client's team as a Site Reliability Engineer, where your main responsibility will be ensuring the reliability and uptime of critical services. This will involve a strong focus on Kubernetes administration, CentOS servers, Java application support, incident management, and change management. The ideal candidate for this role will have strong experience with ArgoCD for Kubernetes management, Linux skills, basic scripting knowledge, and familiarity with modern monitoring, alerting, and automation tools. We are looking for someone who is self-motivated, possesses excellent communication skills (both oral and written), and can work both independently and collaboratively. Your main tasks will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will also be responsible for conducting routine tasks for system and application maintenance, following SOPs to correct and prevent issues. In addition, you will respond to and manage running incidents, conduct post-mortem meetings, perform root cause analysis, and ensure timely resolution. Furthermore, you will be monitoring production systems, applications, and overall performance, using tools to detect abnormal behaviors in the software and collect information to help developers understand the root causes of problems. Security checks, running meetings with business partners, writing and maintaining policy and procedure documents, writing scripts or code as necessary to develop tools and services, and learning from post-mortems to prevent new incidents are also part of your responsibilities. 
Technical skills required for this role include 5+ years of experience working in a SaaS and Cloud environment, administration of Kubernetes clusters with ArgoCD, Linux scripting for automation, experience with database systems like MySQL and DB2, Linux administration skills, understanding of change management procedures, on-call responsibilities, experience with managing deployments using Jenkins, and familiarity with monitoring tools like New Relic, Splunk, and Nagios. Additionally, experience with log aggregation tools like Splunk, Loki, or Grafana, strong scripting knowledge in at least one language, and experience with API programming and integrating tools such as Jira, Slack, and xMatters/PagerDuty are preferred. This is an exciting opportunity for a motivated individual with the right skill set to make a significant impact on our client's team.

Posted 1 month ago

Apply

6.0 - 10.0 years

0 Lacs

pune, maharashtra

On-site

As an OpenShift Admin with 6 to 8 years of relevant experience, you will be responsible for building automation to support product development and data analytics initiatives. In this role, you will develop and maintain strong customer relationships to ensure effective service delivery and customer satisfaction. Regular interaction with customers will be essential to refine requirements, gain agreement on solutions and deliverables, provide progress reports, monitor satisfaction levels, identify and resolve concerns, and seek cooperation to achieve mutual objectives. To be successful in this role, you must have a minimum of 6 years of experience as an OpenShift Admin, with expertise in Kubernetes Administration, Automation tools such as Ansible, AWS EKS, Argo CD, and Linux administration. Extensive knowledge and experience with OpenShift and Kubernetes are crucial for this infrastructure-focused position. You should be experienced in deploying new app containers from scratch in OpenShift or Kubernetes, as well as upgrading OpenShift and working with observability in these environments. Additional skills that would be beneficial for this role include experience with Anthos/GKE for Hybrid Cloud, HashiCorp Terraform, and HashiCorp Vault. As an OpenShift Admin, you will be expected to create, maintain, and track designs at both high and detailed levels, identify new technologies for adoption, conduct consistent code reviews, and propose changes where necessary. You will also be responsible for provisioning infrastructure, developing automation scripts, monitoring system performance, integrating security and compliance measures, documenting configurations and processes, and deploying infrastructure as code and applications using automation and orchestration tools. The hiring process for this position will consist of screening rounds conducted by HR, followed by two technical rounds, and a final HR round. 
If you are someone with a strong background in OpenShift Administration and related technologies, and you are passionate about driving innovation and excellence in infrastructure management, we encourage you to apply for this role in our Pune office.

Posted 1 month ago

Apply

6.0 - 11.0 years

20 - 35 Lacs

Hyderabad

Work from Office

Job Summary: We are looking for a highly skilled and adaptable Senior Site Reliability Engineer / Principal Site Reliability Engineer to become a key member of our Cloud Engineering team. In this crucial role, you will be instrumental in designing and refining our cloud infrastructure with a strong focus on reliability, security, and scalability. As an SRE, you'll apply software engineering principles to solve operational challenges, ensuring the overall operational resilience and continuous stability of our systems. This position requires a blend of managing live production environments and contributing to engineering efforts such as automation and system improvements. Responsibilities: Cloud Infrastructure Architecture and Management: Design, build, and maintain resilient cloud infrastructure solutions to support the development and deployment of scalable and reliable applications. This includes managing and optimizing cloud platforms for high availability, performance, and cost efficiency. Enhancing Service Reliability: Lead reliability best practices by establishing and managing monitoring and alerting systems to proactively detect and respond to anomalies and performance issues. Utilize SLI, SLO, and SLA concepts to measure and improve reliability. Identify and resolve potential bottlenecks and areas for enhancement. Driving Automation and Efficiency: Contribute to the automation, provisioning, and standardization of infrastructure resources and system configurations. Identify and implement automation for repetitive tasks to significantly reduce operational overhead. Develop Standard Operating Procedures (SOPs) and automate workflows using tools like Rundeck or Jenkins. Incident Response and Resolution: Participate in and help resolve major incidents, conduct thorough root cause analyses, and implement permanent solutions. Effectively manage incidents within the production environment using a systematic problem-solving approach. 
Collaboration and Innovation: Work closely with diverse stakeholders and cross-functional teams, including software engineers, to integrate cloud solutions, gather requirements, and execute Proofs of Concept (POCs). Foster strong collaboration and communication. Guide designs and processes with a focus on resilience and minimizing manual effort. Promote the adoption of common tooling and components, and implement software and tools to enhance resilience and automate operations. Be open to adopting new tools and approaches as needed. Requirements: Experience: 6 to 14 years. Role: We have multiple roles; the final role will depend on the candidate's experience and credentials. Education: BE/B.Tech/MCA/M.Sc./M.Tech/M.S. Technology Stack: Linux Administration, Shell / Python Scripting, AWS Cloud Services (EC2, S3), Cloud Operations, Linux (CentOS, Rocky Linux), Jenkins, ArgoCD, Kubernetes Management, Ansible, Terraform, OS Patching, Release Management, Incident Management. Infrastructure Management: Proven proficiency in on-premises hosting and virtualization platforms (VMware, Hyper-V, or KVM). Solid understanding of storage internals (NAS, SAN, EFS, NFS) and protocols (FTP, SFTP, SMTP, NTP, DNS, DHCP). Experience with networking and firewall technologies. Strong hands-on experience with Linux internals and operating systems (RHEL, CentOS, Rocky Linux). Experience with Windows operating systems to support varied environments. Service Reliability Concepts: Good understanding of SLI, SLO, SLA and error budgeting. Other Mandatory Requirements: 1) Excellent communication skills. 2) 24/7 support with monthly rotation shifts.

Posted 1 month ago

Apply

5.0 - 8.0 years

0 - 1 Lacs

Pune

Work from Office

Certified Kubernetes Administrator (CKA) mandatory. Very good knowledge and operational experience with containerization and cluster management infrastructure setup and production environment maintenance (Kubernetes, vCluster, Docker, Helm). Very good knowledge and experience with high availability requirements (RTO and RPO) on cloud (AWS preferred, with VPC, Subnet, ELB, Secrets Manager, EBS Snapshots, EC2 Security Groups, ECS, CloudWatch and SQS). Very good knowledge and experience in administering Linux clients and servers. Experience working with data storage, backup and disaster recovery using DynamoDB, RDS PostgreSQL and S3. Good experience and confidence with code versioning (GitLab preferred). Experience in automation with programming and IaC scripts (Python / Terraform). Experience with SSO setup and user management with Keycloak and/or Okta SSO. Experience in service mesh monitoring setup with Istio, Kiali, Grafana, Loki and Prometheus. Experience with GitOps setup and management for ArgoCD / FluxCD.

Posted 1 month ago

Apply

4.0 - 8.0 years

0 Lacs

maharashtra

On-site

As a Kubernetes Administrator/DevOps Senior Consultant, you will be responsible for designing, provisioning, and managing Kubernetes clusters for applications based on micro-services and event-driven architectures. Your role will involve ensuring seamless integration of applications with Kubernetes orchestrated environments and configuring and managing Kubernetes resources such as pods, services, deployments, and namespaces. Monitoring and troubleshooting Kubernetes clusters to identify and resolve performance issues, system errors, and other operational challenges will be a key aspect of your responsibilities. You will also be required to implement infrastructure as code (IAC) using tools like Ansible and Terraform for configuration management. Furthermore, you will design and implement cluster and application monitoring using tools like Prometheus, Grafana, OpenTelemetry, and Datadog. Managing and optimizing AWS cloud resources and infrastructure for Managed containerized environments (ECR, EKS, Fargate, EC2) will be a part of your daily tasks. Ensuring high availability, scalability, and security of all infrastructure components, monitoring system performance, identifying bottlenecks, and implementing necessary optimizations are also crucial responsibilities. Your role will involve troubleshooting and resolving complex issues related to the DevOps stack, developing and maintaining documentation for DevOps processes and best practices, and staying current with industry trends and emerging technologies to drive continuous improvement. Creating and managing DevOps pipelines, IAC, CI/CD, and Cloud Platforms will also be part of your duties. **Required Skills:** - 4-5 years of extensive hands-on experience in Kubernetes Administration, Docker, Ansible/Terraform, AWS, EKS, and corresponding cloud environments. - Hands-on experience in designing and implementing Service Discovery, Service Mesh, and Load Balancers. 
- Extensive experience in defining and creating declarative files in YAML for provisioning. - Experience in troubleshooting containerized environments using a combination of monitoring tools and logs. - Scripting and automation skills (e.g., Bash, Python) for managing Kubernetes configurations and deployments. - Hands-on experience with Helm charts, API gateways, ingress/egress gateways, and service meshes (Istio, etc.). - Hands-on experience in managing Kubernetes networking (Services, Endpoints, DNS, Load Balancers) and storage (PV, PVC, Storage Classes, Provisioners). - Design, enhance, and implement additional services for centralized Observability Platforms, ensuring efficient log management based on the Elastic Stack, and effective monitoring and alerting powered by Prometheus. - Design and implement CI/CD pipelines; hands-on experience in IAC, git, and monitoring tools like Prometheus, Grafana, Kibana, etc. **Good to Have Skills:** - Relevant certifications (e.g., Certified Kubernetes Administrator CKA / CKAD) are a plus. - Experience with cloud platforms (e.g., AWS, Azure, GCP) and their managed Kubernetes services. - Perform capacity planning for Kubernetes clusters and optimize costs in On-Prem and cloud environments. **Preferred Experience:** - 4-5 years of experience in Kubernetes, Docker/Containerization.

Posted 1 month ago


4.0 - 8.0 years

5 - 8 Lacs

Bengaluru, Karnataka, India

On-site

Position Summary: We are seeking a highly skilled and experienced Senior Kubernetes Administrator to join our team. The ideal candidate will be a true expert in Kubernetes with a strong background in container orchestration, deployment, and management. Certification in Kubernetes administration is required, along with a minimum of 10 years of hands-on experience post-certification. The Senior Kubernetes Administrator will play a critical role in designing, implementing, and maintaining our Kubernetes infrastructure to ensure optimal performance, scalability, and reliability.

Key Responsibilities:

- Design and implement Kubernetes clusters according to best practices, considering factors such as scalability, performance, security, and high availability.
- Configure and manage Kubernetes resources including pods, services, deployments, and namespaces.
- Monitor and troubleshoot Kubernetes clusters to identify and resolve performance issues, system errors, and other operational challenges.
- Develop automation scripts and tools to streamline deployment, configuration, and maintenance tasks.
- Collaborate with cross-functional teams to define infrastructure requirements, design solutions, and implement changes as needed.
- Ensure compliance with security policies and best practices in Kubernetes environments.
- Stay up-to-date with the latest trends, technologies, and best practices in Kubernetes and container orchestration.

Requirements:

- Bachelor's degree in Computer Science, Information Technology, or related field.
- Certification in Kubernetes administration (e.g., Certified Kubernetes Administrator - CKA).
- Minimum of 10 years of experience in IT operations, with at least 5 years focused on Kubernetes administration.
- In-depth knowledge of Kubernetes architecture, components, and ecosystem.
- Proficiency in container technologies such as Docker, container networking, and storage.
- Strong scripting skills (e.g., Bash, Python) and experience with automation tools (e.g., Ansible, Terraform).
- Solid understanding of Linux operating systems and system administration.
- Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems.
- Effective communication skills and the ability to work collaboratively in a team environment.
- Experience with cloud platforms (e.g., AWS, Azure, Google Cloud) and container orchestration services is a plus.

Posted 1 month ago


5.0 - 9.0 years

0 Lacs

karnataka

On-site

You will be joining our client's team as a Site Reliability Engineer, where your main responsibility will be to ensure the reliability and uptime of critical services. Your focus will include Kubernetes administration, CentOS servers, Java application support, incident management, and change management. The ideal candidate for this role will have strong experience with ArgoCD for Kubernetes management, Linux skills, basic scripting knowledge, and familiarity with modern monitoring, alerting, and automation tools. We are looking for a self-motivated individual with excellent communication skills, both oral and written, who can work effectively both independently and collaboratively. Your responsibilities will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will be conducting routine tasks for system and application maintenance and following SOPs to correct or prevent issues. Responding to and managing running incidents, including post-mortem meetings, root cause analysis, and timely resolution will also be part of your responsibilities. Additionally, you will be monitoring production systems, applications, and overall performance, using tools to detect abnormal behaviors in the software and collecting information to help developers understand the issues. Security checks, running meetings with business partners, writing and maintaining policy and procedure documents, writing scripts or code as necessary, and learning from post-mortems to prevent new incidents are also key aspects of the role. 
Technical skills required for this position include:

- 5+ years of experience in a SaaS and Cloud environment
- Administration of Kubernetes clusters, including management of applications using ArgoCD
- Linux scripting to automate routine tasks and improve operational efficiency
- Experience with database systems like MySQL and DB2
- Experience as a Linux (CentOS / RHEL) administrator
- Understanding of change management procedures and enforcement of safe and compliant changes to production environments
- Knowledge of on-call responsibilities and maintaining on-call management tools
- Experience with managing deployments using Jenkins
- Prior experience with monitoring tools like New Relic, Splunk, and Nagios
- Experience with log aggregation tools such as Splunk, Loki, or Grafana
- Strong scripting knowledge in one of Python, Ruby, Bash, Java, or GoLang
- Experience with API programming and integrating tools like Jira, Slack, xMatters, or PagerDuty

If you are a dedicated professional who thrives in a high-pressure environment and enjoys working on critical services, this opportunity could be a great fit for you.

Posted 1 month ago



Featured Companies