Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
5.0 - 10.0 years
8 - 12 Lacs
mumbai
Work from Office
Job Description : We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 5 days ago
5.0 - 10.0 years
8 - 12 Lacs
pune
Work from Office
Job Description : We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 5 days ago
5.0 - 10.0 years
7 - 11 Lacs
ahmedabad
Work from Office
Job Description : We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 6 days ago
13.0 - 15.0 years
1 - 3 Lacs
bengaluru, karnataka, india
On-site
The Role Join us as a Site Reliability Engineer (SRE) and embark on an exciting journey of ensuring reliability, resiliency, and innovation in our information systems and ecosystems. As an SRE at Kyndryl, you'll be at the forefront of driving continuous improvement and delivering exceptional service to our customers. Your role goes beyond traditional engineering, as you'll have the opportunity to analyze business needs, tackle complex problems, and provide strategic advice and designs. You'll be involved in every stage of the software lifecycle, from building and testing to deploying changes and maintaining robust systems. We're looking for a true visionary who can think strategically and help shape the future of our services. Your expertise in building trusted relationships with customers and partnering with them for success will be instrumental in driving our growth. As an SRE, you'll have the unique opportunity to work on end-to-end services, spanning customer sites and platforms. Collaboration and proactivity are key as you work alongside a talented team of professionals, eager to make a difference. You'll embrace an entrepreneurial mindset, taking ownership of your responsibilities and constantly seeking innovative solutions. With an unwavering focus on quality, robustness, and security, you'll be a driving force in implementing cutting-edge tools that enhance our operations, improve reliability, and gather valuable feedback on our platforms. Your ability to identify and mitigate common operational issues will play a crucial role in delivering seamless experiences to our customers. If you're passionate about pushing the boundaries of technology, thrive in a collaborative environment, and are motivated by the opportunity to shape the future of reliability engineering, then we want to hear from you. Join our team and be part of a dynamic and forward-thinking organization that values innovation and excellence in everything we do. As Site Reliability Engineer , you will focus on Infrastructure Services Clients and work constantly towards enhancing the Reliability of the estate. You will optimize the availability of IT infrastructure, systems and services constantly working on improving the reliability of the environment to meet the commitments which has made to its clients related to availability target levels in a cost-effective manner. You will use technical and client environment knowledge to assure services and components are designed and delivered to meet their availability targets. You are required to specialize in reliability with the right mix of knowledge and skills in Cloud Systems, responsible to analyze business needs, problem determination, advise & design, build, test, deploy, changes and maintenance of a well-engineered information system. Your Future at Kyndryl Kyndryl has a global footprint, which means that as a Site Reliability Engineer at Kyndryl you will have opportunities to work on projects and collaborate with colleagues from around the world. This role is dynamic and influential offering a wide range of professional and personal growth opportunities that you won't find anywhere else. Who You Are You're good at what you do and possess the required experience to prove it. However, equally as important you have a growth mindset; keen to drive your own personal and professional development. You are customer-focused someone who prioritizes customer success in their work. And finally, you're open and borderless naturally inclusive in how you work with others. Required Technical and Professional Expertise 13+ Years. of experience in Linux/Windows. Strong exposure on Cloud Architecture on VMware, Citrix, Nutanix, Azure, AWS, Cloud deployments, Microsoft Windows, Microsoft Exchange, Microsoft SQL and Multisite clustering. E xperience in handling multiple accounts as required with over all Wintel and virtualizing . Strong experience on Microsoft Active Directory federation Service Implementation, Integration with 3rd party Application and upgrading Domain Controllers. Strong Knowledge on SCCM, SCOM ,WSUS Services and Veritas Clustering for Windows. Strong knowledge on managing File Server, Storage Server and CA Server (Certificate Authority Servers). Strong Understanding on Storge and Network to target any issues end to end and exp in troubleshooting on Network packet capture and Analyzing Packet capture. Monitor IT availability levels by comparing actual levels against targets and addressing shortfalls. End-to-end understanding of enterprise architectures and complex (backend) systems (understand more than the component itself) Passion for resolving reliability issues and identify strategies to mitigate going forward. Ability to root cause sources of instability in a high-traffic, distributed system. Understanding and practical working experience of operating systems / hypervisor internals are familiar with the TCP/IP stack, network routing and load balancing. Experience with configuration and troubleshooting. Proven experience in risk-based systems running on Several Server platforms along with OS clustering . Experience in Windows and VM Administration, Clustering, Hyper-V, Azure Hybrid Infrastructure and AWS. Preferred Technical and Professional Experience Bachelor's Degree mandatory. CCNA/CCNP Certification. SUSE Linux/ RHEL Certification. Ansible/Python experience preferred.
Posted 6 days ago
5.0 - 10.0 years
8 - 12 Lacs
hyderabad
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 6 days ago
5.0 - 10.0 years
8 - 12 Lacs
gurugram
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 6 days ago
5.0 - 10.0 years
8 - 12 Lacs
chennai
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 6 days ago
5.0 - 10.0 years
8 - 12 Lacs
noida
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges.
Posted 6 days ago
5.0 - 10.0 years
8 - 12 Lacs
bengaluru
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 6 days ago
5.0 - 10.0 years
8 - 12 Lacs
kolkata
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 week ago
3.0 - 8.0 years
3 - 8 Lacs
navi mumbai, maharashtra, india
On-site
Experienced in .NET (35 yrs), DevOps/SRE (3+ yrs), CI/CD, Git, IaC, Agile, and cloud-native application development. Skilled in observability, KQL/SQL, and delivering cross-functional DevOps solutions in production environments.
Posted 1 week ago
3.0 - 8.0 years
3 - 8 Lacs
ahmedabad, gujarat, india
On-site
Experienced in .NET (35 yrs), DevOps/SRE (3+ yrs), CI/CD, Git, IaC, Agile, and cloud-native application development. Skilled in observability, KQL/SQL, and delivering cross-functional DevOps solutions in production environments.
Posted 1 week ago
8.0 - 12.0 years
0 Lacs
pune, maharashtra
On-site
As the Tech Lead, Technical Production Control at Fiserv, you will be an integral part of a 24/7 team responsible for monitoring the production platform including applications and servers. Your primary role will involve addressing incidents such as alerts related to Batch Jobs, servers, and applications by coordinating with relevant parties and taking ownership of the situation. You will interact with US counterparts, Datacenter teams, and external vendors to ensure smooth operations. Key Responsibilities: - Resolve production incidents by analyzing issues and coordinating with necessary parties. - Monitor and respond to Observability Alerts using tools like Splunk, Dynatrace, and Automic. - Collaborate with L1 and L2 teams to isolate and escalate alerts related to application and datacenter issues. - Work with Application/systems Analyst on production issues/outages. - Adhere to data center policies and corporate IT compliance standards. - Maintain knowledge of various applications, their interactions, and procedures to correct production errors. - Diagnose and respond to customer support service requests. - Perform advanced research to resolve production-related incidents. - Ensure service and performance compliance with documented standards. - Identify automation opportunities for repetitive tasks. Requirements: - Bachelor's degree in Information Technology, Computer Science, Electrical/Computer Engineering, or related field. - 8-10 years of overall experience. - Proficiency in Monitoring and alerting tools such as Splunk, Dynatrace, Moogsoft, Automic. - Experience with ticketing tools like ServiceNow for incident documentation. - Deep knowledge of Windows 2012 and 2019 servers. - Strong written communication skills for preparing Incident Reports. - Experience in meeting stringent SLAs and implementing process improvements. - Financial industry experience is a plus. - Automation and scripting proficiency (scheduled jobs, Splunk, Python, Selenium). - Willingness to work flexible shifts or weekends based on business demands. - Experience in Site Reliability. Join us at Fiserv Technology group to work on revenue-generating projects and deliver best-in-class financial services products. Explore the opportunities of a career with Fiserv and Find Your Forward with us. Thank you for your interest in employment with Fiserv. To apply, please use your legal name, complete the profile, and attach your resume.,
Posted 1 week ago
5.0 - 7.0 years
17 - 22 Lacs
hyderabad
Work from Office
The ideal candidate is a Senior Site Reliability Engineer with strong expertise in CI/CD pipeline design, infrastructure automation, and backend service development. They have hands-on experience with Node.js, Python scripting, and managing large-scale Kubernetes clusters. The candidate is well-versed in AWS cloud infrastructure, including AWS CDK, and has a deep understanding of DevOps and security best practices. Familiarity with ArgoCD, Kustomize, and GitOps workflows is a strong advantage. They should also be capable of monitoring and optimizing system performance, ensuring reliability and scalability across environments, and collaborating with cross-functional teams. Responsibilities : - Lead the design and implementation of CI/CD pipelines to streamline deployment processes. - Develop and maintain backend services using Node.js, focusing on security and mitigating cyber vulnerabilities. - Automate processes using Python scripting to build utilities that support CI/CD pipelines. - Manage large-scale infrastructure and multiple Kubernetes clusters to ensure optimal performance and reliability. - Implement AWS infrastructure solutions, utilizing AWS CDK and core AWS services to enhance our cloud capabilities. - Collaborate with cross-functional teams to ensure seamless integration of services and infrastructure. - Monitor system performance and troubleshoot issues to maintain high availability and reliability. Qualifications we seek in you : Minimum Qualifications / Skills : - Proven experience in a Senior SRE or similar role. - Strong expertise in CI/CD deployments. - Working knowledge of Python scripting for automation. - Experience in developing and maintaining backend services using Node.js. - Practical experience with AWS infrastructure, including strong working knowledge of AWS CDK and core AWS services. Preferred Qualifications/ Skills : - Familiarity with ArgoCD and Kustomize. - Hands-on experience in managing large-scale infrastructure and multiple Kubernetes clusters. - Strong understanding of security best practice in software development.
Posted 1 week ago
4.0 - 5.0 years
8 - 12 Lacs
gurugram
Work from Office
Position Overview : We are seeking an SRE to join our high-impact platform engineering team. You will maintain SLAs for real-time services deployed across hybrid clouds and Kubernetes clusters, contributing to automation, observability, and availability goals. Roles and Responsibilities : - Monitor application and infrastructure metrics; build dashboards and alerts (Prometheus, Grafana, ELK). - Automate health checks, incident remediation, and reliability guardrails. - Manage on-call rotations, conduct root cause analysis, and implement postmortem action plans. - Define and track SLOs, SLIs, and error budgets. - Use chaos engineering and resilience testing to ensure fault tolerance. Must Have Skills : - 4 - 5 years of experience in managing production-grade Kubernetes clusters and cloud-native platforms. - Proficiency in Linux system internals, containers, and networking. - Scripting/automation expertise in Python/Go/Shell. - Familiarity with incident management, runbooks, and observability standards. - Exposure to service discovery, DNS routing, and load balancing is a bonus. Qualification : BE/BTech/MCA/ME/MTech/MS in Computer Science or a related technical field or equivalent practical experience.
Posted 1 week ago
5.0 - 7.0 years
17 - 22 Lacs
bengaluru
Work from Office
The ideal candidate is a Senior Site Reliability Engineer with strong expertise in CI/CD pipeline design, infrastructure automation, and backend service development. They have hands-on experience with Node.js, Python scripting, and managing large-scale Kubernetes clusters. The candidate is well-versed in AWS cloud infrastructure, including AWS CDK, and has a deep understanding of DevOps and security best practices. Familiarity with ArgoCD, Kustomize, and GitOps workflows is a strong advantage. They should also be capable of monitoring and optimizing system performance, ensuring reliability and scalability across environments, and collaborating with cross-functional teams. Responsibilities : - Lead the design and implementation of CI/CD pipelines to streamline deployment processes. - Develop and maintain backend services using Node.js, focusing on security and mitigating cyber vulnerabilities. - Automate processes using Python scripting to build utilities that support CI/CD pipelines. - Manage large-scale infrastructure and multiple Kubernetes clusters to ensure optimal performance and reliability. - Implement AWS infrastructure solutions, utilizing AWS CDK and core AWS services to enhance our cloud capabilities. - Collaborate with cross-functional teams to ensure seamless integration of services and infrastructure. - Monitor system performance and troubleshoot issues to maintain high availability and reliability. Qualifications we seek in you : Minimum Qualifications / Skills : - Proven experience in a Senior SRE or similar role. - Strong expertise in CI/CD deployments. - Working knowledge of Python scripting for automation. - Experience in developing and maintaining backend services using Node.js. - Practical experience with AWS infrastructure, including strong working knowledge of AWS CDK and core AWS services. Preferred Qualifications/ Skills : - Familiarity with ArgoCD and Kustomize. - Hands-on experience in managing large-scale infrastructure and multiple Kubernetes clusters. - Strong understanding of security best practice in software development.
Posted 1 week ago
6.0 - 9.0 years
12 - 16 Lacs
pune
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools: logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills - 510 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD: GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) ? - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets
Posted 1 week ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
As an SRE Engineer with 5 to 8 years of experience, you will be responsible for deploying, managing, and monitoring software throughout the full Continuous Delivery lifecycle. You will collaborate closely with developers to provide an operational perspective, suggesting enhancements for further scalability, improved performance, and easier maintenance of systems. Additionally, you will actively participate in operational initiatives such as containerization and virtualization technologies, while also monitoring results and driving continuous improvement through process enhancements. Your role will involve mentoring and coaching junior engineers to foster their professional growth and development. To excel in this position, you are expected to have a minimum of 5 years of experience as a Site Reliability or DevOps Engineer. A degree in Engineering or Computer Science or equivalent practical experience is required. Hands-on experience in automating the deployment and monitoring of distributed applications at large scale in both public and private cloud environments is essential. Proficiency in utilizing build and CI/CD pipeline tools like Jenkins (preferred) or GitLab CI, along with related technologies such as Nexus, static code analysis tools, and open-source scans, is highly valued. Moreover, expertise in Azure is a mandatory requirement for this role.,
Posted 2 weeks ago
6.0 - 9.0 years
4 - 7 Lacs
navi mumbai
Work from Office
Monitoring: Prometheus, Grafana, Stackdriver, Splunk - Incident Management: PagerDuty, Opsgenie, VictorOps - Performance Testing: JMeter, Gatling, LoadRunner - Logging and Analysis: ELK Stack (Elasticsearch, Logstash, Kibana) - Automation Tools: Terraform, Ansible, Kubernetes Operators - Cloud: GCP (Monitoring, Logging, Compute Engine, Cloud Functions) - Scripting: Python, Go, Bash
Posted 2 weeks ago
6.0 - 9.0 years
4 - 7 Lacs
hyderabad, gachibowli
Work from Office
Monitoring: Prometheus, Grafana, Stackdriver, Splunk - Incident Management: PagerDuty, Opsgenie, VictorOps - Performance Testing: JMeter, Gatling, LoadRunner - Logging and Analysis: ELK Stack (Elasticsearch, Logstash, Kibana) - Automation Tools: Terraform, Ansible, Kubernetes Operators - Cloud: GCP (Monitoring, Logging, Compute Engine, Cloud Functions) - Scripting: Python, Go, Bash
Posted 2 weeks ago
6.0 - 9.0 years
4 - 7 Lacs
hyderabad, hitech city
Work from Office
Monitoring: Prometheus, Grafana, Stackdriver, Splunk - Incident Management: PagerDuty, Opsgenie, VictorOps - Performance Testing: JMeter, Gatling, LoadRunner - Logging and Analysis: ELK Stack (Elasticsearch, Logstash, Kibana) - Automation Tools: Terraform, Ansible, Kubernetes Operators - Cloud: GCP (Monitoring, Logging, Compute Engine, Cloud Functions) - Scripting: Python, Go, Bash
Posted 2 weeks ago
6.0 - 9.0 years
4 - 7 Lacs
mumbai suburban
Work from Office
Monitoring: Prometheus, Grafana, Stackdriver, Splunk - Incident Management: PagerDuty, Opsgenie, VictorOps - Performance Testing: JMeter, Gatling, LoadRunner - Logging and Analysis: ELK Stack (Elasticsearch, Logstash, Kibana) - Automation Tools: Terraform, Ansible, Kubernetes Operators - Cloud: GCP (Monitoring, Logging, Compute Engine, Cloud Functions) - Scripting: Python, Go, Bash
Posted 2 weeks ago
6.0 - 9.0 years
8 - 11 Lacs
hyderabad
Work from Office
Monitoring: Prometheus, Grafana, Stackdriver, Splunk - Incident Management: PagerDuty, Opsgenie, VictorOps - Performance Testing: JMeter, Gatling, LoadRunner - Logging and Analysis: ELK Stack (Elasticsearch, Logstash, Kibana) - Automation Tools: Terraform, Ansible, Kubernetes Operators - Cloud: GCP (Monitoring, Logging, Compute Engine, Cloud Functions) - Scripting: Python, Go, Bash
Posted 2 weeks ago
6.0 - 9.0 years
8 - 11 Lacs
mumbai
Work from Office
Monitoring: Prometheus, Grafana, Stackdriver, Splunk - Incident Management: PagerDuty, Opsgenie, VictorOps - Performance Testing: JMeter, Gatling, LoadRunner - Logging and Analysis: ELK Stack (Elasticsearch, Logstash, Kibana) - Automation Tools: Terraform, Ansible, Kubernetes Operators - Cloud: GCP (Monitoring, Logging, Compute Engine, Cloud Functions) - Scripting: Python, Go, Bash
Posted 2 weeks ago
12.0 - 16.0 years
0 Lacs
karnataka
On-site
Delta Tech Hub is a vital part of Delta Air Lines, a global airline leader renowned for safety, innovation, reliability, and exceptional customer experience. As a pivotal contributor to Delta's mission of connecting people and cultures worldwide, the Technology Hub focuses on delivering niche, IP-intensive, and innovative solutions to enhance operational excellence and customer service. The Hub plays a crucial role in the airline's transformation agenda by collaborating with global teams to create memorable customer experiences. To excel in this role, you must possess a Bachelor's degree in computer science, Information Systems, or a related technical field. Additionally, you should have at least 12 years of experience in Software Architecture or Lead Software Engineering at an enterprise scale, with 7+ years of hands-on experience in Enterprise Software application design, development, support, DevOps, and Site Reliability. Proficiency in Java, Python, enterprise software architecture, design, integrations, security, and platform engineering is essential. Moreover, you should be well-versed in Agile methodologies, DevOps principles, practices, and tools. Expertise in Application Security principles, microservices architecture, real-time event processing, high transaction volumes, messaging middleware, cloud native applications, and designing highly available, disaster-ready, resilient, and scalable applications is crucial. Demonstrated proficiency with AWS Technology stack and container technologies like OpenShift (Kubernetes), Docker, and Tekton is required. Furthermore, experience in GIT, CICD, TDD, and DevOps, along with designing and implementing enterprise-scale RESTful services, is necessary. Your role will involve mentoring and evolving staff Software Engineers, collaborating with outside vendors or consultants, ensuring project or product integrity, and acting as a point of contact for technical issues. A proactive approach, customer satisfaction focus, motivation, and adaptability in a fast-paced environment are preferred qualifications. In summary, the ideal candidate for this position at Delta Tech Hub should be a seasoned professional with a strong background in software development, architecture, and technology solutions. By leveraging your technical expertise, leadership skills, and commitment to innovation, you will contribute significantly to Delta's technology-driven business and its mission of fostering global connectivity and social good.,
Posted 2 weeks ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
73564 Jobs | Dublin
Wipro
27625 Jobs | Bengaluru
Accenture in India
22690 Jobs | Dublin 2
EY
20638 Jobs | London
Uplers
15021 Jobs | Ahmedabad
Bajaj Finserv
14304 Jobs |
IBM
14148 Jobs | Armonk
Accenture services Pvt Ltd
13138 Jobs |
Capgemini
12942 Jobs | Paris,France
Amazon.com
12683 Jobs |