Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
3.0 - 7.0 years
0 Lacs
pune, maharashtra
On-site
The ideal candidate for this position should have hands-on experience in Site Reliability and DevOps, along with expertise in Kubernetes, Docker, Terraform, and CI/CD. As a Level M professional, you will be working in US EST hours with Pune being the preferred location. Your responsibilities will include designing, developing, and deploying software systems and infrastructure to enhance reliability, scalability, and performance. You will be expected to identify manual processes that can be automated to improve operational efficiency. Implementing monitoring and alerting systems to proactively identify and address issues will be a key part of your role. Collaborating with customers for architecture reviews and developing new features to enhance the reliability and scalability of the platform will also be part of your duties. Working closely with various application teams to understand platform issues and design solutions for monitoring and issue resolution will be essential. You will be responsible for designing recovery and resiliency strategies for different applications. Identifying opportunities for technological improvements and the need for new tools to support capacity planning, disaster recovery, and resiliency will also be part of your role. Additionally, you will architect and implement packages/modules that can serve as blueprints for implementation by different application teams.,
Posted 1 day ago
10.0 - 14.0 years
0 Lacs
karnataka
On-site
We are looking for a skilled technical leader capable of developing tools and services to enhance the test automation, test reporting, and test debugging processes for our team of automation engineers. Your role will involve guiding the automation of test infrastructure provisioning, scaling, and more. Additionally, as part of the team, you will be responsible for building frameworks to facilitate the integration of automated testing into CI/CD pipelines across various languages and frameworks. Your technical expertise and leadership will play a crucial role in fostering a culture of site reliability, test automation, shared ownership, and transparency. Your responsibilities will include building and supporting tools and services to enhance our automated test platform, researching and implementing ways to improve user experience and reduce manual tasks, leading infrastructure automation efforts, spearheading test automation frameworks and CI/CD integration, managing test environments and infrastructure, promoting agile processes and fast release cycles, architecting monitoring and alerting systems for comprehensive test lifecycle observability, developing playbooks for incident response and disaster recovery, and instilling a culture of site reliability, shared ownership, and automation throughout the organization. You will also be involved in technical design reviews, code quality processes, and utilizing GenAI/ML tools for test development and triage processes. The ideal candidate will have a strong problem-solving ability, a passion for building usable and scalable systems, the ability to collaborate effectively across teams, a sense of responsibility and ownership, excellent communication skills, comfort with ambiguity, and a curiosity for constant learning and professional growth. Additionally, you should possess over 10 years of experience in product quality, automation, and/or DevOps, hold a Bachelor's or Master's degree in Computer Science, Engineering, or a related field, demonstrate hands-on experience in developing, deploying, and securing services, particularly in regulated environments. Experience with software development productivity metrics, infrastructure provisioning using code and scripts, networking, big data technologies, databases, Linux administration, microservices, distributed systems, performance optimizations, public cloud providers, and VMWare is preferred. Experience in cybersecurity and AI/ML testing would be an added advantage. If you are excited about tackling complex challenges, driving innovation, and leading technical initiatives to enhance test automation processes, we encourage you to apply for this role and be a part of our dynamic team.,
Posted 2 days ago
6.0 - 11.0 years
8 - 12 Lacs
Mumbai, Delhi / NCR, Bengaluru
Work from Office
Observability & SRE Engineer Azure & Splunk (3 Months) Role Overview : We are looking for a highly skilled Observability and Site Reliability Engineer (SRE) with strong experience in Splunk integration with Azure, cloud-native monitoring, and chaos engineering practices. The ideal candidate will play a key role in improving system reliability, monitoring capabilities, and resilience across our Azure cloud infrastructure. Key Responsibilities : Design, implement, and manage observability solutions using Splunk integrated with Azure Monitor, Log Analytics, and Application Insights. Develop and maintain monitoring, alerting, and dashboarding solutions to ensure system health and performance. Implement Azure Chaos Engineering tools and scenarios to proactively test the resilience of cloud applications. Collaborate with application and infrastructure teams to identify SLOs/SLIs and define reliability objectives. Automate incident detection and response processes using Splunk alerts, Azure Automation, and scripting. Conduct root cause analysis (RCA) and post-incident reviews to drive continuous improvement. Drive the adoption of SRE principles and practices across engineering teams. Location - Delhi / NCR, Bangalore, Mumbai, Pune
Posted 1 week ago
5.0 - 10.0 years
8 - 12 Lacs
Surat
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 week ago
5.0 - 10.0 years
8 - 12 Lacs
Gurugram
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 week ago
5.0 - 10.0 years
8 - 12 Lacs
Kanpur
Work from Office
Job Description : We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 week ago
5.0 - 10.0 years
8 - 12 Lacs
Kolkata
Work from Office
Job Description : We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 week ago
5.0 - 10.0 years
8 - 12 Lacs
Ahmedabad
Remote
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 week ago
5.0 - 10.0 years
8 - 12 Lacs
Chennai
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 week ago
6.0 - 9.0 years
12 - 16 Lacs
Pune
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools: logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD: GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) ? - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 week ago
4.0 - 5.0 years
8 - 12 Lacs
Gurugram
Work from Office
Position Overview : We are seeking an SRE to join our high-impact platform engineering team. You will maintain SLAs for real-time services deployed across hybrid clouds and Kubernetes clusters, contributing to automation, observability, and availability goals. Roles and Responsibilities : - Monitor application and infrastructure metrics; build dashboards and alerts (Prometheus, Grafana, ELK). - Automate health checks, incident remediation, and reliability guardrails. - Manage on-call rotations, conduct root cause analysis, and implement postmortem action plans. - Define and track SLOs, SLIs, and error budgets. - Use chaos engineering and resilience testing to ensure fault tolerance. Must Have Skills : - 4 - 5 years of experience in managing production-grade Kubernetes clusters and cloud-native platforms. - Proficiency in Linux system internals, containers, and networking. - Scripting/automation expertise in Python/Go/Shell. - Familiarity with incident management, runbooks, and observability standards. - Exposure to service discovery, DNS routing, and load balancing is a bonus. Qualification : BE/BTech/MCA/ME/MTech/MS in Computer Science or a related technical field or equivalent practical experience. Location : Gurugaon / Onsite. About Nomiso : Our mission is to Empower and Enhance the lives of our customers, through efficient solutions for their At Nomiso we encourage entrepreneurial spirit to learn, grow and improve. A great workplace, thrives on ideas and opportunities. We're in pursuit of colleagues who share similar passions, are nimble and thrive when challenged. We offer a positive, stimulating and fun environment with opportunities to grow, a fast-paced approach to innovation, and a place where your views are valued and encouraged. We are an equal opportunity employer and are committed to diversity, equity, and inclusion. We do not discriminate on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other protected characteristics.
Posted 1 week ago
6.0 - 10.0 years
0 Lacs
karnataka
On-site
The Lead Associate, Release Management position at BetaNXT involves supporting the scheduling, coordination, and verification of all Mediant's technology/application releases. You will collaborate with QA and Developer Leads to ensure builds are validated before deployment to Production and organize artifacts for release. Additionally, you will work with IT management to enhance software engineering processes and practices related to building, deploying, updating software, and maintaining environments. As part of your responsibilities, you will assist in triaging issues in Production, performing Root Cause Analysis to identify bug introductions, and providing feedback to enhance engineering processes. Your key functions will include implementing and managing release processes for software applications, APIs, and various IT initiatives. You will validate release features, prepare release instructions, and coordinate resources required for deployment. Working closely with QA Leads, you will establish and maintain a bug triage process, prioritize bugs for fixes, and ensure timely resolution by the scrum team. Collaboration with Developers, QA, and DevOps teams to identify and evaluate risks related to releases is essential. You will conduct Root Cause Analysis for discovered bugs, assist in troubleshooting production issues, and coordinate resources to address them. Managing projects and interdependencies to ensure production readiness for all system updates will also be part of your role. To be successful in this position, you should have at least 6+ years of experience and be familiar with build, deployment, and versioning software such as Bamboo and BitBucket. Experience working in a Cloud environment, preferably AWS, is required. Previous experience in the financial services and securities industry is preferred. You should be comfortable testing software applications, APIs, and database objects/SQL, with experience in DevOps, Site Reliability, or Release Management for a rapidly growing company. Familiarity with software development tools like GIT, GitLab, Docker, Postman, and Splunk is beneficial. A B.S degree is required, while an advanced degree or equivalent experience is preferred. Strong project management and communication skills are necessary, along with experience in Software Quality Assurance or Verification of Release Builds. Experience with build and release processes, especially deploying in a Cloud environment, is preferred. Familiarity with Agile/Scrum development methodologies and SQL skills to write queries and understand existing stored procedures and functions are also valuable assets in this role.,
Posted 1 week ago
2.0 - 6.0 years
0 Lacs
karnataka
On-site
As a Site Reliability Engineer II at JPMorgan Chase within the Corporate Technology, you will play a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions. You will use technology to solve business problems and leverage software engineering best practices as the team strives towards excellence. Your responsibilities will include executing small to medium projects independently with initial direction and eventually designing and delivering projects by yourself. Collaborating with cross-functional teams will provide you with the opportunity to continually enhance your knowledge about JPMorgan Chase's business and relevant technologies. You will leverage technology to solve business problems by writing high-quality, maintainable, and robust code following best practices in software engineering. Additionally, you will participate in triaging, examining, diagnosing, and resolving incidents, working with others to solve problems at their root. Recognizing the toil within your role, you will proactively work towards eliminating it through systems engineering or updating application code. Understanding observability patterns is crucial, and you will strive to implement and improve service level indicators, objectives monitoring, and alerting solutions for optimal transparency and analysis. In terms of qualifications, capabilities, and skills, you should have formal training or certification on software engineering concepts and a minimum of 2 years of applied experience. Ability to code in at least one programming language is essential, along with experience maintaining a Cloud-based infrastructure. Familiarity with site reliability concepts, principles, and practices is required, as well as observability practices using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others. Knowledge of containers or common Server OS like Linux and Windows is preferred. Emerging knowledge of software, applications, and technical processes within a given technical discipline and continuous integration and continuous delivery tools are beneficial. You should also have familiarity with common networking technologies and be able to work in a large, collaborative team, demonstrating willingness to vocalize ideas with peers and managers. Preferred qualifications, capabilities, and skills include familiarity with popular IDEs for Software Development and knowledge of using GENAI tools such as Copilot or Windsurf as Code Assistants. General knowledge of the financial services industry is preferred, along with an understanding of NFRs. By joining JPMorgan Chase as a Site Reliability Engineer II, you will have the opportunity to contribute to the reliability and efficiency of the organization's technological infrastructure while continuously enhancing your skills and knowledge in software engineering practices.,
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
haryana
On-site
Cosm is a global technology company that brings experiences to life in immersive environments. We help our partners create spaces and content that blur the lines of real and virtual across three primary markets: Sports and Entertainment, Science and Education, and Parks and Attractions. Cosm was born from the fusion of some of the greatest innovators in the history of technology. Evans & Sutherland, Spitz, Inc., and Cosm Immersive combined forces to power the immersive experiences of the future as Cosm. Innovation is in our DNA. The Incident Response Analyst is a mid-level role that is responsible for monitoring the overall performance of Cosm's infrastructure and systems to ensure Site Reliability for Cosm's Live Entertainment Venues and Live Broadcasts. This includes identifying and resolving high visibility incidents and escalations, contributing to the strategic planning to prevent incidents, and playing a pivotal role in shaping the overall operating framework. Responsibilities - Independently monitor and manage Cosm's technical operations, including incident resolution. - Lead the diagnosis, prioritization, and documentation of critical incidents. - Act as a primary point of contact for high-level impact incidents and escalations. - Collaborate with engineering to implement incident remediations and follow-up. - Generate and deliver regular incident and operational reports to stakeholders. - Coordinate upgrades, outages, and planned activities with cross-functional teams. - Provide mentorship and guidance to less-tenured team members. - Contribute to refining and enhancing Ops Center tools, processes, and procedures. - Work closely with field services teams to gather feedback and improve reliability. - Collaborate with B2C Customer Service to monitor incidents affecting customer experience. - Ability to be a part of an on-call rotation, occasionally working nights and weekends to support high-priority business events. Experience - Bachelor's degree in Computer Science, Information Technology, or a related field. - 5+ years of experience in an Ops Center, incident management, or a similar role. - Proficiency in incident management tools and systems (e.g., Grafana, ServiceNow). - Experience supporting infrastructures and configuring SaaS applications. - Strong analytical, communication, and problem-solving skills. - Ability to lead and work effectively in a team environment. - Experience with automation tools and platforms. - Knowledge of ITIL or similar incident/service management frameworks. - Demonstrated ability to manage high-pressure situations and multiple incidents. - Previous experience in a 24/7 operations center. Work Environment Available for overtime and weekends as the schedule varies depending on site operational needs, flexibility required. Cosm is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.,
Posted 2 weeks ago
10.0 - 14.0 years
0 Lacs
karnataka
On-site
As a key member of our team at Delta Tech Hub, you will play a crucial role in shaping the future of our technology-driven business. Your primary responsibility will be to lead the technical direction for the Digital Mobile AWS cloud strategy, specifically focusing on employee-facing applications. You will be actively involved in defining the technical architecture, ensuring the success of cloud and non-cloud solutions, and overseeing the software application development lifecycle to maintain a high level of quality in each project. Your duties will include developing and defining business and technical operating procedures, creating architectural diagrams and documentation, identifying application and business requirements through effective communication with technical teams and clients, and staying updated on new technologies to drive implementation and innovation. Your expertise in areas such as App/Integration, AWS, DevSecOps, Site Reliability, Platform, and Security will be vital in achieving our objectives. To excel in this role, you must possess a Bachelor's degree in computer science, Information Systems, or a related technical field, along with a minimum of 10 years of experience in Mobile Architecture or Lead Software Engineering at an enterprise scale. Your background should include designing enterprise software solutions, applications, integrations, security, and platform engineering for both mobile and web consumers. Additionally, you should have a proven ability to mentor and develop Mobile Software Development Engineers, define architectural design patterns, and implement various AWS services and functionalities. Your proficiency in agile methodologies, DevOps principles, object-oriented design patterns, modern tooling, and monitoring/reliability best practices will be essential for success in this role. Expertise in application security principles, cloud-native development in various programming languages, and designing performant networking systems in on-prem and cloud environments is also required. Being part of a diverse and inclusive team, you should prioritize safety and security, embrace different perspectives, and demonstrate strong communication and collaboration skills. Preferred qualifications include experience in an airline technology environment, working with microservices on a cloud platform, and managing all phases of the software development lifecycle. Your ability to estimate financial implications of architectural decisions and build flexible APIs and microservice solutions will give you a competitive edge in this role at Delta Tech Hub.,
Posted 2 weeks ago
4.0 - 9.0 years
9 - 19 Lacs
Bengaluru
Hybrid
Dear candidate, We are looking SRE ( Site Reliability Engineer) for Bangalore location. Requirement 1: SRE(Artifactory) * GitLab setup & administration * Implement best practices to improve pipeline performance * AWS with Terraform coding * Linux administration & troubleshooting * Strong coding skills in any language (preferably Python) * Familiar with container technologies (Docker / Kubernetes) * Good knowledge of infrastructure and application monitoring (Prometheus / Grafana / Could watch) Requirement 2: SRE(GITLAB) * JFrog Artifactory setup & administration * JFrog XRAY setup & administration * AWS with Terraform coding * Linux administration & troubleshooting * Strong coding skills in any language (preferably Python) * Familiar with container technologies (Docker / Kubernetes) * Good knowledge of infrastructure and application monitoring (Prometheus / Grafana / Could watch) Location:- Bangalore (Whitefield) Work mode:- Hybrid Interview Mode:- Face to face (Saturday, 5th July 2025) If interested, please share your cv at ruchika.gahlawat@innovasolutions.com.
Posted 1 month ago
5.0 - 10.0 years
7 - 12 Lacs
Surat
Work from Office
Job Description : We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 month ago
5.0 - 10.0 years
8 - 12 Lacs
Jaipur
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 month ago
5.0 - 10.0 years
8 - 12 Lacs
Bengaluru
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 month ago
5.0 - 10.0 years
8 - 12 Lacs
Lucknow
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 month ago
5.0 - 10.0 years
8 - 12 Lacs
Hyderabad
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 month ago
5.0 - 10.0 years
8 - 12 Lacs
Kolkata
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 month ago
5.0 - 10.0 years
8 - 12 Lacs
Nagpur
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 month ago
5.0 - 10.0 years
8 - 12 Lacs
Mumbai
Work from Office
We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 month ago
5.0 - 10.0 years
8 - 12 Lacs
Chandigarh
Work from Office
Job Description : We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Must-Have Skills : - 5-10 years in DevOps / SRE / Infra roles - Kubernetes (GKE preferred) - IaC with Terraform & Helm - CI/CD : GitHub Actions + GitOps (ArgoCD / Flux) - Cloud architecture expertise (IAM, VPC, Secrets) - Strong scripting/coding & backend debugging skills (Node.js, Django, etc.) - Incident management with tools like Datadog & PagerDuty - Excellent communicator & documenter Tech Stack : - GKE, Kubernetes, Terraform, Helm - GitHub Actions, ArgoCD / Flux - Datadog, PagerDuty - CloudSQL, Cloudflare, IAM, Secrets You're : - A proactive team player & strong individual contributor - Confident yet humble - Curious, driven & always learning - Not afraid to solve deep infrastructure challenges
Posted 1 month ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
39581 Jobs | Dublin
Wipro
19070 Jobs | Bengaluru
Accenture in India
14409 Jobs | Dublin 2
EY
14248 Jobs | London
Uplers
10536 Jobs | Ahmedabad
Amazon
10262 Jobs | Seattle,WA
IBM
9120 Jobs | Armonk
Oracle
8925 Jobs | Redwood City
Capgemini
7500 Jobs | Paris,France
Virtusa
7132 Jobs | Southborough