Jobs
Interviews

938 Prometheus Jobs - Page 37

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

4 - 8 years

13 - 18 Lacs

Hyderabad

Work from Office

About The Role #body.unify div.unify-button-container .unify-apply-now:focus, #body.unify div.unify-button-container .unify-apply-now:hover{color:rgb(0,0,0) !important;}#body.unify div.unify-button-container .unify-apply-now:focus, #body.unify div.unify-button-container .unify-apply-now:hover{background:rgba(230,231,232,1.0) !important;} Apply now Senior Cloud Solutions Technologist Job Location (Short): Hyderabad, India Workplace Type: Hybrid Business Unit: ALI Req Id: 1542 .buttontextb0d7f9bdde9da229 a{ border1px solid transparent; } .buttontextb0d7f9bdde9da229 a:focus{ border1px dashed #5B94FF !important; outlinenone !important; } Responsibilities Design, build, and manage Azure infrastructure, ensuring high availability, performance, and security. Implement DevOps practices using Azure DevOps, CI/CD pipelines, and infrastructure-as-code (IaC) tools. Manage and optimize Azure Kubernetes Service (AKS) clusters, ensuring scalability, security, and efficiency of containerized applications. Configure and maintain Azure-based servers and Citrix Virtual App environments. Optimize performance, security, and disaster recovery strategies across Azure infrastructure, AKS clusters, and Citrix environments. Automate cloud operations using scripting (Python, Bash, PowerShell) and configuration management tools (Puppet and Terraform). Implement monitoring, logging, and alerting strategies for cloud services, applications, and infrastructure. Apply cloud security best practices, ensuring compliance with organizational and regulatory security standards. Collaborate with developers, architects, and infrastructure teams to streamline cloud deployments and ensure operational efficiency. Participate in an On-Call rotation to provide support for critical cloud systems. Education / Qualifications Hexagon is seeking a highly motivated and experienced Site Reliability Engineer (SRE) to design, build, and manage our Azure cloud infrastructure. This role will be instrumental in implementing DevOps practices using Azure DevOps, optimizing and managing Azure Kubernetes Service (AKS) clusters for containerized applications, and configuring and maintaining Azure-based servers and Citrix Virtual Apps and Desktops environments. Should have relevant bachelors degree in Engineering stream. Proficiency with monitoring tools (e.g., Datadog, Prometheus, Grafana, LogicMonitor). Strong understanding of IT infrastructure, including servers, networks, and cloud environments across different OS platforms. Experience with virtualization platforms and cloud security strategies. Hands-on experience with container orchestration (e.g., Azure Kubernetes Service (AKS) or equivalent). Proficient in automation tools (e.g., Puppet and Terraform) and scripting languages (Python, Bash, PowerShell). Experience in setting up alerting and monitoring for containerized and microservices environments (Kubernetes, Docker). Familiarity with DevOps best practices, including CI/CD pipeline development. Strong problem-solving and analytical skills, with a focus on proactive identification and resolution of issues. Excellent verbal and written communication skills, with the ability to explain technical concepts to non-technical stakeholders. Preferred Qualifications: Azure certifications (e.g., Azure Administrator Associate, Azure Solutions Architect, or Azure DevOps Engineer Expert) are highly desirable. Experience with Citrix Virtual Apps and Desktops administration is a plus. About Hexagon Hexagon is the global leader in digital reality solutions, combining sensor, software and autonomous technologies. We are putting data to work to boost efficiency, productivity, quality and safety across industrial, manufacturing, infrastructure, public sector, and mobility applications. Our technologies are shaping production and people related ecosystems to become increasingly connected and autonomous – ensuring a scalable, sustainable future. Hexagon (Nasdaq StockholmHEXA B) has approximately 24,500 employees in 50 countries and net sales of approximately 5.4bn EUR. Learn more at?hexagon.com?and follow us?@HexagonAB. Hexagon’s R&D Centre in India Hexagon’s R&D Centre in India is the single largest R&D centre for the company globally. More than 2,000 talented engineers and developers create innovation from this centre that powers Hexagon's products and solutions. Hexagon’s R&D Centre delivers innovations and creative solutions for all business lines of Hexagon, including Asset Lifecycle Intelligence, Autonomous Solutions, Geosystems, Manufacturing Intelligence, and Safety, Infrastructure & Geospatial. It also hosts dedicated service teams for the global implementation of Hexagon’s products. R&D India – MAKES THINGS INTELLIGENT Asset Lifecycle Intelligence Produces insights across the asset lifecycle to design, construct, and operate more profitable, safe, and sustainable industrial facilities. Everyone is welcome At Hexagon, we believe that diverse and inclusive teams are critical to the success of our people and our business. Everyone is welcome—as an inclusive workplace, we do not discriminate. In fact, we embrace differences and are fully committed to creating equal opportunities, an inclusive environment, and fairness for all. Respect is the cornerstone of how we operate, so speak up and be yourself. You are valued here. .buttontext1c1d8f096aaf95bf a{ border1px solid transparent; } .buttontext1c1d8f096aaf95bf a:focus{ border1px dashed #0097ba !important; outlinenone !important; } #body.unify div.unify-button-container .unify-apply-now:focus, #body.unify div.unify-button-container .unify-apply-now:hover{color:rgb(0,0,0) !important;}#body.unify div.unify-button-container .unify-apply-now:focus, #body.unify div.unify-button-container .unify-apply-now:hover{background:rgba(230,231,232,1.0) !important;} Apply now

Posted 2 months ago

Apply

3 - 7 years

13 - 18 Lacs

Hyderabad

Work from Office

About The Role #body.unify div.unify-button-container .unify-apply-now:focus, #body.unify div.unify-button-container .unify-apply-now:hover{color:rgb(0,0,0) !important;}#body.unify div.unify-button-container .unify-apply-now:focus, #body.unify div.unify-button-container .unify-apply-now:hover{background:rgba(230,231,232,1.0) !important;} Apply now Cloud Solutions Consultant Job Location (Short): Hyderabad, India Workplace Type: Hybrid Business Unit: ALI Req Id: 1406 .buttontextb0d7f9bdde9da229 a{ border1px solid transparent; } .buttontextb0d7f9bdde9da229 a:focus{ border1px dashed #5B94FF !important; outlinenone !important; } Responsibilities Design, build, and manage Azure infrastructure, ensuring high availability, performance, and security. Implement DevOps practices using Azure DevOps, CI/CD pipelines, and infrastructure-as-code (IaC) tools. Manage and optimize Azure Kubernetes Service (AKS) clusters, ensuring scalability, security, and efficiency of containerized applications. Configure and maintain Azure-based servers and Citrix Virtual App environments. Optimize performance, security, and disaster recovery strategies across Azure infrastructure, AKS clusters, and Citrix environments. Automate cloud operations using scripting (Python, Bash, PowerShell) and configuration management tools (Puppet and Terraform). Implement monitoring, logging, and alerting strategies for cloud services, applications, and infrastructure. Apply cloud security best practices, ensuring compliance with organizational and regulatory security standards. Collaborate with developers, architects, and infrastructure teams to streamline cloud deployments and ensure operational efficiency. Participate in an On-Call rotation to provide support for critical cloud systems. Education / Qualifications Hexagon is seeking a highly motivated and experienced Cloud Solutions Consultant to design, build, and manage our Azure cloud infrastructure. This role will be instrumental in implementing DevOps practices using Azure DevOps, optimizing and managing Azure Kubernetes Service (AKS) clusters for containerized applications, and configuring and maintaining Azure-based servers and Citrix Virtual Apps and Desktops environments. Required Skills & Qualifications: A minimum of 6-10 years of relevant work experience. Should have a bachelors/masters degree in engineering. Proficiency with monitoring tools (e.g., Datadog, Prometheus, Grafana, LogicMonitor). Strong understanding of IT infrastructure, including servers, networks, and cloud environments across different OS platforms. Experience with virtualization platforms and cloud security strategies. Hands-on experience with container orchestration (e.g., Azure Kubernetes Service (AKS) or equivalent). Proficient in automation tools (e.g., Puppet and Terraform) and scripting languages (Python, Bash, PowerShell). Experience in setting up alerting and monitoring for containerized and microservices environments (Kubernetes, Docker). Familiarity with DevOps best practices, including CI/CD pipeline development. Strong problem-solving and analytical skills, with a focus on proactive identification and resolution of issues. Excellent verbal and written communication skills, with the ability to explain technical concepts to non-technical stakeholders. Preferred Qualifications: Azure certifications (e.g., Azure Administrator Associate, Azure Solutions Architect, or Azure DevOps Engineer Expert) are highly desirable. Experience with Citrix Virtual Apps and Desktops administration is a plus. About Hexagon Hexagon is the global leader in digital reality solutions, combining sensor, software and autonomous technologies. We are putting data to work to boost efficiency, productivity, quality and safety across industrial, manufacturing, infrastructure, public sector, and mobility applications. Our technologies are shaping production and people related ecosystems to become increasingly connected and autonomous – ensuring a scalable, sustainable future. Hexagon (Nasdaq StockholmHEXA B) has approximately 24,500 employees in 50 countries and net sales of approximately 5.4bn EUR. Learn more at?hexagon.com?and follow us?@HexagonAB. Hexagon’s R&D Centre in India Hexagon’s R&D Centre in India is the single largest R&D centre for the company globally. More than 2,000 talented engineers and developers create innovation from this centre that powers Hexagon's products and solutions. Hexagon’s R&D Centre delivers innovations and creative solutions for all business lines of Hexagon, including Asset Lifecycle Intelligence, Autonomous Solutions, Geosystems, Manufacturing Intelligence, and Safety, Infrastructure & Geospatial. It also hosts dedicated service teams for the global implementation of Hexagon’s products. R&D India – MAKES THINGS INTELLIGENT Asset Lifecycle Intelligence Produces insights across the asset lifecycle to design, construct, and operate more profitable, safe, and sustainable industrial facilities. Everyone is welcome .buttontext1c1d8f096aaf95bf a{ border1px solid transparent; } .buttontext1c1d8f096aaf95bf a:focus{ border1px dashed #0097ba !important; outlinenone !important; } #body.unify div.unify-button-container .unify-apply-now:focus, #body.unify div.unify-button-container .unify-apply-now:hover{color:rgb(0,0,0) !important;}#body.unify div.unify-button-container .unify-apply-now:focus, #body.unify div.unify-button-container .unify-apply-now:hover{background:rgba(230,231,232,1.0) !important;} Apply now

Posted 2 months ago

Apply

4 - 9 years

0 Lacs

Bengaluru

Remote

This is Rajlaxmi from the HR department of ISoftStone Inc. we are looking for a TechOps Engineer with 5+ years of experience. Please find the JD below, If Interested, Please Drop CV at "rajlaxmi.chowdhury@isoftstone.com". Location- Bangalore/Remote Relevant Exp- 5+ years Overview We are seeking a highly motivated and skilled TechOps Engineer to join our team. The ideal candidate will be responsible for ensuring the smooth operation and performance of GTP services, provide technical support, troubleshooting issues, and implementing solution to optimize efficiency. This is an opportunity to work in a dynamic and innovative environment. We foster a collaborative and inclusive culture that value creativity, initiative and continuous learning. If you are a self-motivated professional with a passion for technology and a drive for excellence, we invite you to apply and be an integral part of our team. Career progression opportunities exist for suitably skilled and motivated individuals in the wider GTP function. Qualifications: Bachelor's degree in Computer Science, Information Technology, or related field. Certified in ITIL v3 or v4 foundation is a preferred. Excellent communication skills and ability to articulate technical issues / requirements. Excellent problem-solving and troubleshooting skills. Preferred Skills: Demonstrate comprehensive understanding of ITIL processes and best practices. Demonstrate comprehensive understanding in various monitoring systems such as Dynatrace, Sentry, Grafana, Prometheus, Azure Monitor, GCP Operation Suite, etc. Proficiency in Cloud technologies (e.g. AWS, Azure, GCP). Demonstrate understanding in operating Couchbase Database, MongoDB, as well as PostgreSQL is preferred. Demonstrate understanding of backup and disaster recovery concepts and tools to ensure the availability and recoverability of production systems in the event of a disaster. Certification in relevant technologies (e.g. Microsoft Azure, GCP) is a plus. Familiarity of DevOps practices such as CI/CD workflows, experience with GitHub Actions, and proficiency in using infrastructure automation tools Knowledge of software development lifecycle. Knowledge of containerization and orchestration tools such as Kubernetes Technologies and Tools.

Posted 2 months ago

Apply

3 - 8 years

4 - 8 Lacs

Bengaluru

Work from Office

Project Role : Software Development Engineer Project Role Description : Analyze, design, code and test multiple components of application code across one or more clients. Perform maintenance, enhancements and/or development work. Must have skills : Python (Programming Language) Good to have skills : NA Minimum 3 year(s) of experience is required Educational Qualification : bachelors degree in computer science Engineering or a related field Summary :This is a hands-on, technical role where the candidate will design and implement a DevOps Maturity Model by integrating multiple DevOps tools and building backend APIs to visualize data on a front-end interface. The candidate will work closely with cross-functional teams to enable DevOps culture, ensure system reliability, and drive continuous improvement. Roles & Responsibilities:1.DevOps Maturity Model:Design and develop a model to assess and improve DevOps practices by integrating tools like Jenkins, GitLab, and Azure DevOps.2.Backend Development:Build scalable and efficient backend APIs using Python and Azure Serverless.3.Frontend Development:Develop intuitive and responsive front-end interfaces using Angular and Vue.js for data visualization.4.Monitoring & Automation:Implement monitoring, logging, and alerting solutions. Develop automation scripts for reporting and analysis.5.Collaboration:Work with cross-functional teams to resolve production-level disruptions and enable DevOps culture.6.Documentation:Document architecture, design, and implementation details. Professional & Technical Skills: 1.Backend Development :Python and experience with Azure Serverless2.Frontend DevelopmentAngular and Vue.js.3.Databases:Familiarity with Azure SQL, Cosmos DB, or PostgreSQL.4.Containerization:Good understanding of Docker and Kubernetes for basic troubleshooting.5.Networking:Basic understanding of TCP/IP, HTTP, DNS, VPN, and cloud networking.6.Monitoring & Logging:Experience with monitoring tools like Prometheus, Grafana, or Datadog. Additional Information:1.The candidate should have a minimum of 3 years of experience in Python & Angular full stack.2.This position is based at our Bengaluru office.3.A 15 years full time education is required (bachelor's degree in computer science, Engineering, or a related field). Qualification bachelors degree in computer science Engineering or a related field

Posted 2 months ago

Apply

4 - 9 years

20 - 25 Lacs

Bengaluru

Work from Office

Job Title: DevOps Engineer Experience Required: 45 Years Location: 100% Work from Office Schedule: Monday to Friday | 7:00 AM 4:00 PM IST (Brisbane Time) Job Summary: We are hiring a DevOps Engineer with a strong foundation in CI/CD pipelines , container orchestration , and infrastructure automation . The ideal candidate must have hands-on experience with Docker , Kubernetes , Terraform , and scripting languages . Mandatory Technical Skills & Tools (Keywords): DevOps CI/CD Pipelines (e.g., Jenkins, GitLab CI/CD, CircleCI) Docker Kubernetes Infrastructure as Code (IaC) Terraform , CloudFormation Linux Systems Administration Automation Scripting – Python , Shell/Bash Cloud Platforms – AWS, Azure, or GCP Monitoring & Logging – Prometheus, Grafana, ELK, or similar Source Control – Git, GitHub, GitLab Configuration Management – Ansible, Chef, or Puppet Key Responsibilities: Design, build, and maintain automated CI/CD pipelines . Manage and scale containerized applications using Docker and Kubernetes . Write and maintain Infrastructure as Code (IaC) using Terraform or similar tools. Develop automation scripts for deployment, monitoring, and infrastructure operations. Ensure system reliability , scalability, and performance. Collaborate with software engineers, QA, and infrastructure teams to ensure seamless deployment cycles. Troubleshoot and resolve infrastructure issues in a proactive manner. Qualifications: Bachelor's degree in Computer Science, Engineering, or related field. 4–5 years of experience in a DevOps or Site Reliability Engineering role. Proven experience in CI/CD , containers , and cloud infrastructure . Strong analytical and problem-solving skills. Work Environment: 100% Work from Office 5 Days a Week (Monday to Friday) Working hours aligned with Brisbane Time : 7:00 AM – 4:00 PM IST

Posted 2 months ago

Apply

5 - 7 years

10 - 12 Lacs

Bengaluru

Work from Office

Experience in designing and building high-performance, distributed systems. Familiarity with cloud services (AWS, GCP, Azure) and containerization (Docker,Kubernetes). Strong knowledge of asynchronous programming, multithreading, and parallelprocessing. Experience in integrating external APIs, function calling, and plugin-basedarchitectures. Experience with performance monitoring and logging tools (Prometheus, Grafana,ELK stack). Familiarity with search engines, RAG pipelines, and hybrid search strategies. Experience in designing and building high-performance, distributed systems.

Posted 2 months ago

Apply

10 - 18 years

20 - 27 Lacs

Hyderabad, Ahmedabad

Work from Office

Hi Aspirants, Greetings from Tech Block - IT Software & Services - Hyderabad & Ahmedabad !!! About TechBlocks TechBlocks is a global digital product engineering company with 16+ years of experience helping Fortune 500 enterprises and high-growth brands accelerate innovation, modernize technology, and drive digital transformation. From cloud solutions and data engineering to experience design and platform modernization, we help businesses solve complex challenges and unlock new growth opportunities. Job Title: We are looking for SRE Manager and SRE Team Leader (Site Reliability Manager) Location : Hyderabad & Ahmedabad Employment Type: Full-Time Work Model - Hybrid Model ( 3 Days WFO & 2 Days WFH) Job Summary : An SRE Manager is responsible for overseeing a team of Site Reliability Engineers (SREs) and ensuring the reliability, performance, and availability of a company's digital infrastructure . They manage the SRE team, drive automation initiatives, and collaborate with other departments to ensure seamless operations and alignment with business objectives. Experience Required: 10+ years total experience, with 3+ years in a leadership role in SRE (Site Reliability Engineer) or Cloud Operations. Technical Knowledge and Skills: Mandatory: Deep understanding of Kubernetes, GKE, Terraform , and Grafana / Prometheus / Splunk / DataDog Cloud: Advanced GCP administration / or any cloud CI/CD: Jenkins, Argo CD, GitHub Actions Incident Management: Full lifecycle, tools like OpsGenie Nice to Have : Knowledge of service mesh and observability stacks Strong scripting skills (Python, Bash) Big Query/Dataflow exposure for telemetry Scope: Build and lead a team of SREs Standardize practices for reliability, alerting, and response Engage with Engineering and Product leaders Roles and Responsibilities: Establish and lead the implementation of organizational reliability strategies, aligning SLAs, SLOs, and Error Budgets with business goals and customer expectations. Develop and institutionalize incident response frameworks, including escalation policies, on-call scheduling, service ownership mapping, and RCA process governance. Lead technical reviews for infrastructure reliability design, high-availability architectures, and resiliency patterns across distributed cloud services. Champion observability and monitoring culture by standardizing tooling, alert definitions, dashboard templates, and telemetry data schemas across all product teams. Drive continuous improvement through operational maturity assessments, toil elimination initiatives, and SRE OKRs aligned with product objectives. Collaborate with cloud engineering and platform teams to introduce self-healing systems, capacity-aware autoscaling, and latency-optimized service mesh patterns. Act as the principal escalation point for reliability-related concerns and ensure incident retrospectives lead to measurable improvements in uptime and MTTR. Own runbook standardization, capacity planning, failure mode analysis, and production readiness reviews for new feature launches. Mentor and develop a high-performing SRE team, fostering a proactive ownership culture, encouraging cross-functional knowledge sharing, and establishing technical career pathways. Collaborate with leadership, delivery, and customer stakeholders to define reliability goals, track performance, and demonstrate ROI on SRE investments Note : Please send me updated resume to kranthikt@tblocks.com / Reach me on 8522804902 Warm Regards, Kranthi Kumar| kranthikt@tblocks.com Contact: 8522804902 Senior Talent Acquisition Specialist Toronto | Ahmedabad | Hyderabad | Pune www.tblocks.com This communication may be privileged and contain confidential information intended only for the recipients to whom it was intended to be sent. Any unauthorized disclosure, copying, other distribution of this communication, or taking any action on its contents is strictly prohibited. If you have received this message in error, please notify us immediately and delete this message without reading, copying, or forwarding it to anyone.

Posted 2 months ago

Apply

5 - 9 years

4 - 8 Lacs

Kolkata

Work from Office

We are looking for an experienced and motivated DevOps Engineer with 5 to 7 years of hands- on experience designing, implementing, and managing cloud infrastructure, particularly on Google Cloud Platform (GCP). The ideal candidate will have deep expertise in infrastructure, such as code (IaC), CI/CD pipelines, container orchestration, and cloud-native technologies. This role requires strong analytical skills, attention to detail, and a passion for optimizing cloud infrastructure performance and cost. Key Responsibilities Design, implement, and maintain scalable, reliable, and secure cloud infrastructure using Google Cloud Platform (GCP) services, including Compute Engine, Google Kubernetes Engine (GKE), Cloud Functions, Cloud Pub/Sub, BigQuery, and Cloud Storage. Build and manage CI/CD pipelines using GitHub, artifact repositories, and version control systems; enforce GitOps practices across environments. Leverage Docker, Kubernetes, and serverless architectures to support microservices and modern application deployments. Develop and manage Infrastructure as Code (IaC) using Terraform to automate environment provisioning. Implement observability tools like Prometheus, Grafana, and Google Cloud Monitoring for real-time system insights. Ensure best practices in cloud security, including IAM policies, encryption standards, and network security. Integrate and manage service mesh architectures such as Istio or Linkerd for secure and observable microservices communication. Troubleshoot and resolve infrastructure issues, ensuring high availability, disaster recovery, and performance optimization. Drive initiatives for cloud cost management and suggest optimization strategies for resource efficiency. Document technical architectures, processes, and procedures; ensure smooth knowledge transfer and operational readiness. Collaborate with cross-functional teams including Development, QA, Security, and Architecture teams to streamline deployment workflows. Preferred candidate profile 5+ years of DevOps/Cloud Engineering experience, with at least 3 years on GCP. Proficiency in Terraform, Docker, Kubernetes, and other DevOps toolchains. Strong experience with CI/CD tools, GitHub/GitLab, and artifact repositories. Deep understanding of cloud networking, VPCs, load balancing, firewalls, and VPNs. Expertise in monitoring and logging frameworks such as Prometheus, Grafana, and Stackdriver (Cloud Monitoring). Strong scripting skills in Python, Bash, or Go for automation tasks. Knowledge of data backup, high-availability systems, and disaster recovery strategies. Familiarity with service mesh technologies and microservices-based architecture. Excellent analytical, troubleshooting, and documentation skills. Effective communication and ability to work in a fast-paced, collaborative environment.

Posted 2 months ago

Apply

6 - 8 years

15 - 20 Lacs

Gurugram

Work from Office

A Candidate with good skills in RabbitMQ, Docker and Kubernetes, Jenkins Pipelines, Nexus, Nagios / appdynamics, ELK, Kafka, Redis, Prometheus.

Posted 2 months ago

Apply

8 - 13 years

25 - 30 Lacs

Bengaluru

Work from Office

About The Role About The Role At Kotak Mahindra Bank, customer experience is at the forefront of everything we do on Digital Platform. To help us build & run platform for Digital Applications , we are now looking for an experienced Sr. DevOps Engineer . They will be responsible for deploying product updates, identifying production issues and implementing integrations that meet our customers' needs. If you have a solid background in software engineering and are familiar with AWS EKS, ISTIO/Services Mesh/tetrate, Terraform,Helm Charts, KONG API Gateway, Azure DevOps, SpringBoot , Ansible, Kafka/MOngoDB we"™d love to speak with you. Objectives of this Role Building and setting up new development tools and infrastructure Understanding the needs of stakeholders and conveying this to developers Working on ways to automate and improve development and release processes Investigate and resolve technical issues Develop scripts to automate visualization Design procedures for system troubleshooting and maintenance Skills and Qualifications BSc in Computer Science, Engineering or relevant field Experience as a DevOps Engineer or similar software engineering role minimum 5 Yrs Proficient with git and git workflows Good knowledge of Kubernets EKS,Teraform,CICD ,AWS Problem-solving attitude Collaborative team spirit Testing and examining code written by others and analyzing results Identifying technical problems and developing software updates and "˜fixes"™ Working with software developers and software engineers to ensure that development follows established processes and works as intended Monitoring the systems and setup required Tools Daily and Monthly Responsibilities Deploy updates and fixes Provide Level 3 technical support Build tools to reduce occurrences of errors and improve customer experience Develop software to integrate with internal back-end systems Perform root cause analysis for production errors

Posted 2 months ago

Apply

7 - 11 years

17 - 22 Lacs

Bengaluru

Work from Office

At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation. Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where cloud meets global infrastructure ? We are looking for a top-tier SRE to drive Logs, Metrics, and Alerting , with a deep focus on Alerting automation at massive scale. Why This Role is Unique: Our SaaS is hybrid – running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What Youll Do: Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. ?? Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit? ? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge . If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you! The About The Role is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise – Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code – Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience – Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering – Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery – Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability – Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . Data & Analytics in Observability – Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.) . Security & Compliance Awareness – Understanding of secure-by-design principles for monitoring & alerting . Mentorship & Collaboration – Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers . High Availability / Disaster Recovery Experience with HA/DR and Migration Qualifications Typically, it requires at least 18 years of related experience with a bachelor’s degree, 15 years and a master’s degree, or a PhD with 12 years’ experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where youll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace. The About The Role is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Please note that F5 only contacts candidates through F5 email address (ending with @f5.com) or auto email notification from Workday (ending with f5.com or @myworkday.com ) . Equal Employment Opportunity It is the policy of F5 to provide equal employment opportunities to all employees and employment applicants without regard to unlawful considerations of race, religion, color, national origin, sex, sexual orientation, gender identity or expression, age, sensory, physical, or mental disability, marital status, veteran or military status, genetic information, or any other classification protected by applicable local, state, or federal laws. This policy applies to all aspects of employment, including, but not limited to, hiring, job assignment, compensation, promotion, benefits, training, discipline, and termination. F5 offers a variety of reasonable accommodations for candidates . Requesting an accommodation is completely voluntary. F5 will assess the need for accommodations in the application process separately from those that may be needed to perform the job. Request by contacting accommodations@f5.com.

Posted 2 months ago

Apply

6 - 10 years

15 - 19 Lacs

Hyderabad, Ahmedabad

Hybrid

Summary: As a Senior SRE, you will ensure platform reliability, incident management, and performance optimization. You'll define SLIs/SLOs, contribute to robust observability practices, and drive proactive reliability engineering across services. Experience Required: 610 years of SRE or infrastructure engineering experience in cloud-native environments. Mandatory: Cloud: GCP (GKE, Load Balancing, VPN, IAM) Observability: Prometheus, Grafana, ELK, Datadog Containers & Orchestration: Kubernetes, Docker Incident Management: On-call, RCA, SLIs/SLOs IaC: Terraform, Helm Incident Tools: PagerDuty, OpsGenie Nice to Have : GCP Monitoring, Skywalking Service Mesh, API Gateway GCP Spanner, MongoDB (basic)

Posted 2 months ago

Apply

5 - 9 years

7 - 17 Lacs

Ahmedabad

Work from Office

We are seeking a highly skilled Senior DevSecOps/DevOps Engineer with extensive experience in cloud infrastructure, automation, and security best practices. The ideal candidate must have 5+ years of overall experience, with at least 3+ years of direct, hands-on Kubernetes management experience. The candidate must have strong expertise in building, managing, and optimizing Jenkins pipelines for CI/CD workflows, with a focus on incorporating DevSecOps practices into the pipeline. Key Responsibilities: Design, deploy, and maintain Kubernetes clusters in cloud and/or on-premises environments. Build and maintain Jenkins pipelines for CI/CD, ensuring secure, automated, and efficient delivery processes. Integrate security checks (static code analysis, image scanning, etc.) directly into Jenkins pipelines. Manage Infrastructure as Code ( IaC ) using Terraform , Helm , and similar tools. Develop, maintain, and secure containerized applications using Docker and Kubernetes best practices. Implement monitoring, logging, and alerting using Prometheus , Grafana , and the ELK/EFK stack . Implement Kubernetes security practices including RBAC , network policies , and secrets management . Lead incident response efforts, root cause analysis, and system hardening initiatives. Collaborate with developers and security teams to embed security early in the development lifecycle (Shift-Left Security). Research, recommend, and implement best practices for DevSecOps and Kubernetes operations. Required Skills and Qualifications: 5+ years of experience in DevOps, Site Reliability Engineering, or Platform Engineering roles. 3+ years of hands-on Kubernetes experience, including cluster provisioning, scaling, and troubleshooting. Strong expertise in creating, optimizing, and managing Jenkins pipelines for end-to-end CI/CD. Experience in containerization and orchestration: Docker and Kubernetes . Solid experience with Terraform Helm , and other IaC tools. Experience securing Kubernetes clusters, containers, and cloud-native applications. Scripting proficiency ( Bash , Python , or Golang preferred). Knowledge of service meshes (Istio, Linkerd) and Kubernetes ingress management. Hands-on experience with security scanning tools (e.g., Trivy , Anchore , Aqua , SonarQube ) integrated into Jenkins. Strong understanding of IAM , RBAC , and secret management systems like Vault or AWS Secrets Manager .

Posted 2 months ago

Apply

9 - 14 years

40 - 50 Lacs

Bengaluru

Work from Office

Infrastructure Engineer Experience: 8 - 14 Years Employment Type: Full-Time Joining: Immediate Joiner Preferred Work location Bangalore / Hybrid (weekly 3 days work from office in rotational shift) *Need candidates only from Bangalore location only. About the role We are seeking an experienced Infrastructure Engineer to join our team at, a leader in blockchain technology and solutions. The ideal candidate will have a strong background in infrastructure management and a deep understanding of blockchain ecosystems. You will be responsible for designing, implementing, and maintaining the foundational infrastructure that supports our blockchain platforms, ensuring high availability, scalability, and security. Your expertise in AWS cloud technologies and database management, particularly with RDS, PostgreSQL, and Aurora, will be essential to our success. Responsibilities: Design & Deployment: Develop, deploy, and manage the infrastructure for blockchain nodes, databases, and network systems. Automation & Optimization: Automate infrastructure provisioning and maintenance tasks to enhance efficiency and reduce downtime. Optimize performance, reliability, and scalability across our blockchain systems. Monitoring & Troubleshooting: Set up monitoring and alerting systems to proactively manage infrastructure health. Quickly identify, troubleshoot, and resolve issues in production environments. Security Management: Implement robust security protocols, firewalls, and encryption to protect infrastructure and data from breaches and vulnerabilities. should be aware of VPC Virtual private cloud good in this Collaboration: Work closely with development, DevOps, and security teams to ensure seamless integration and support of blockchain applications. Support cross-functional teams in achieving network reliability and efficient resource management. Documentation: Maintain comprehensive documentation of infrastructure configurations, processes, and recovery plans. Continuous Improvement: Research and implement new tools and practices to improve infrastructure resiliency, performance, and cost-efficiency. Stay updated with blockchain infrastructure trends and industry best practices. Incident management: Incident dashboard management. Integrate dashboard using different power tools. Requirements: Educational Background: Bachelors degree in Computer Science, Information Technology, or a related field. Experience: Minimum of 7 years of experience in AWS infrastructure engineering, using terraforms, Terra-grunt, and Atlantis with incident management and resolution using automation (infrastructure as a code) , AWS infrastructure cloud provisioning. Should be aware of VPC Virtual private cloud. Technical Skills: Terraform and Automation AWS Cloud watch Hands-on experience with monitoring tools (e.g., Prometheus, Grafana). DevOps with CI/CD pipelines. Incident management resolution and reporting. Proficiency in cloud platforms (e.g., AWS, GCP, Azure) and container orchestration (e.g., Docker, Kubernetes). Strong knowledge of Linux/Unix system administration. Understanding of networking protocols, VPNs, and firewalls. Participate in on-call rotations to provide 24/7 support for critical systems. Security Knowledge: Strong understanding of security best practices, especially within blockchain environments. Soft Skills: Excellent problem-solving abilities, attention to detail, strong communication skills, and a proactive, team-oriented mindset. Experience working with consensus protocols and node architecture.

Posted 2 months ago

Apply

6 - 10 years

8 - 12 Lacs

Noida

Work from Office

Job Description Job Description We are looking for a highly skilled and experienced Senior DevOps Engineer to join our team. The ideal candidate will have 5-7 years of experience in a DevOps role and a proven track record of implementing and maintaining complex systems with a focus on automation, scalability, and security. The Senior DevOps Engineer will work closely with our development, operations, and security teams to ensure that our software is released quickly and reliably, with a focus on continuous integration and delivery. Requirements: Bachelors/Masters degree in Computer Science, Information Technology or related field 5-7 years of experience in a DevOps role Strong understanding of the SDLC and experience with working on fully Agile teams Proven experience in coding & scripting DevOps, Ant/Maven, Groovy, Terraform, Shell Scripting, and Helm Chart skills. Working experience with IaC tools like Terraform, CloudFormation, or ARM templates Strong experience with cloud computing platforms (e.g. Oracle Cloud (OCI), AWS, Azure, Google Cloud) Experience with containerization technologies (e.g. Docker, Kubernetes/EKS/AKS) Experience with continuous integration and delivery tools (e.g. Jenkins, GitLab CI/CD) Kubernetes - Experience with managing Kubernetes clusters and using kubectl for managing helm chart deployments, ingress services, and troubleshooting pods. OS Services Basic Knowledge to Manage, configuring, and troubleshooting Linux operating system issues (Linux), storage (block and object), networking (VPCs, proxies, and CDNs) Monitoring and instrumentation - Implement metrics in Prometheus, Grafana, Elastic, log management and related systems, and Slack/PagerDuty/Sentry integrations Strong know-how of modern distributed version control systems (e.g. Git, GitHub, GitLab etc) Strong troubleshooting and problem-solving skills, and ability to work well under pressure Excellent communication and collaboration skills, and ability to lead and mentor junior team members Career Level - IC3 Responsibilities Responsibilities Design, implement, and maintain automated build, deployment, and testing systems Experience in Taking Application Code and Third Party Products and Building Fully Automated Pipelines for Java Applications to Build, Test and Deploy Complex Systems for delivery in Cloud. Ability to Containerize an Application i.e. creating Docker Containers and Pushing them to an Artifact Repository for deployment on containerization solutions with OKE (Oracle container Engine for Kubernetes) using Helm Charts. Lead efforts to optimize the build and deployment processes for high-volume, high-availability systems Monitor production systems to ensure high availability and performance, and proactively identify and resolve issues Support and Troubleshoot Cloud Deployment and Environment Issues Create and maintain CI/CD pipelines using tools such as Jenkins, GitLab CI/CD Continuously improve the scalability and security of our systems, and lead efforts to implement best practices Participate in the design and implementation of new features and applications, and provide guidance on best practices for deployment and operations Work with security team to ensure compliance with industry and company standards, and implement security measures to protect against threats Keep up-to-date with emerging trends and technologies in DevOps, and make recommendations for improvement Lead and mentor junior DevOps engineers and collaborate with cross-functional teams to ensure successful delivery of projects Analyze, design develop, troubleshoot and debug software programs for commercial or end user applications. Writes code, completes programming and performs testing and debugging of applications. As a member of the software engineering division, you will analyze and integrate external customer specifications. Specify, design and implement modest changes to existing software architecture. Build new products and development tools. Build and execute unit tests and unit test plans. Review integration and regression test plans created by QA. Communicate with QA and porting engineering to discuss major changes to functionality. Work is non-routine and very complex, involving the application of advanced technical/business skills in area of specialization. Leading contributor individually and as a team member, providing direction and mentoring to others. BS or MS degree or equivalent experience relevant to functional area. 6+ years of software engineering or related experience.

Posted 2 months ago

Apply

8 - 13 years

15 - 20 Lacs

Pune

Work from Office

Job Description Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks etc. As a member of the software engineering division, you will take an active role in the definition and evolution of standard practices and procedures. You will be responsible for defining and developing software for tasks associated with the developing, designing and debugging of software applications or operating systems. Work is non-routine and very complex, involving the application of advanced technical/business skills in area of specialization. Leading contributor individually and as a team member, providing direction and mentoring to others. Career Level - IC4 Responsibilities Key Qualifications: Bachelors/Masters in Engineering or equivalent qualification. Minimum 8+ years of software engineering experience in software architecture, coding, development, and implementation. Experience in Microservices, application level design and architecting for SaaS applications and cloud-based applications. DevOps, open source tech stack, security, scalability, performance tuning. Working knowledge of building Microservices using a leading technology stack like Microprofile +Helidon. Very proficient in Java/JEE, , RESTful APIs, API Gateway, Microservices communications, RDBMS/NoSQL DB and DevOps methodologies and tools. Good understanding and experience of cloud native application design principles (like micro-services, stateless application meant for cloud, containers, 12 factor app principles etc.) Must have work on at least two Microservices based development project from scratch. Technical hands-on experience with Microservices Architecture Style and the related patterns, where software is developed as small and independently deployable services that work together modeled around a business domain. Experience with Microservices architecture, configuration, development, and deployment with their underlying technologies including Docker/Kubernetes, Helm, and Prometheus Experience with implementing continuous integration and delivery, CI/CD with their underlying technologies / tools (e.g. Jenkins, GIT, Gradle/Maven, Artifactory) Familiarity with application and infrastructure monitoring tools such as New Relic, Splunk Architectural experience throughout the entire software development lifecycle by continuously making critical adjustments to the architecture to ensure desired results. Experience working on agile development teams (ideally using Scrum or Kanban) Experience with production systems and dealing with production issues Ability to influence others without having direct management responsibility Excellent written and verbal communication skills with the ability to present complex information in a clear, concise manner to all audiences. Key Responsibilities: Responsible for architecture and design of the solution delivered by the team Design and Develop highly scalable, available, secure and elastic solutions that implement industry best practices and cutting-edge technologies. Partner closely with the product owners/business analyst to understand key features and functionalities of existing monolithic application and align newer architecture according to business needs. Build resilient and cloud ready solutions based on Micro Services, Multi-tenancy architecture. And own responsibility for the quality of software solutions. Coach developers, testers to deliver the high-quality software. Research, analyze and recommend solutions which meet business and technology needs. Partner with DevOps teams to operationalize the product deliveries. Preferred Skills - Domain knowledge of Banking and Billing

Posted 2 months ago

Apply

7 - 12 years

18 - 33 Lacs

Hyderabad

Work from Office

Mandatory skills AWS Public Cloud Expertise Terraform Proficiency ITIL Management -Change Management, Incident Management and Production Operations Experience in ticketing tools -ServiceNow , Jira. Build Tool Experience like CI/CD pipelines - Jules, Jenkins, Spinnaker Monitoring & Alerting tools experience: Apica, Splunk, Elastic Search, CloudWatch, Grafana, Prometheus, and Dynatrace 100% Basic Coding Experience Java/Python Good to have skills AWS certified Database knowledge Basic SQL queries, RDS Terraform certification SRE certifications Dynatrace Apica Grafana DevOps tools, including Jules, Jenkins, Bitbucket, GitHub, and CI/CD pipelines SRE BAR RAISER

Posted 2 months ago

Apply

9 - 14 years

15 - 25 Lacs

Ghaziabad

Remote

Job Title: Full Stack Engineer (Rust | Terraform/OpenTofu | Serverless) Experience Level: 10 Years Timings: 2pm to 11pm Position Overview We are seeking a highly capable and results-driven Full Stack Engineer with 34 years of professional experience, primarily focused on backend development using Rust . The ideal candidate will bring strong hands-on expertise in Infrastructure as Code (IaC) tools such as Terraform or OpenTofu , and practical experience designing and deploying serverless architectures in a production environment. This role demands a well-rounded engineer who is comfortable working across the stack, committed to engineering excellence, and capable of delivering scalable and maintainable solutions in a fast-paced, agile setting. Key Responsibilities Architect, develop, and maintain robust backend services and APIs using Rust . Design and manage cloud infrastructure using Terraform/OpenTofu , adhering to best practices in security and scalability. Build and integrate client-side components using modern front-end frameworks (e.g., React, Vue). Implement serverless solutions using platforms such as AWS Lambda , API Gateway , and associated cloud services. Collaborate with cross-functional teams including Product, DevOps, and QA to ensure end-to-end delivery. Write clean, modular, and well-tested code, and participate in peer code reviews. Contribute to system design discussions, technical planning, and architectural decisions. Required Qualifications 34 years of experience as a full stack engineer, with a primary focus on Rust for backend development. Proficiency in Terraform or OpenTofu , with proven experience deploying infrastructure in cloud environments (preferably AWS). Practical experience working with serverless technologies and event-driven architecture. Familiarity with front-end technologies such as React , TypeScript , or equivalent. Solid understanding of microservices, RESTful APIs, CI/CD pipelines, and Git-based workflows. Strong problem-solving skills and a proactive attitude toward learning and technical ownership. Preferred Qualifications Experience with Rust-based web frameworks such as Actix , Rocket , or similar. Exposure to WebAssembly (WASM) or systems programming concepts. Knowledge of observability tools such as Prometheus , Grafana , or AWS CloudWatch . Contributions to open-source projects or active involvement in developer communities.

Posted 2 months ago

Apply

10 - 14 years

35 - 40 Lacs

Mumbai, Hyderabad, Bengaluru

Work from Office

Job Description Lead and Manage DevOps Teams: Oversee the design, development, and implementation of CI/CD pipelines, infrastructure automation, and cloud management across OCI and other cloud platforms. Cloud Infrastructure & Data Pipelines: Manage multi-cloud environments with a focus on OCI and other cloud providers such as AWS, Azure, and GCP. Design and manage data pipelines that support large-scale data processing and analytics across cloud services. Cloud-Based Integration & APIs: Implement and manage API-based integrations between cloud services, ensuring seamless communication and data flow. Leverage tools like OCI API Gateway for secure and scalable API management. CI/CD Pipelines: Design and implement robust CI/CD processes using Jenkins, Git, Bitbucket, and other tools. Integrate SonarQube for static code analysis and maintain high code quality. Infrastructure as Code (IaC): Use Terraform and OCI Resource Manager to automate infrastructure provisioning, management, and scaling. Test Automation: Implement and manage test automation frameworks like Selenium to streamline testing in deployment pipelines. Security & Code Quality: Implement security best practices and ensure OCI security features. Gen AI & MLOps Integration: Support the integration of Generative AI technologies, NLP, and MLOps tools to facilitate the deployment and automation of AI models. Monitoring, Logging & Observability: Implement comprehensive observability strategies using OCI Monitoring and other tools, ensuring visibility into the performance, health, and reliability of cloud systems. Utilize OCI Management Platform and third-party solutions for centralized logging, alerting, and metrics aggregation to monitor system behavior, detect anomalies, and respond to incidents in real time. Collaboration & Stakeholder Management: Collaborate with development, QA, operations, and data engineering teams to understand their needs and deliver effective solutions. Work closely with leadership to define and drive the DevOps strategy. Innovation: Explore and implement cutting-edge tools and practices, especially around cloud-native development, automation, data processing, and advanced technology integrations like Generative AI. Career Level - M3 Responsibilities Experience with containerization technologies (Docker) and orchestration tools (Kubernetes).Proven Experience in DevOps Management with multi-cloud environments, especially OCI.Strong hands-on experience with:CI/CD Tools: Jenkins, Git, Bitbucket, etc.Code Quality Tools: SonarQube or similar for static code analysis.Infrastructure as Code (IaC): Terraform, OCI Resource Manager, Ansible (or similar).Automation & Scripting: Python, Shell scripting, etc.Testing Frameworks: Experience with monitoring and logging tools such as Prometheus, Grafana, ELK stack or similar.Data Pipeline Management: Hands-on experience in designing, implementing, and managing data pipelines across cloud platforms.Cloud-Based Integration & APIs: Experience integrating cloud services using APIs and tools like OCI API Gateway for managing secure and scalable API communications.OCI Tooling Expertise: Experience with OCI Resource Manager, Oracle Kubernetes Engine (OKE), OCI Functions, and OCI Monitoring.Monitoring, Logging & Observability: Strong understanding of observability tools and platforms, including experience with OCI Monitoring, OCI Management Platform, centralized logging, and metrics collection tools like Prometheus, Grafana, or similar.Cloud Expertise: In-depth knowledge of OCI, with familiarity in AWS, Azure, or GCP.Strong experience with configuration management and automation tools such as Ansible, Chef puppet.Experience with Generative AI, NLP, and MLOps tools for model development and deployment.Strong understanding of security, compliance, and monitoring in cloud environments.Excellent leadership, communication, and project management skills.

Posted 2 months ago

Apply

5 - 10 years

7 - 12 Lacs

Bengaluru

Work from Office

Oracle Siebel CRM team is looking for a top product management professional to join our Siebel Platform team. As the Product Owner, you will lead the design, development, and continuous improvement of our platform infrastructure, focusing on scalability, reliability, and performance. Your comprehensive knowledge of the system architecture and DevOps practices will be crucial in building robust and efficient platform solutions. Key Responsibilities: Product Vision and Strategy: Develop and clearly communicate the product vision, strategy, and roadmap for platform solutions, focusing on system scalability, resilience, and high availability. Translate high-level business requirements into detailed technical requirements, specifically related to cloud infrastructure, microservices architecture, and container orchestration. Identify opportunities for innovation in platform development and DevOps automation. System Design and Architecture Leadership: Lead the architectural design and implementation of highly available and scalable platform solutions. Collaborate with the DevOps and development teams to define the system architecture, focusing on: Microservices Architecture: Designing modular, loosely coupled services with API-driven communication. Event-Driven Architecture: Implementing real-time data streaming and message brokering (e.g., Kafka, RabbitMQ). Cloud-Native Design: Leveraging managed services and container orchestration (e.g., Kubernetes, Docker) for scalability. Database Architecture: Ensuring data consistency, redundancy, and performance in both SQL and NoSQL environments. API Gateway and Service Mesh: Managing internal and external API interactions, routing, and security. Monitoring and Observability: Integrating metrics, tracing, and log aggregation for end-to-end system visibility. Backlog Management: Prioritize and manage the product backlog with a strong emphasis on infrastructure improvements, system design, and automation. Write detailed user stories that encompass platform-specific features such as automated scaling, disaster recovery mechanisms, and real-time performance monitoring. Define acceptance criteria that validate the robustness of system architecture, including load balancing, fault tolerance, and multi-region support. Facilitate backlog refinement with the engineering team, focusing on the technical feasibility of complex platform enhancements. Team Collaboration and Communication: Act as the primary liaison between engineering teams and stakeholders, facilitating technical discussions related to system design choices and platform capabilities. Provide clear technical documentation and guidance on platform standards, system components, and integration points. Facilitate technical decision-making through data-driven analysis and proof-of-concept evaluations. Stakeholder Engagement: Engage with internal users to gather feedback on platform performance and stability. Prepare reports and presentations that highlight platform metrics, system uptime, and areas of improvement. --- Qualifications: Education: Bachelors degree in Computer Science, Information Technology, or a related field. Advanced degree or relevant certifications (e.g., Certified Scrum Product Owner, AWS Certified Solutions Architect) are preferred. Experience: Minimum of 5 years of experience as a Product Owner, with a focus on platform or DevOps environments. 10+ year of overall experience. Proven track record of delivering platform solutions that prioritize scalability, resilience, and high availability. Hands-on experience designing microservices architectures and implementing DevOps pipelines. Proficiency in cloud platforms (e.g., AWS, Azure, GCP) and their ecosystem services. Strong background in software architecture, including containerization, orchestration, and service mesh implementation. Past experience of working on Siebel platform would be a plus. Technical Skills: In-depth knowledge of system design patterns, including fault tolerance, scalability, and load distribution. Expertise in container technologies (Docker, Kubernetes) and orchestration tools. Strong command of CI/CD tools and practices, including automated testing and rollback strategies. Experience with infrastructure automation using Terraform, Ansible, or similar. Proficiency with monitoring and observability stacks (e.g., Prometheus, Grafana, ELK stack). Familiarity with API management and secure API gateway implementation. Knowledge of version control (Git), branching strategies, and automated build pipelines. Basic scripting skills (Python, Bash) for automating repetitive tasks. Soft Skills: Excellent communication and leadership abilities, with the capability to bridge technical and non-technical stakeholders. Strong analytical mindset to solve complex system issues and optimize performance. Proactive attitude towards problem identification and resolution. Ability to manage competing priorities in a dynamic, fast-paced environment.

Posted 2 months ago

Apply

10 - 15 years

12 - 17 Lacs

Bengaluru

Work from Office

As a member of the Support organization, your focus is to deliver post-sales support and solutions to the Oracle customer base while serving as an advocate for customer needs. This involves resolving post-sales technical questions regarding the use of and troubleshooting for our Electronic Support Services. A primary point of contact for customers, you are responsible for facilitating customer relationships with Support and providing advice and assistance to internal Oracle employees on diverse customer situations and escalated issues.As a Principal Support Engineer, you will offer strategic technical support to assure the highest level of customer satisfaction. A primary focus is to create/utilize automated technology and instrumentation to diagnose, document, and resolve/avoid customer issues. You are expected to be an expert member of the technical problem solving/problem avoidance team, routinely sought after to address extremely complex, critical customer issues. Services may be frequently provided by on-site customer visits.Leading contributor individually and as a team member, providing direction and mentoring to others. Work is non-routine and very complex, involving the application of advanced technical/business skills in area of specialization. Career Level - IC4 Career Level - IC4 Responsibilities This position is for a Principal Software Engineer to the established Oracle Database Exadata Cloud support team. The person in this role will be located in India. The teams main responsibility is to fix resolve highly complex technical issues on Oracle Exadata spanning across database areas including Real Application Clusters, High Availability, Data Guard, Corruption, Backup and Recovery, RMAN, ZDLRA, Performance, Memory Management, Parallel query, Query tuning, Storage, ASM, Golden Gate, Replication, Security, Networking, Enterprise Manager etc. The Engineer should have good hands on experience on UNIX, Linux and/or Solaris platforms . You are expected to work in partnership with customers, other support engineers and developers to deliver a superior ownership experience to the customer. You will have opportunities to become an authority / Specialist in Database, Exadata Cloud technologies. You will be widely regarded as a domain authority in their current role and will demonstrate the ability to resolve complex problems or identify acceptable workarounds. They should be able to perform their assigned duties with a great degree of independence requiring minimal direction. The Engineer is expected to be a key member of the technical problem solving as well as problem avoidance team, routinely sought after to address very complex, critical customer issues. Responsibilities The main role of a Support engineer is to fix highly complex technical problems (Oracle Database Exadata) requiring high level of technical expertise Works directly with customers Participates in weekend rotation and shifts Participates in initiatives that improve overall product and documentation quality Participates in product/platform testing Drives improvements in product quality Serves as Situation Manager on highly sensitive Customer issues Consults with Management in advising resolution of critical Customer situations Consults with Customers on sophisticated use of Oracle products Achieves knowledge transfer through development and delivery of training, knowledge sessions, mentoring etc. Creates /reviews Knowledge Articles Contributes significantly towards the My Oracle Support Database communities Analyzes work load, uses standard methodologies and implements changes to improve productivity Proactively contributes to increasing the team efficiency by sharing knowledge, providing feedback about standard methodologies, writing tools / utilities Who are we looking for Qualifications Greater than 10 years of industry experience Technical degree i.e. BE / B.Tech / M.Tech / MCA / M.Sc. in Computer Science / Management Information Systems / Engineering / Math / Physics / Chemistry or proven professional and technical experience. Oracle OCP DBA certification OCI Certification- Preferred. Oracle OCM DBA Certification is a plus. PERSONAL ATTRIBUTES Self-driven and result oriented Strong Problem solving/analytical skills Strong customer support and client relation skills Ability to work effectively in high volume high pressure situations Ability Flexibility to work late shifts Effective communication (verbal written) Ability to Network (internal external) Strong willingness to learn new technologies / skills Ability to Influence/negotiate Team player Customer focused Confident and decisive Enthusiastic Ability to Coach / share knowledge Ability to write technical Bulletins Technical skills We are looking for a core technical person, who has hands-on Database administration experience on UNIX/Linux and/or worked as L3 level support engineer and/or having equivalent knowledge. They should possess the following technical skills: Database architecture knowledge and administration Good knowledge on Exadata, Exadata Cloud and OCI architectures. ZDLRA, Backup and Recovery, RMAN, Dataguard, knowledge of various restore and recovery scenarios. Experience with cloud technologies from different vendors Extensive hands on interaction with large Database management systems General UNIX/Linux concepts Administration Personal attributes Dedicated and result oriented Strong Problem solving/analytical skills Strong customer support and client relation skills Ability to work effectively in high volume high stress situations Ability Flexibility to work late shifts Effective communication (verbal written) Ability to Network (internal external) Strong willingness to learn new technologies / skills Ability to Influence/negotiate Great teammate Customer focused Confident and decisive Enthusiastic Ability to Coach / share knowledge Ability to write technical Bulletins If this sounds like you, we hope to meet you! Life at Oracle and Equal Opportunity An Oracle career can span industries, roles, Countries and cultures, giving you the opportunity to flourish in new roles and innovate, while blending work life in. Oracle has thrived through 40+ years of change by innovating and operating with integrity while delivering for the top companies in almost every industry. In order to nurture the talent that makes this happen, we are committed to an inclusive culture that celebrates and values diverse insights and perspectives, a workforce that inspires thought leadership and innovation. Oracle offers a highly competitive suite of Employee Benefits designed on the principles of parity, consistency, and affordability. The overall package includes certain core elements such as Medical, Life Insurance, access to Retirement Planning, and much more. We also encourage our employees to engage in the culture of giving back to the communities where we live and do business. At Oracle, we believe that innovation starts with diversity and inclusion and to create the future we need talent from various backgrounds, perspectives, and abilities. We ensure that individuals with disabilities are provided reasonable accommodation to successfully participate in the job application, interview process, and in potential roles to perform crucial job functions. Thats why were committed to creating a workforce where all individuals can do their best work. Its when everyones voice is heard and valued that were inspired to go beyond whats been done before. Disclaimer: Oracle is an Equal Employment Opportunity Employer*. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veteransstatus, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law. Which includes being a United States Affirmative Action Employer https://www.oracle.com/corporate/careers/diversity-inclusion/

Posted 2 months ago

Apply

10 - 14 years

8 - 15 Lacs

Mumbai, Hyderabad, Bengaluru

Work from Office

Job Description Position Description We are seeking aspirational candidates who are interested in a career in Consulting to join our niche Banking Domain and Practice. The position will support Territory Heads, Delivery Managers, Portfolio and Project Managers and teams of talented, professional business and technology consultants in the delivery of business focused solutions for our clients using Oracle applications, tools and technology. Utilizing sound product skills and experience, the successful applicant will work on value consulting, solutioning and transforming and addressing complex business requirements into sound and optimal solutions to achieve successful outcomes for our customers, partners and associates and drive towards client and customer reference ability. Longer term you will grow, with the help of extensive training and experience of the team around you, into a seasoned employee and become a Subject Matter experts in Business domain and or Solution Architecture with full accountability and responsibility of the delivered solution for your own projects, programs and territory and larger region and organization. Job Responsibilities Partnering with and acting as a trusted advisor to stakeholders in both Consulting Sales and Delivery to assist in defining and delivering high quality enterprise capable solutions Working closely with stakeholders to develop practical roadmaps to move the enterprise towards the future state vision, while taking into account business, technical and delivery constraints Analyzing stakeholder requirements, current state architecture, and gaps to create a future state architecture vision for one or more parts of the enterprise with a focus on reduced complexity, cost efficiencies, reuse, convergence, reduced risk and/or improved business capabilities Participating in defining and operating the architecture governance process to ensure change initiatives align to the vision and roadmaps Working closely with Domain Architects across key initiatives and projects to apply architecture principles and standards, and develop reference architectures and design patterns Communicating the principles, standards, vision and roadmaps to stakeholders and proactively addressing any questions / concerns identified Providing thought leadership on architectural or other topics, developing a forward looking view of current and emerging technologies and their impact on the Enterprise Architecture Embedding Platform Thinking in everything Owning and enhancing workflows and processes, and delegates with clear accountabilities across the teams to meet objectives / outcomes Promoting an environment of learning and development. Understand and develop team members and others to achieve their professional growth Career Level - IC5 Career Level - IC5 Responsibilities Job Requirements Bachelor's Degree in Engineering, Computer Science or equivalent; Master's degree in Business or Technology is an advantage Formal architecture certification (TOGAF or equivalent) At least 15 years' experience in the IT industry, preferably in large, complex enterprises At least 7 years' experience in Enterprise Architecture in a large, complex, multi-location, multi-national environment Deep experience delivering mission critical, enterprise scale IT solutions in a heterogeneous technology environment Demonstrate deep domain expertise in Application Architecture in EAI, Microservices and Cloud native technologies Experience in Domain driven and Event driven architecture and in technologies such as Kafka and Spark Experience architecting, designing and developing large scale high performance retail business banking solutions utilizing a mixture of Open systems and messaging and high performance DB solutions. Experience in log analysis and log based monitoring (e.g. ELK) and metrics driven monitoring (Grafana, Prometheus) Direct experience with highly scalable enterprise applications, designing high performance, low latency solutions with high availability and near or no data loss. Familiarity with: best practice methodologies and tools for the entire solution lifecycle from ideation to requirements, design, development, testing, deployment and operations one or more formal Architecture frameworks / methodologies (TOGAF, Zachman, BIAN, etc.) architecture governance frameworks heterogeneous technology platforms such as AS400, Unix/Linux, Windows A deep understanding of all domains of Enterprise Architecture, including the business, data, application, infrastructure and security domains Possess strong understanding of business strategies and able to translate them into concrete achievable action plans Practical experience in: data modelling, object modelling, design patterns and Enterprise Architecture tool or other software modelling tools. business capabilities model Advanced Relational Database Experience (RDBMS) in Oracle Multi-tenant database is an advantage Functional Expertise preferred (but not a mandatory requirement) in at least two of the below domains: Branch Banking CRM and e-CIF Transaction Banking - Cash Management and Payments Lending Origination and Servicing Trade Finance & Supply Chain Finance e-Channels, eco-system partnerships and API (Open Banking) Proven experience leading teams resulting in the successful deployment of applications built on Cloud or on-prem enterprise environments for large Tier-1 Banks and Financial institutions Candidates having experience with migrating from legacy applications to a solution utilizing methodologies and Platforms that will ensure least down-time, reduced risk and excellent customer experience both the customer and end-users of those services will be preferred. IT Strategy consulting experience will be an added advantage Comfortable in working in an environment which is mix of several parties and teams from customer as well from Oracle where collaboration is key. Excellent verbal, written and presentation skills to stakeholders at all levels Ability to communicate complex topics in an understandable way using a level of detail and terms appropriate to the situation Capability to think conceptually and identify patterns across seemingly unrelated situations Must be a good team player and able to drive consensus amongst stakeholders with conflicting viewpoints and objectives People and team management in a transversal function Able to collaborate and drive motivation across a diverse slate within and across teams and can deal with difficult conversations effectively Diversity and Inclusion: An Oracle career can span industries, roles, Countries and cultures, giving you the opportunity to flourish in new roles and innovate, while blending work life in. Oracle has thrived through 40+ years of change by innovating and operating with integrity while delivering for the top companies in almost every industry. In order to nurture the talent that makes this happen, we are committed to an inclusive culture that celebrates and values diverse insights and perspectives, a workforce that inspires thought leadership and innovation. Oracle offers a highly competitive suite of Employee Benefits designed on the principles of parity, consistency, and affordability. The overall package includes certain core elements such as Medical, Life Insurance, access to Retirement Planning, and much more. We also encourage our employees to engage in the culture of giving back to the communities where we live and do business. At Oracle, we believe that innovation starts with diversity and inclusion and to create the future we need talent from various backgrounds, perspectives, and abilities. We ensure that individuals with disabilities are provided reasonable accommodation to successfully participate in the job application, interview process, and in potential roles. to perform crucial job functions. Thats why were committed to creating a workforce where all individuals can do their best work. Its when everyones voice is heard and valued that were inspired to go beyond whats been done before.

Posted 2 months ago

Apply

3 - 7 years

4 - 9 Lacs

Mumbai, Hyderabad, Bengaluru

Work from Office

Job Description Responsible for the operation of production environments, including systems and databases, supporting critical business operations. Maintain Availability, Scalability, and Efficiency of Oracle Cloud Services. Solve complex infrastructure problems. Handle customer incident tickets and/or deploy software in test or production systems, and or perform testing on test systems or production systems. You will be required to do RCA when possible; if the issue is complex, beyond your knowledge or skills, escalate to developers in team. Its a critical role to help with availability, scalability, and efficiency of Oracle products and services. Help manage Oracle standards, and methods for large-scale distributed systems. If needed, help facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning About the Group At Oracle Cloud Infrastructure (OCI), we build the future of the cloud for Enterprises as a diverse team of fellow creators and inventors. We act with the speed and attitude of a start-up, with the scale and customer-focus of the leading enterprise software company in the world. Compute is one of the core organisations within OCI. We are responsible for providing Compute power i.e. VMs and BMs. Cloud pretty much cannot exists without our org. The Compute org comprises of a family of critical foundational infrastructure services that drive OCIs hardware lifecycle activities Work with product team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioural characteristics of production services. Responsible for the mitigating critical customer incidents, or deployments or testing required to improve security, performance, availability, and scalability of service. Authority for end-to-end performance and operability. Partner with development teams in meeting SLA to unblock customers. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilise a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies. Responsible for the operation of production environments, including systems and databases, supporting critical business operations. Will perform administration and analysis for multiple production environments and recommend new and novel solutions to improve availability, performance, and supportability. This is an opportunity to bring a combination of deep technical knowledge with administration/analysis knowledge of Oracle's Cloud Infrastructure to provide escalation support to a wide range of complex production environment problems related to immense growth, scaling, leveraging the cloud, extremely high performance, and high availability requirements. Career Level - IC3 Responsibilities Install, monitor, maintain, support, and optimize all production server hardware and software. Provide escalated technical support for complex technical issues which may include leading problem management cases and providing management status. Coordinate escalated support cases and lead appropriate internal technical resources and/or third party vendors to resolution and coordinate a storage infrastructure of Oracle system and database appliances. Responsible for Oracle production environments; assist with server operating system and application upgrades, bug fixes, and patching; and work on standardization projects for both hardware and software under the Oracle technology stack while providing consistent system uptime as expected in a Cloud environment. Provide on-call support, on a rotating basis. Responsibilities include but not limited to Incident Management Support and troubleshooting of Staging/Production environments Response and Resolve incidents as per SLA's Organise, Anticipate, Plan and work as On-Call in shifts for multiple services (Open to work in shifts & shows flexibility) Maintain Service High Availability Release Management Test and Deploy solutions and automate to replace manual processes Build and maintain deployment tools/procedures Zero downtime deployments and a high availability mindset Define and build innovative solution methodologies and assets around infrastructure, cloud migration and deployment operations at scale. Work with service teams to resolve complex issues that require troubleshooting and knowledge of code. Keep documentation up to date and resolving similar tickets with lower turnaround time and within SLA Ensure production security posture Ensure monitoring is robust and effective Change Management Perform Root Cause Analysis Required Skills: 6+ years overall experience in IT industry Minimum 4 years of experience as a Sys Admin/Support Strong systems architecture skills Strong Linux administration (Understanding of different Hardware family) Virtualisation Technologies Scripting Language (Python/Bash/Shell etc, basic understanding of Java / Go will be good to have) Understanding of Networking, Cloud Computing, Load Balancers Hands on experience at Monitoring/Instrumentation tools (Prometheus/Grafana, new relic, elastic or equivalent). Experience with maintaining high scale deployments, managing high throughput and IO intensive services. Strong knowledge of system configuration tools such as Chef, Terraform, GIT, Jenkins/Hudson, Artifactory Continuous Integration development/deployment, e.g. Docker, Kubernetes

Posted 2 months ago

Apply

5 - 8 years

15 - 25 Lacs

Hyderabad, Pune, Bengaluru

Hybrid

Warm Greetings from SP Staffing!! Role :Azure Devops Engineer Experience Required :5 to 8 yrs Work Location :Bangalore/Pune/Hyderabad/Chennai Required Skills, Azure Devops Terraform Python/Bash/Powershell Grafana ./Prometheus Interested candidates can send resumes to nandhini.spstaffing@gmail.com

Posted 2 months ago

Apply

3 - 7 years

6 - 11 Lacs

Mumbai, Hyderabad, Bengaluru

Work from Office

Job Description Responsibilities- Design and develop core components of the product- Follow coding standards, build appropriate unit tests, integration tests and deployment scripts. Requirements- Experience in delivering of complex Java based solutions- Fintech product development experience is preferred- Strong understanding of microservices architectures and RESTful APIs.- Proven experience developing cloud-native applications.- Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).- Experience with at least one major cloud platform (AWS, Azure, Google Cloud).- Having knowledge or Oracle Cloud is preferred.- Experience with DevOps tools like Jenkins, GitLab CI/CD.- Knowledge of monitoring tools (e.g., Prometheus, Grafana) - Knowledge of event-driven architecture and message brokers (e.g., Kafka)- Monitor and troubleshoot Cloud native application performance and reliability in production environments.- Excellent verbal and written communication skills- Ability to collaborate and work effectively in a team. Career Level - IC2 Responsibilities Responsibilities- Design and develop core components of the product- Follow coding standards, build appropriate unit tests, integration tests and deployment scripts. Requirements- Experience in delivering of complex Java based solutions- Fintech product development experience is preferred- Strong understanding of microservices architectures and RESTful APIs.- Proven experience developing cloud-native applications.- Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).- Experience with at least one major cloud platform (AWS, Azure, Google Cloud).- Having knowledge or Oracle Cloud is preferred.- Experience with DevOps tools like Jenkins, GitLab CI/CD.- Knowledge of monitoring tools (e.g., Prometheus, Grafana) - Knowledge of event-driven architecture and message brokers (e.g., Kafka)- Monitor and troubleshoot Cloud native application performance and reliability in production environments.- Excellent verbal and written communication skills- Ability to collaborate and work effectively in a team.

Posted 2 months ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies