29 Observability Stacks Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

8.0 - 10.0 years

0 Lacs

gurgaon, haryana, india

On-site

dunnhumby is the global leader in Customer Data Science, empowering businesses everywhere to compete and thrive in the modern data-driven economy. We always put the Customer First. Our mission: to enable businesses to grow and reimagine themselves by becoming advocates and champions for their Customers. With deep heritage and expertise in retail one of the world's most competitive markets, with a deluge of multi-dimensional data dunnhumby today enables businesses all over the world, across industries, to be Customer First. dunnhumby employs nearly 2,500 experts in offices throughout Europe, Asia, Africa, and the Americas working for transformative, iconic brands such as Tesco, Coca-Cola, Mei...

Posted 2 days ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Job Title: Platform Resilience Architect Location: Bangalore, Hybrid Why should you choose us Rakuten Symphony is a Rakuten Group company, that provides global B2B services for the mobile telco industry and enables next-generation, cloud-based, international mobile services. Building on the technology Rakuten used to launch Japan's newest mobile network, we are taking our mobile offering global. To support our ambitions to provide an innovative cloud-native telco platform for our customers, Rakuten Symphony is looking to recruit and develop top talent from around the globe. We are looking for individuals to join our team across all functional areas of our business from sales to engineering, ...

Posted 3 days ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Java Tech Lead (56 Years Experience) About the Role We are seeking a highly skilled Java Tech Lead with 56 years of hands-on experience in backend engineering, architecture design, and leading development teams. The ideal candidate will combine strong technical expertise in Java frameworks with a deep understanding of system design, scalability, and performance optimization. This role involves technical leadership , code reviews , and architectural decision-making for complex enterprise systems with occasional exposure to analytics-driven and Python-based components. Key Responsibilities Architect, design, and develop scalable backend systems using Java (Quarkus, Spring Boot, Spring, Java EE...

Posted 1 week ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

bengaluru, karnataka, india

Remote

At ABB, we help industries outrun - leaner and cleaner. Here, progress is an expectation - for you, your team, and the world. As a global market leader, we'll give you what you need to make it happen. It won't always be easy, growing takes grit. But at ABB, you'll never run alone. Run what runs the world. This Position reports to: IS Service Owner -Build & Accelerate Your Role And Responsibilities In this role, you'll help run what runs the world, by taking on meaningful work that drives real impact. Work model: # Li-hybrid/remote Provide day-to-day technical support to DevOps teams, diagnosing and resolving platform issues. Develop and maintain automation scripts and tool integrations to st...

Posted 1 week ago

AI Match Score
Apply

2.0 - 4.0 years

0 Lacs

hyderabad, telangana, india

On-site

The Software Engineer SRE will be responsible for building and maintaining highly reliable, scalable, and secure infrastructure that powers the Albert platform. This role focuses on automation, observability, and operational excellence to ensure seamless deployment, performance, and reliability of core platform services. Key Responsibilities Act as a passionate representative of the Albert product and brand. Collaborate with Product Engineering and other stakeholders to plan and deliver core platform capabilities that enable scalability, reliability, and developer productivity. Work with the Site Reliability Engineering (SRE) team on shared full-stack ownership of a collection of services an...

Posted 2 weeks ago

AI Match Score
Apply

4.0 - 8.0 years

0 Lacs

karnataka

On-site

As a Cloud System Debug Engineer, your primary role will involve diagnosing, analyzing, and resolving complex issues across large-scale public, private, and hybrid cloud environments. You will be responsible for ensuring high availability, performance, and reliability of mission-critical systems across multi-cloud platforms. Your key responsibilities will include: - Debugging complex issues in cloud infrastructure components such as networking, storage, virtualization, and orchestration layers. - Investigating failures in Kubernetes clusters, including nodes, pods, networking (CNI), and storage (CSI). - Troubleshooting problems with container runtimes like Docker, containerd, and CRI-O. - De...

Posted 2 weeks ago

AI Match Score
Apply

2.0 - 8.0 years

0 Lacs

haryana

On-site

As a Service Experience Manager at Dunnhumby, you will be responsible for owning the end-to-end service management strategy for Media systems. With 8+ years of experience in service management, including systems monitoring, site reliability engineering, or infrastructure operations, and 2+ years in a team lead or managerial role, you will coach team members to deepen their technical monitoring skills and business understanding. Your role will involve building a culture of continuous improvement, automation, and proactive detection, while ensuring monitoring coverage aligns with business-critical services and revenue-driving workflows. Key Responsibilities: - Define, maintain, and evolve serv...

Posted 2 weeks ago

AI Match Score
Apply

1.0 - 10.0 years

0 Lacs

ahmedabad, all india

On-site

Role Overview: You will be responsible for building, testing, and releasing developer assets that accelerate experimentation using technologies like Spring Boot/Helidon services, service registry, API gateways, observability, and Kubernetes. Additionally, you will create sample apps and hands-on labs covering various document processing techniques. Your role will involve contributing to and influencing frameworks such as Spring Boot, Spring AI, LangChain, and LangGraph. You will collaborate with Product Management and sales teams to engage developers, architects, CTOs, and CIOs, turning feedback into roadmap inputs and facilitating adoption. Furthermore, you will deliver compelling demos, ta...

Posted 3 weeks ago

AI Match Score
Apply

15.0 - 19.0 years

0 Lacs

hyderabad, telangana

On-site

Job Description: About Mobius Mobius is an AI-native platform that goes far beyond today's AI products. It fuses neural networks, symbolic reasoning, graph intelligence, and autonomous agent coordination into a single, living digital ecosystem. Think of Mobius as the next evolution of cloud-native AI platforms designed to build, manage, and evolve intelligent software automatically. It's where data meets reasoning, automation meets intelligence, and software learns how to build itself. Mobius is not looking for builders. We're looking for architects of the future. If you've ever imagined software that thinks, adapts, and evolves, you've imagined Mobius. Role Overview As the lead architect, y...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

As a member of the Alerts and Notification Platform team at SolarWinds, you will have the opportunity to contribute to the core of the SolarWinds Observability ecosystem. This platform plays a crucial role in enabling multiple feature teams to deliver alerts seamlessly to end users. Your role will involve collaborating with cross-functional teams to design scalable alert evaluation pipelines, develop flexible notification workflows, and ensure reliable, low-latency communication across various channels. Key Responsibilities: - Design scalable alert evaluation pipelines to support multiple feature teams - Develop flexible notification workflows to enhance user experience - Ensure reliable and...

Posted 1 month ago

AI Match Score
Apply

12.0 - 16.0 years

0 Lacs

maharashtra

On-site

Role Overview: You will be a Director in Infrastructure & Operations within the Product Operating Model (POM) leading Mumbai platform teams to deliver GCP compute, core networking, and foundational services. Your role will focus on reducing handoffs, increasing speed, and ensuring quality at scale. Key Responsibilities: - Lead Mumbai execution for the GCP compute platform including managing multiregion GKE clusters, GCE fleets, autoscaling, capacity planning, image/patch pipelines, upgrades, SLOs, and production runbooks. - Drive Cloud Development enablement by defining reusable Terraform modules, standardizing GitHub Actions workflows, operating GitOps for infra and cluster resources, and e...

Posted 1 month ago

AI Match Score
Apply

6.0 - 8.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Company Description RagaAI is the leading AI testing and observability platform that helps enterprises mitigate AI risks and guarantee model reliability. Traditional AI validation is ad-hoc, time-consuming, and costly. Our end-to-end platform streamlines evaluation, guard-railing, and monitoring so teams can ship GenAI, multimodal, and multi-agent systems with confidence. Learn more at www.raga.ai and join us on the journey to trustworthy AI. Role Description As a Senior Technical Program Manager (TPM) you will orchestrate cross-functional programs that span data science, platform engineering, and go-to-market. You'll own planning, execution, and delivery for initiatives such as LLM evaluati...

Posted 1 month ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

gurugram, haryana, india

On-site

Dunnhumby Hq. in London with offices across countries employs nearly 2,500 experts in offices throughout Europe, Asia, Africa, and the Americas working for transformative, iconic brands such as Tesco, Coca-Cola, Meijer, Procter & Gamble and Metro. dunnhumby is the global leader in Customer Data Science, empowering businesses everywhere to compete and thrive in the modern data-driven economy. We always put the Customer First. Our mission: to enable businesses to grow and reimagine themselves by becoming advocates and champions for their Customers. With deep heritage and expertise in retail one of the world's most competitive markets, with a deluge of multi-dimensional data dunnhumby today ena...

Posted 1 month ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

gurgaon, haryana, india

On-site

dunnhumby is the global leader in Customer Data Science, empowering businesses everywhere to compete and thrive in the modern data-driven economy. We always put the Customer First. Our mission: to enable businesses to grow and reimagine themselves by becoming advocates and champions for their Customers. With deep heritage and expertise in retail one of the world's most competitive markets, with a deluge of multi-dimensional data dunnhumby today enables businesses all over the world, across industries, to be Customer First. dunnhumby employs nearly 2,500 experts in offices throughout Europe, Asia, Africa, and the Americas working for transformative, iconic brands such as Tesco, Coca-Cola, Mei...

Posted 1 month ago

AI Match Score
Apply

2.0 - 7.0 years

0 - 0 Lacs

hyderabad

Work from Office

Primary Skills: Python/Go, IaC, observability stacks, reliability engineering Location: Hyderabad Only Experience : 3 to 14 years

Posted 1 month ago

AI Match Score
Apply

8.0 - 12.0 years

0 Lacs

haryana

On-site

As a highly skilled Lead DevOps Engineer, you will be responsible for designing cloud-native infrastructure, automating deployments, ensuring high availability, and driving operational excellence in a fast-paced environment. You will work on a reliable messaging platform that powers seamless communication for millions of users. **Key Responsibilities:** - **Infrastructure & Deployment** - Design, implement, and manage scalable, resilient cloud infrastructure (AWS/GCP/Azure) for messaging workloads. - Build CI/CD pipelines to enable automated, reliable, and fast delivery of new features. - Containerize applications (Docker/Kubernetes) and optimize orchestration for performance. - **Reliabilit...

Posted 1 month ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Company Description RagaAI is the leading AI testing and observability platform that helps enterprises mitigate AI risks and guarantee model reliability. Traditional AI validation is ad-hoc, time-consuming, and costly. Our end-to-end platform streamlines evaluation, guard-railing, and monitoring so teams can ship GenAI, multimodal, and multi-agent systems with confidence. Learn more at www.raga.ai and join us on the journey to trustworthy AI. Role Description As a Senior Technical Program Manager (TPM) you will orchestrate cross-functional programs that span data science, platform engineering, and go-to-market. You'll own planning, execution, and delivery for initiatives such as LLM evaluati...

Posted 1 month ago

AI Match Score
Apply

10.0 - 12.0 years

0 Lacs

india

On-site

About The Company Netomi is the leading agentic AI platform for enterprise customer experience. We work with the largest global brands like Delta Airlines, MetLife, MGM, United, and others to enable agentic automation at scale across the entire customer journey. Our no-code platform delivers the fastest time to market, lowest total cost of ownership, and simple, scalable management of AI agents for any CX use case. Backed by WndrCo, Y Combinator, and Index Ventures, we help enterprises drive efficiency, lower costs, and deliver higher quality customer experiences. Want to be part of the AI revolution and transform how the world's largest global brands do business Join us! We're seeking a Hea...

Posted 1 month ago

AI Match Score
Apply

3.0 - 7.0 years

0 Lacs

chennai, tamil nadu

On-site

As an experienced Node.js back-end developer, you will be responsible for designing, developing, and deploying Node.js microservices on AWS (Lambda, ECS, EKS) following 12-factor principles. Your key responsibilities will include: - Building RESTful and event-driven APIs that integrate DynamoDB, RDS, and third-party services with robust authentication and logging. - Optimizing code, queries, and infrastructure for performance, reliability, and cost efficiency using CloudWatch metrics. - Implementing automated testing, CI/CD pipelines, and Infrastructure-as-Code (CloudFormation/CDK) to ensure rapid, safe releases. - Collaborating with front-end, DevOps, and product teams to refine requirement...

Posted 1 month ago

AI Match Score
Apply

3.0 - 7.0 years

0 Lacs

karnataka

On-site

As an engineer joining the Customer Engineering team at Zinier, focused on a low-code platform, you will be responsible for debugging, developing, and collaborating across teams to enhance the product performance and customer experience. You will be tasked with investigating and resolving customer-reported issues in a JavaScript + JSON low-code environment. This includes identifying, fixing bugs, and implementing enhancements to improve product performance, reliability, and usability. Your role will also involve supporting customers worldwide, creating and maintaining documentation, and identifying opportunities for process improvements. **Key Responsibilities:** - Investigate and resolve cu...

Posted 1 month ago

AI Match Score
Apply

1.0 - 10.0 years

0 Lacs

telangana

On-site

As a Platform Engineer, you will play a crucial role in designing, building, and maintaining robust, scalable, and secure infrastructure platforms that support software development and deployment across the organization. Key Responsibilities: - Design and maintain CI/CD pipelines to streamline development workflows. - Build and maintain Infrastructure as Code (IaC) using tools like Terraform, Ansible, or CloudFormation. - Manage cloud infrastructure (AWS, Azure, or GCP) ensuring high availability, scalability, and security. - Monitor system performance and availability using tools like Prometheus, Grafana, or Datadog. - Collaborate with development, DevOps, and SRE teams to support applicati...

Posted 2 months ago

AI Match Score
Apply

12.0 - 16.0 years

0 Lacs

pune, maharashtra

On-site

Role Overview: As a System Architect with 12+ years of experience, you will be responsible for designing end-to-end architecture covering React-TypeScript front-ends, FastAPI micro-services, Python business logic, Azure data layers, and OpenAI API integration. You will define interface contracts, event flows, observability standards, and non-functional requirements such as quality, security, data residency, and performance. Your role will also involve establishing prompt/response validation, abuse mitigation, and PII safeguards for GPT-4o-class models. Additionally, you will benchmark chat completion and Responses API, embed models, and tune retrieval pipelines for factual accuracy. Your res...

Posted 2 months ago

AI Match Score
Apply

12.0 - 16.0 years

0 Lacs

hyderabad, telangana

On-site

As the Senior Technical Architect for Generative AI and Agent Factory at PepsiCo, your key responsibilities will include: - Architecting and governing the design of scalable, modular AI agent frameworks (Agent Mesh, Orchestrator, Memory, Canvas) for enterprise-wide reuse. - Defining event-driven orchestration and agentic execution patterns (e.g., Temporal, LangGraph, AST-RAG, reflection) to enable intelligent, context-aware workflows. - Driving platform integration across PepGenX, Agent Factory, and PepVigil to ensure consistency in observability, security, and orchestration patterns. - Developing reusable agent templates, blueprints, and context frameworks (e.g., MCP, semantic caching) to a...

Posted 2 months ago

AI Match Score
Apply

12.0 - 14.0 years

0 Lacs

hyderabad, telangana, india

On-site

We are hiring for world-class payments network with human-centric customer service, trusted by 15,000+ partners across 235 territories through our flagship brands. With a diverse team of 800+ professionals across 10 global offices , we are reshaping the international property market by connecting buyers, sellers, legal firms, banks, and real estate agents through innovative payments and embedded software solutions . Our mission is clear: to make buying property abroad faster, simpler, and safer. About the Role We are looking for a dynamic and experienced Senior Engineering Manager with a strong architectural mindset to lead and evolve our Platform & DevOps function . This hybrid technical-le...

Posted 3 months ago

AI Match Score
Apply

15.0 - 19.0 years

0 Lacs

hyderabad, telangana

On-site

About Mobius: Mobius is an AI-native platform that surpasses current AI products by blending neural networks, symbolic reasoning, graph intelligence, and autonomous agent coordination into a cohesive digital ecosystem. It represents the next phase of cloud-native AI platforms, specifically engineered to construct, oversee, and enhance intelligent software automatically. Mobius is the convergence point of data and reasoning, automation and intelligence, where software learns to self-construct. The Role: As a key figure, you will steer the architectural strategy of Mobius's core orchestration and infrastructure layer. This layer is crucial for driving all automation, workflow execution, and ba...

Posted 3 months ago

AI Match Score
Apply
Page 1 of 2
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies