6608 Grafana Jobs - Page 34

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

10.0 - 15.0 years

10 - 20 Lacs

pune

Work from Office

What you will be doing: Understanding project KPIs, SLIs, SLOs, MTTD, MTTR, error budgets, and chaos engineering, and eliminating toil through automation. Exploring observability tools and creating/implementing dashboards. Running the production environment by monitoring availability and taking a holistic view of system health. Incident management: knowledge of handling incidents, participating in blameless postmortems, performing root cause analysis, and implementing post-incident reviews. Improving the reliability, quality, and time-to-market of our suite of software solutions. Developing scripts to reduce toil and automate repetitive tasks and issue resolution. Measure and optimize system performance, with...

Posted 3 weeks ago

5.0 - 10.0 years

5 - 8 Lacs

pune

Work from Office

About the job: The CIS Observability and Monitoring (O&M) Engineering team is looking for a Datadog Engineer (remote or hybrid) to develop and deliver enterprise-wide Datadog solutions to meet the current and future needs of all of SAS Institute, Inc. As a Datadog Engineer, you will: Gather requirements from various SAS business units. Participate in the planning, design, development, testing, and troubleshooting of Datadog features. Provide support, including user and operational problems, documentation, customization, and utilization of Datadog features. Work closely with technology partners on integrations. Define and promote monitoring best practices across the organization. Assist with spec...

Posted 3 weeks ago

6.0 - 15.0 years

12 - 24 Lacs

hyderabad, pune, greater noida

Work from Office

Roles and Responsibilities: Design, implement, and maintain monitoring systems using Prometheus and Grafana to ensure high availability and performance of cloud-based applications. Collaborate with cross-functional teams to identify and troubleshoot issues affecting system reliability, working closely with development teams to resolve root causes. Develop automated alerting rules and notification configurations for critical system metrics to ensure timely detection of potential issues before they impact users. Analyze data from various sources (e.g., logs, metrics) to identify trends, patterns, and areas for improvement in the monitoring process. Job Requirements: 6-15 years of experience in ...

Posted 3 weeks ago

5.0 - 7.0 years

8 - 12 Lacs

mumbai, mumbai suburban, mumbai (all areas)

Work from Office

Required Skills & Competencies: • Knowledge of Linux and Windows. • Proficiency in cloud platforms (AWS/GCP). • Hands-on with CI/CD tools (Jenkins, GitHub Actions, etc.). • Expertise in monitoring tools (Prometheus, Grafana, ELK, Dynatrace). • Knowledge of networking fundamentals, DNS, load balancers. • Experience with databases (Postgres, MongoDB, RDS, Aurora, DynamoDB, ElastiCache, Cloud SQL). • Strong scripting skills (Python, Shell, Bash). • Familiarity with containerisation (Docker, Kubernetes). • Understanding of security best practices and IAM. • Automation using Terraform, Ansible, CloudFormation. • Cloud cost management and optimization. Thanks & Regards Janhavi Gupta janhavi.gupta@cb...

Posted 3 weeks ago

6.0 - 11.0 years

12 - 20 Lacs

pune

Work from Office

Design, build, and manage CI/CD pipelines using tools such as Jenkins and GitLab. Deploy and manage applications on AWS/Azure cloud environments. Implement Infrastructure as Code (IaC) using Terraform and CloudFormation. Automate configuration management with Ansible. Manage containerization and orchestration using Docker. Configure monitoring & alerting using Prometheus, Grafana, and Zabbix. Mandatory scripting experience (Shell or Python).

Posted 3 weeks ago

6.0 - 10.0 years

13 - 23 Lacs

pune

Work from Office

Role & responsibilities: Design, build, and manage CI/CD pipelines using tools such as Jenkins and GitLab. Deploy and manage applications on AWS/Azure cloud environments. Implement Infrastructure as Code (IaC) using Terraform and CloudFormation. Automate configuration management with Ansible. Manage containerization and orchestration using Docker. Configure monitoring & alerting using Prometheus, Grafana, and Zabbix. Mandatory scripting experience (Shell or Python). Strong analytical and troubleshooting skills.

Posted 3 weeks ago

1.0 - 3.0 years

5 - 11 Lacs

pune

Work from Office

Role & responsibilities: Installation/deployment of new releases and environments for applications. Build and maintain highly scalable, large-scale deployments globally. Co-create and maintain architecture for 100% uptime, e.g. by creating alternate connectivity. Practice sustainable incident response/management and blameless post-mortems. Monitor and maintain production environment stability. Own entire platforms (prod environments). Deploy, automate, maintain, and manage production systems to ensure the availability, performance, scalability, and security of production systems. Engage in and improve the whole lifecycle of services from inception and design, through deployment, operation a...

Posted 3 weeks ago

6.0 - 11.0 years

12 - 16 Lacs

hyderabad

Work from Office

Job Description Summary: The Staff Software Engineer - DevOps will be responsible for providing the build and release strategy for highly complex as well as parallel and concurrent releases of a software product. Manages continuous code integration within the SDLC. Works independently and is seen as a technical leader. The role demonstrates a deep understanding of concurrent software development and its effect on build management and on releasing builds across versions and environments. Roles and Responsibilities: In this role, you will: Own builds, releases and continuous integration process for large and complex releases of a product and at times expands the scope across multiple concurrent ...

Posted 3 weeks ago

12.0 - 16.0 years

18 - 22 Lacs

hyderabad

Work from Office

Required Work Experience: 12 to 16 years of proven industry experience as a Cloud & Solution Architect, with a minimum of 7 years of experience working with GCP. Should have expert-level knowledge of broader solution architecture. Deep knowledge of GCP services, including compute, storage, networking, databases, and security services. Proven skills in GCE, GKE, Google App Engine, Cloud Run, Cloud Storage, BigQuery, Cloud SQL, Cloud Dataflow, Pub/Sub, Cloud Dataproc, Looker, etc. Proficiency with networking & security tools like VPC, Cloud Load Balancing, IAM, KMS, etc. Extensive work experience of migrating applications, data, and infrastructure from on-premises/ other environments to the clou...

Posted 3 weeks ago

6.0 - 9.0 years

14 - 19 Lacs

pune

Work from Office

Business Operations plays a key role in leading the DevOps transformation at Mastercard through our tooling and by being an advocate for change and standards throughout the development, quality, release, and product organizations. We accomplish this transformation by supporting daily operations with a hyper focus on triage and then root cause, understanding the business impact of our products. The goal of every biz ops team is to shift left to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Biz Ops teams also focus on risk manag...

Posted 3 weeks ago

3.0 - 8.0 years

8 - 11 Lacs

pune

Work from Office

Overview: The Financial Solutions BizOps team is looking for a Site Reliability Engineer with strong expertise in Apache NiFi and Apache Spark to support our mission-critical data workflows and analytics platforms. This role is key to ensuring the smooth, reliable, and efficient operation of our production systems that power day-to-day business functions. You will be responsible for monitoring, troubleshooting, and supporting complex data processing pipelines and integrations in real time. Your work will directly impact business operations, decision-making, and customer experience. Business Operations is leading the DevOps transformation at Mastercard through our tooling and by being an advo...

Posted 3 weeks ago

6.0 - 11.0 years

10 - 17 Lacs

bengaluru

Work from Office

Experience: 6+ years. Location: Bangalore/Hyderabad. Immediate joiners only. Production Support / Application Support. Mandatory skills: L2 production support (application) + Linux server administration (moderate level of expertise) + experience with any APM tool (Grafana/Splunk/Datadog preferred). 24x7 production support. No hybrid model / no work from home; work from the Mphasis office (ODC). Work location: Hyderabad and Bangalore. Rotational shift timings: 6:30 AM to 3:30 PM, 2:00 PM to 11:00 PM, 10:00 PM to 6:00 AM. Kindly share your updated CV at the email address below: madhuri.B@mycloudxtreme.com

Posted 3 weeks ago

2.0 - 18.0 years

0 Lacs

hyderabad, telangana

On-site

**Job Description:** At Amgen, you will be part of a mission to serve patients living with serious illnesses, pioneering the world of biotech since 1980. As a member of the Amgen team, you will have the opportunity to make a lasting impact on patients' lives by researching, manufacturing, and delivering innovative medicines across various therapeutic areas. The collaborative, innovative, and science-based culture at Amgen provides a platform for you to thrive and transform both patient lives and your career. **Roles & Responsibilities:** - Lead the delivery of overall product and features, managing the product team to ensure business, quality, and functional goals are met with each release. ...

Posted 4 weeks ago

4.0 - 8.0 years

0 Lacs

karnataka

On-site

As a Grafana Specialist, you will be responsible for working on dashboard development with underlying technologies and integration from scratch. You should be skilled in deploying and managing observability platforms such as Grafana, Prometheus, Loki, and Tempo. Additionally, you will be automating infrastructure using Python, Bash, Terraform, and Ansible. Your expertise in dashboard design, alerting configuration, incident response workflows, security, RBAC, and compliance best practices will be crucial for this role. You should also have experience in integrating logs, metrics, and traces across multi-cloud systems to ensure real-time visibility and system reliability. Your ability to coll...

Posted 4 weeks ago

2.0 - 6.0 years

0 Lacs

haryana

On-site

Role Overview: As a full-stack Software Development Engineer-2 at our client, you will be a crucial member of the engineering team responsible for enhancing the back-end web stack for upcoming experimental projects and optimizing the existing product lines. Your role will involve delivering high-quality backend applications, taking ownership of projects, and contributing to the improvement of the gaming experience for players. The ideal candidate should have a strong passion for technology and gaming, striving to develop products that enhance gamers' experience and drive retention in our game studio clients. Initially, you will focus on enhancing features in existing products, eventually pro...

Posted 4 weeks ago

12.0 - 16.0 years

0 Lacs

navi mumbai, maharashtra

On-site

As a Cloud Architect with 12-15 years of experience, you will be responsible for leading the cloud engineering team focusing on AWS and GCP platforms. Your main responsibilities will include: - Leading the design and implementation of cloud infrastructure on AWS and Google Cloud Platform with a focus on compute, storage, networking, and AI/ML services. - Architecting and implementing scalable, resilient, and secure cloud solutions aligned with business requirements. - Providing technical leadership and mentorship to engineers, ensuring best practices in architecture, development, and deployment. - Driving automation and CI/CD pipelines using DevOps tools such as Kubernetes, Jenkins, Docker, ...

Posted 4 weeks ago

5.0 - 12.0 years

0 Lacs

thiruvananthapuram, kerala

On-site

As a DevOps Architect at the company, your role will involve driving the design, implementation, and management of scalable, secure, and highly available infrastructure. Your responsibilities will include: - Leading and managing the DevOps team to ensure reliable infrastructure and automated deployment processes. - Designing, implementing, and maintaining highly available, scalable, and secure cloud infrastructure across platforms like AWS, Azure, and GCP. - Developing and optimizing CI/CD pipelines for multiple applications and environments. - Driving Infrastructure as Code (IaC) practices using tools like Terraform, CloudFormation, or Ansible. - Overseeing monitoring, logging, and alerting...

Posted 4 weeks ago

8.0 - 12.0 years

0 Lacs

chennai, tamil nadu

On-site

Role Overview: As a Test Automation Lead, you will lead the transformation of testing into a continuous and efficient end-to-end quality engineering function. Your typical day will involve collaborating with various teams to implement quality processes, tools, and methodologies that significantly enhance control, accuracy, and integrity in testing. You will also focus on evolving predictive and intelligent testing approaches, leveraging automation and innovative testing products and solutions to drive improvements in quality engineering. Key Responsibilities: - Own and deliver the performance test and engineering strategy and execution for WMS releases - Define workload models, SLAs, KPIs, a...

Posted 4 weeks ago

4.0 - 8.0 years

0 Lacs

karnataka

On-site

As a DevOps Engineer, you will play a crucial role in managing and optimizing the tech infrastructure on AWS. You will have the opportunity to work with cutting-edge technologies, solve complex problems, and drive system scalability. Your passion for ensuring high availability, security, and scalability will be essential in this role. **Key Responsibilities:** - Manage and optimize the cloud infrastructure on AWS and GCP to ensure high availability and scalability. - Monitor system performance, troubleshoot issues, and implement effective solutions. - Ensure infrastructure security through managing access controls, encryption, and compliance with best practices. - Collaborate with software d...

Posted 4 weeks ago

5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

Join a dynamic team shaping the tech backbone of operations, where your expertise fuels seamless system functionality and innovation. As a Technology Support II team member in Card Technology, you will play a vital role in ensuring the operational stability, availability, and performance of production application flows. Your efforts in troubleshooting, maintaining, identifying, escalating, and resolving production service interruptions for all internally and externally developed systems support a seamless user experience and a culture of continuous improvement. **Key Responsibilities:** - Provides end-to-end application or infrastructure service delivery to enable successful business operati...

Posted 4 weeks ago

5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

Role Overview: Join a dynamic team shaping the tech backbone of operations, where your expertise fuels seamless system functionality and innovation. As a Technology Support II team member in Card Technology, you will play a vital role in ensuring the operational stability, availability, and performance of production application flows. Your efforts in troubleshooting, maintaining, identifying, escalating, and resolving production service interruptions for all internally and externally developed systems support a seamless user experience and a culture of continuous improvement. Key Responsibilities: - Provides end-to-end application or infrastructure service delivery to enable successful busin...

Posted 4 weeks ago

3.0 - 5.0 years

10 - 20 Lacs

bengaluru

Remote

Job Application Link: https://app.fabrichq.ai/jobs/fbc63ddd-a3a6-4433-9eb2-ed98bf14577e Job Summary: Build the first engineering foundation for Folens digital learning platform and impact millions of students! Backend Engineer responsible for building scalable FastAPI services, designing efficient data models, and developing cloud-native systems. The role involves following Test-Driven Development practices, integrating third-party services, and working with AWS cloud infrastructure to support a digital learning platform for secondary school students. Key Responsibilities Build clean, scalable, and well-structured FastAPI services and REST APIs Design efficient data models, write performant ...

Posted 4 weeks ago

1.0 - 3.0 years

1 - 5 Lacs

noida, greater noida

Work from Office

Application Support - L1 - Contractual Role. Responsibilities: Monitor dashboards and alerts in New Relic, Grafana, and related tools for partner platforms (Zee5, Prime, Hotstar, Apple TV). Track subscription processing queues (size, delays, processing rate) at regular intervals. Validate alerts against subscription flows to detect blind spots or recurring issues. Perform synthetic test monitoring (login, purchase, entitlement checks) and report failures promptly. Categorize and escalate partner-level issues vs. internal infra issues using standard SOPs. Maintain a shift log of incidents, escalations, and resolutions. Participate in daily/weekly monitoring reviews and highlight repeat alerts o...

Posted 4 weeks ago

3.0 - 4.0 years

5 - 9 Lacs

pune

Work from Office

First-Level Support: Provide Level 1 technical support on hardware, software, network, etc. Should have working knowledge of troubleshooting mail flow issues. Should have working knowledge of the IAM process for onboarding and offboarding. Incident & Service Request Management: Oversee incident triage, assignment, and resolution across the team. Ensure all tickets are logged, prioritized, and resolved within defined SLAs. Act as an escalation point for critical or unresolved incidents. Onboarding & Offboarding: Ensure timely and accurate user provisioning/de-provisioning. Standardize onboarding/offboarding processes in alignment with compliance requirements. SLA & Performance Management: Monitor...

Posted 4 weeks ago

8.0 - 12.0 years

22 - 27 Lacs

bengaluru

Work from Office

Your future role: Take on a new challenge and apply your DevOps and cloud architecture expertise in a cutting-edge field. You'll work alongside innovative and collaborative teammates. You'll play a key role in ensuring our programs deliver efficiently with high quality while supporting software development and validation through automation. Day-to-day, you'll work closely with teams across the business (DevOps engineers, verification and validation engineers, and other stakeholders), design technical solutions for deployment, and contribute to long-term software platform upgrades and maintenance. You'll specifically take care of creating and managing testing and preview environments, as well as ...

Posted 4 weeks ago
