Home
Jobs

541 Grafana Jobs - Page 14

Filter Interviews
Min: 0 years
Max: 25 years
Min: ₹0
Max: ₹10000000
Setup a job Alert
Filter
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 7.0 years

15 - 27 Lacs

Bangalore Rural, Bengaluru

Work from Office

Naukri logo

DevOps, Site Reliability Engineering,loud platforms,GCP,Infrastructure as Code tools (Terraform, Ansible, CloudFormation), Prometheus, Grafana, ELK stack,Python, Bash, Go, Istio, Linkerd

Posted 3 weeks ago

Apply

9.0 - 14.0 years

35 - 40 Lacs

Pune

Work from Office

Naukri logo

Role Description Our organization within Deutsche Bank is AFC Production Services. We are responsible for providing technical L2 application support for business applications. The AFC (Anti-Financial Crime) line of business has a current portfolio of 25+ applications. The organization is in process of transforming itself using Google Cloud and many new technology offerings. As an Assistant Vice President, your role will include hands-on production support and be actively involved in technical issues resolution across multiple applications. You will also be working as application lead and will be responsible for technical & operational processes for all application you support. Deutsche Banks Corporate Bank division is a leading provider of cash management, trade finance and securities finance. We complete green-field projects that deliver the best Corporate Bank - Securities Services products in the world. Our team is diverse, international, and driven by shared focus on clean code and valued delivery. At every level, agile minds are rewarded with competitive pay, support, and opportunities to excel. You will work as part of a cross-functional agile delivery team. You will bring an innovative approach to software development, focusing on using the latest technologies and practices, as part of a relentless focus on business value. You will be someone who sees engineering as team activity, with a predisposition to open code, open discussion and creating a supportive, collaborative environment. You will be ready to contribute to all stages of software delivery, from initial analysis right through to production support. Your key responsibilities Provide technical support by handling and consulting on BAU, Incidents/emails/alerts for the respective applications. Perform post-mortem, root cause analysis using ITIL standards of Incident Management, Service Request fulfillment, Change Management, Knowledge Management, and Problem Management. Manage regional L2 team and vendor teams supporting the application. Ensure the team is up to speed and picks up the support duties. Build up technical subject matter expertise on the applications being supported including business flows, application architecture, and hardware configuration. Define and track KPIs, SLAs and operational metrics to measure and improve application stability and performance. Conduct real time monitoring to ensure application SLAs are achieved and maximum application availability (up time) using an array of monitoring tools. Build and maintain effective and productive relationships with the stakeholders in business, development, infrastructure, and third-party systems / data providers & vendors. Assist in the process to approve application code releases as well as tasks assigned to support to perform. Keep key stakeholders informed using communication templates. Approach support with a proactive attitude, desire to seek root cause, in-depth analysis, and strive to reduce inefficiencies and manual efforts. Mentor and guide junior team members, fostering technical upskill and knowledge sharing. Provide strategic input into disaster recovery planning, failover strategies and business continuity procedures Collaborate and deliver on initiatives and install these initiatives to drive stability in the environment. Perform reviews of all open production items with the development team and push for updates and resolutions to outstanding tasks and reoccurring issues. Drive service resilience by implementing SRE(site reliability engineering) principles, ensuring proactive monitoring, automation and operational efficiency. Ensure regulatory and compliance adherence, managing audits,access reviews, and security controls in line with organizational policies. The candidate will have to work in shifts as part of a Rota covering APAC and EMEA hours between 07:00 IST and 09:00 PM IST (2 shifts). In the event of major outages or issues we may ask for flexibility to help provide appropriate cover. Weekend on-call coverage needs to be provided on rotational/need basis. Your skills and experience 9-15 years of experience in providing hands on IT application support. Experience in managing vendor teams providing 24x7 support. Preferred : Team lead role experience, Experience in an investment bank, financial institution. Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience/diploma/certification). Preferred : ITIL v3 foundation certification or higher. Knowledgeable in cloud products like Google Cloud Platform (GCP) and hybrid applications. Strong understanding of ITIL /SRE/ DEVOPS best practices for supporting a production environment. Understanding of KPIs, SLO, SLA and SLI Monitoring Tools: Knowledge of Elastic Search, Control M, Grafana, Geneos, OpenShift, Prometheus, Google Cloud Monitoring, Airflow,Splunk. Working Knowledge of creation of Dashboards and reports for senior management Red Hat Enterprise Linux (RHEL) professional skill in searching logs, process commands, start/stop processes, use of OS commands to aid in tasks needed to resolve or investigate issues. Shell scripting knowledge a plus. Understanding of database concepts and exposure in working with Oracle, MS SQL, Big Query etc. databases. Ability to work across countries, regions, and time zones with a broad range of cultures and technical capability. Skills That Will Help You Excel Strong written and oral communication skills, including the ability to communicate technical information to a non-technical audience and good analytical and problem-solving skills. Proven experience in leading L2 support teams, including managing vendor teams and offshore resources. Able to train, coach, and mentor and know where each technique is best applied. Experience with GCP or another public cloud provider to build applications. Experience in an investment bank, financial institution or large corporation using enterprise hardware and software. Knowledge of Actimize, Mantas, and case management software is good to have. Working knowledge of Big Data Hadoop/Secure Data Lake is a plus. Prior experience in automation projects is great to have. Exposure to python, shell, Ansible or other scripting language for automation and process improvement Strong stakeholder management skills ensuring seamless coordination between business, development, and infrastructure teams. Ability to manage high-pressure issues, coordinating across teams to drive swift resolution. Strong negotiation skills with interface teams to drive process improvements and efficiency gains.

Posted 3 weeks ago

Apply

7.0 - 10.0 years

9 - 12 Lacs

Hyderabad

Work from Office

Naukri logo

What youll be doing Youll be part of the Global Infrastructure organization that provides end user support to the workforce in Verizon India. As a Sr Manager of the End User Support team, youll be responsible for the following. Identifying, diagnosing, and resolving the issues in computer hardware and software in a system such as laptops and PCs Working with other infrastructure stakeholders and collaborate with them to address issues around Desktop packaging & distribution, CITRIX VDI & computer networking Building and nurturing high performing teams & Stakeholder collaboration to address Providing ongoing performance feedback, coaching, training and development. Adapting industry leading practices in service management and infrastructure enablement to improve cost of operations and operate with high service levels and customer satisfaction scores. Analyzing employee journey and embarking on transformative initiatives to enhance employee experience Be flexible to work in shifts based on business requirements Mentoring & Coaching the team technically for unresolved issues Stakeholder management, User experience focus, vendor management. Leading virtual teams and be accountable for Support teams Availability Handling escalations Adhoc requests Performance management & Stack ranking Tracking KPIs & Monitoring AYS dashboards for insights Creating executive level dashboards & drive meaningful partnership Manage complex projects and programs Providing desktop support, project coordination, imaging/ re-imaging PCs, and laptops. Coordinating with the third-party service provider for timely repair of the system under the maintenance agreement warranty, and repairing minor flaws in hardware if not covered under the same. Providing assistance in installation of other peripherals like printers, scanners, etc., cabling systems like local area network cables, network interface cards, wired switches, etc. Developing strategies, assisting with designs and applications, software testing, development for preventing a technical breakdown in future. Monitoring and analyzing the performance of an upgraded/changed system, keeping track of its performance, reliability, risks, and benefits. Strengthen Employee Engagement and build a culture of learning & innovation Bringing new ideas and process improvements To reduce ticket Volume To provide better employee experience What were looking for You'll need to have: Bachelors degree or four or more years of work experience. Six or more years of experience in IT Infrastructure support Experience in providing IT support for desktops, printers, peripherals and mobile devices Possess an engineering-centric background and a user first mindset Good communicator with presentation, Interpersonal and Problem solving skills. Program Management skills and an ability to influence internal and external stakeholders without exercising authority Experience in Service Now Tool reporting and usage Even better if you have: Certified ITIL v3 and above Infrastructure certifications such as CCNA, Citrix, Virtual Desktops etc. Good knowledge in Python, Powershell, Looker, Grafana, Splunk Good knowledge in AIOPS

Posted 3 weeks ago

Apply

5.0 - 8.0 years

7 - 10 Lacs

Chennai

Work from Office

Naukri logo

What youll be doing... As a devops engineer, you will design, implement, and manage Kubernetes clusters for our telecom/networking applications. Developing and maintaining CI/CD pipelines for automated build, testing, and deployment. Monitoring and optimizing the performance and scalability of our Kubernetes infrastructure. Implementing and maintaining monitoring and alerting systems to proactively identify and resolve issues. Leading incident response and troubleshooting efforts, including root cause analysis. Automating operational tasks and processes to improve efficiency. Collaborating with development teams to integrate and deploy applications to Kubernetes. Contributing to the development and maintenance of our platform's security posture. Participating in on-call rotations to provide support for production systems. Leveraging network/telecom domain knowledge to effectively triage and resolve network-related issues. Contributing to development efforts by writing code and implementing new features (added advantage). Staying up-to-date with the latest Kubernetes and DevOps technologies and best practices. What were looking for: We are seeking a highly motivated and experienced Engineer with a strong background in Kubernetes and DevOps practices to join our team. This role will focus on building, maintaining, and scaling our network/telecom infrastructure and services in a kubernetes/Openshift based environment. You will play a key role in ensuring the reliability, performance, and security of our platform, working closely with development, operations, and other engineering teams. Experience with triaging and troubleshooting complex issues is essential, as is a willingness to contribute to development efforts. You'll need to have: Bachelors degree or four or more years of work experience. Four or more years of relevant work experience. Four or more years of experience in DevOps engineering or a related role. Proven experience with Kubernetes and containerization technologies (e.g., Docker). Experience with CI/CD tools (e.g., Jenkins, GitLab ). Strong understanding of networking concepts and protocols (e.g., TCP/IP, BGP, MPLS). Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Experience with cloud computing platforms (e.g., AWS, Azure, GCP) is an added advantage. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration skills. Experience in the telecom/networking domain is essential. Experience with scripting languages (e.g., Python, Bash) is highly desirable. Experience with development and coding is a significant advantage. Even better if you have one or more of the following: Experience with a high-performance, high-availability environment. Experience with Network technologies like SDN/NFV Strong analytical, debugging skills. Good communication and presentation skills. Relevant certifications.

Posted 3 weeks ago

Apply

10.0 - 15.0 years

35 - 55 Lacs

Pune

Work from Office

Naukri logo

BMC is looking for a Princial QA to join a QE team working on complex and distributed software, developing test plans, executing tests, developing automation & assuring product quality. Here is how, through this exciting role, YOU will contribute to BMC's and your own success: Perform manual tests as well as automation development using Python Review requirements, specifications, and technical design documents to provide timely and meaningful feedback. Create detailed, comprehensive, and well-structured test plans and test cases. Estimate, prioritize, plan and coordinate testing activities Collaborate with various teams to ensure quality testing Initiate projects and ideas to improve the teams results On-board and mentor new employees To ensure youre set up for success, you will bring the following skillset & experience: You have 10+ years of experience in a software testing and automation. You have 3+ yexperience in an architecture or lead role. You have strong knowledge of AI/ML testing methodologies, model evaluation metrics, and data pipeline testing. You have expertise in automation tools such as Selenium, PyTest, TestNG, JUnit, or similar. You have strong programming skills in Java, Python, JavaScript, or similar languages. You are familiarity with cloud environments (AWS, Azure, GCP) and containerization (Docker, Kubernetes). You have working knowledge of databases and SQL (Structured Query Language). You have experience with API testing tools like Postman, RestAssured, or SoapUI. You have background in performance testing tools (e.g., JMeter, LoadRunner). Whilst these are nice to have, our team can help you develop in the following skills: Proficiency in observability and monitoring tools (e.g., Prometheus, Grafana, ELK stack, Splunk) and frameworks for real-time metrics, alerting, and logging. BMC Helix Product knowledge is a plus Understanding of CI/CD tools (e.g., Jenkins, GitLab CI, Azure DevOps).

Posted 3 weeks ago

Apply

5.0 - 10.0 years

10 - 20 Lacs

Noida

Work from Office

Naukri logo

JOB DESCRIPTION Experience Level desired 5+ yrs Compensation: Salary commensurate w/ experience Reports to: Team Lead RESPONSIBILITIES Application Performance Monitoring - Using Dynatrace APM tools to optimize application performance and identify performance bottlenecks in web applications and provide solutions. Dynatrace OneAgent Installation and Troubleshoot on all types of platform like on cloud (Azure Infra, AKS ,App Services)/on-premises Dynatrace integration with 3rd Party Tools. Demonstrates thorough knowledge and awareness of application performance issues in a complex multi-tiered environment Knowledge of Customer Experience Management, Application Performance monitoring and log analytics tools like Splunk, Dynatrace Synthetic, Dynatrace Appmon, CA APM, Prometheus etc., is highly desired. On-board new application into Dynatrace, profile configuration, agent setup, instrumentation. Ability to do requirement gathering and target environment analysis from an APM perspective Hands-on implementation experience in Dynatrace On-Premise solutions Experience on Configuration and customization of Dynatrace solution Excellent communication skills (both verbal and written) Knowledge of Azure is preferred Hands on APM and other tools like- DataDog, Glassbox, Splunk, Grafana, Prometheus, New-Relic, Postman, Azure Appinsights, Azure Log, Jenkins, Docker. Power BI (good to have required for reporting and extracting data from tools) QUALIFICATIONS B.Tech or MCA preferred Atleast 5 yrs work exp. Good Communication skills

Posted 3 weeks ago

Apply

5.0 - 7.0 years

5 - 9 Lacs

Mumbai, Bengaluru, Delhi / NCR

Work from Office

Naukri logo

Key Responsibilities : Chaos Engineering : - Design and implement chaos engineering experiments to identify weaknesses in systems and applications. - Develop and execute strategies to improve system resilience and reliability. - Analyze experiment results, provide actionable insights, and drive remediation efforts. - Collaborate with development, operations, and infrastructure teams to integrate chaos engineering practices. Operational Acceptance : - Develop and maintain comprehensive operational acceptance criteria for new and existing systems. - Conduct thorough operational acceptance testing, ensuring systems meet all predefined criteria before go-live. - Work closely with project managers, developers, and QA teams to align operational acceptance processes with project timelines and objectives. - Document and communicate operational readiness findings, providing recommendations for improvement. System Resilience and Reliability : - Implement and manage strategies for continuous improvement of system resilience and reliability. - Monitor and assess system performance, identifying potential risks and areas for enhancement. - Lead initiatives to improve disaster recovery and business continuity plans. - Stay updated with the latest industry trends and best practices in chaos engineering and operational acceptance. Collaboration and Training : - Educate and mentor team members on chaos engineering and operational acceptance methodologies. - Foster a culture of resilience and reliability within the organization. - Engage with external communities, attending conferences and participating in knowledge-sharing events. Requirements : - Extensive experience in chaos engineering, operational acceptance testing, and system resilience. - Strong understanding of cloud platforms (AWS, Azure, GCP) and their resilience features. - Proficiency in scripting and automation tools (Python, Bash, Terraform, etc. - Experience with monitoring and observability tools (Prometheus, Grafana, Splunk, etc. - Experience with Chaos Engineering Tools such as Gremlin, Chaos Monkey etc. - Excellent analytical and problem-solving skills. - Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams. - Certifications in relevant fields (e.g , AWS Certified Solutions Architect, Azure DevOps Engineer) are a plus. Location: Delhi NCR,Bangalore,Chennai,Pune,Kolkata,Ahmedabad,Mumbai,Hyderabad

Posted 3 weeks ago

Apply

3.0 - 8.0 years

5 - 10 Lacs

Pune

Work from Office

Naukri logo

BMCs SaaS Ops team is looking for a DevOps Engineer to join us and design, develop, and implement complex applications, using the latest technologies. Here is how, through this exciting role, YOU will contribute to BMC's and your own success: Participate in all aspects of SaaS product development, from requirements analysis to product release and sustaining. Drive the adoption of the DevOps process and tools across the organization. Learn and implement cutting-edge technologies and tools to build best of class enterprise SaaS solutions. Deliver high-quality enterprise SaaS offerings on schedule Develop Continuous Delivery Pipeline Required Skills: 3+ years of working experience in a software engineering function Hands on experience with CI\CD pipelines and maintenance of containerized deployments Fundamental knowledge of one of automation scripting language Python, Groovy, Ansible, or Shell scripting Hands on experience in creating and maintaining Jenkins pipelines Hands on experience working with Web service protocols (Rest, JSON) Hands on experience working with DevOps and Automation tools like Git, Docker, Helm, Terraform, Jira, Harbor Registry Proficient working on Windows and Linux Operation System platforms. Good exposure and fundamental knowledge of Relational DBs (PostgreSQL, MS SQL) Good exposure and fundamental knowledge of container deployments, persistent storage, PODs, ingress, routes and Kubernetes objects. Good exposure and fundamental knowledge of tools like Elastic Search, Kibana, Grafana, Prometheus Good exposure and fundamental knowledge of Public, Private and hybrid cloud deployments Good exposure and fundamental knowledge of Site Reliability Engineering (SRE) principles and its implementation for SaaS services. Experience working in an Agile methodology with cross functional teams (R&D, DevOps, Operations, Support etc.) Able to design & document the Standard Operating Procedures (SOPs), design document and architecture artifacts Good troubleshooting skills and knowledge of BMC Helix products including ITSM, Digital Workplace, Helix Platform will be an add-on. Ability to work with time bound deadlines Hard Working & dedicated person with effective communication skills Bachelors degree in IT or equivalent professional experience This position is part of BMC SaaS DevOps team. This can include weekend work during scheduled production activities and after-hours work as needed.

Posted 3 weeks ago

Apply

5.0 - 9.0 years

14 - 19 Lacs

Noida

Work from Office

Naukri logo

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by diversity and inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health equity on a global scale. Join us to start Caring. Connecting. Growing together. Primary Responsibility Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so Required Qualifications B.E or B.Tech or M.Tech or MCA Demonstrated ability to address the upcoming deliverables or work orders or tickets independently. For this hands on below technologies are required Solid hands-on experience Java 17 and above Solid hands-on experience Spring, Spring Boot, Hibernate, JSF and ReactJS Hands-on experience Web services (REST or Micro services or API) Hands-on experience Unix scripting Hands-on experience RDBMS DB like Oracle, SQL & DB2 Hands-on experience NoSQL DB preferably MongoDB Hands-on experience Kafka messaging services Hands-on experience eclipse or STS Hands-on experience JBoss and WAS Hands-on experience Cloud (Preferred GCP) Hands-on experience DevOps Demonstrated ability to work on GitHub Actions or Dev ops model Proven solid analytical, debugging and performance tuning skills Demonstrated ability to interact with business. Hence, good communication skill set Preferred Qualifications Knowledge on Grafana, Elastic APM is beneficial. Knowledge on Cucumber is beneficial. Knowledge on Kubernetes is beneficial. At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes — an enterprise priority reflected in our mission.

Posted 3 weeks ago

Apply

8.0 - 12.0 years

15 - 19 Lacs

Bengaluru

Work from Office

Naukri logo

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together. Primary Responsibilities Lead end-to-end management of database operations across MSSQL, MySQL, and Oracle environments Own and enhance platform lifecycle management (PLM), including patching, upgrades, and performance tuning Design and implement automated, self-healing systems for proactive fault detection and recovery Build scalable automation for routine DBA tasks (backups, failovers, capacity planning, etc.) Ensure high availability, disaster recovery, and compliance of all data systems Collaborate with architects and engineering leads to define and evolve the data infrastructure roadmap Mentor and guide junior DBAs and data platform engineers, promoting best practices and continuous learning Establish and monitor KPIs for system reliability, performance, and platform health Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so Required Qualifications 10+ years of experience in database administration and operations (MSSQL, MySQL, Oracle) 3+ years in a leadership or managerial role with a solid track record of team development Experience with monitoring tools (e.g., Prometheus, Grafana, OEM, SolarWinds) Experience working in hybrid or cloud-native environments (Azure, AWS, or GCP) Deep understanding of PLM, capacity management, HA/DR, and database security Expertise in scripting (PowerShell, Bash, Python) and automation tools (Ansible, Terraform, etc.) Solid troubleshooting and performance tuning skills across DB platforms Familiarity with CI/CD practices and infrastructure automation Preferred Qualifications Experience with containerized DB deployments (e.g., Docker, Kubernetes) Exposure to self-service data platforms and DevOps for data Knowledge of AI/ML-based alerting or anomaly detection in ops Certifications in MSSQL, Oracle, MySQL, or relevant cloud platforms At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone - of every race, gender, sexuality, age, location and income - deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.

Posted 3 weeks ago

Apply

5.0 - 10.0 years

11 - 16 Lacs

Noida

Work from Office

Naukri logo

Primary Responsibilities Creation of Platform Support/Monitoring portal applications to assist in monitoring of inventories/API performance/data trends. Troubleshoot and resolve production defects at all levels. Make critical architecture decisions to guide the development of robust and scalable software solutions Define and enforce engineering policies and best practices for design, development, and delivery of software's Collaborate cross-functionally to ensure technical alignment with business goals Stay up-to-date with the latest industry trends and technologies Required Qualifications 10+ years of software engineering experience 3+ years of hands-on experience with DevOps principles, CI/CD pipelines and automation tools ( Git/GitHub Actions/Jenkins/GitHub version control...etc.) 2+ years of experience with Angular, .Net Technologies, Databases, Kubernetes and cloud technologies Ability to understand database structures and be able to manipulate, extract, and update data Proficient in SQL; experienced in developing stored procedures/views/triggers/indexes Preferred Qualifications Bachelor’s/Master’s degree in Computer Science/Software Engineering/ or related field 3+ years of experience with Aha, Rally, ServiceNow, and GitHub along with an understanding of DevOps 2+ years' Snowflake cloud data warehouse/platform experience Solid technical and platform knowledge, including some or all ofSPLUNK, Kafka, Queueing, ElasticAPM, Red Gate, Grafana, and Airflow. #LETSGROW At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes — an enterprise priority reflected in our mission.

Posted 3 weeks ago

Apply

3.0 - 6.0 years

8 - 12 Lacs

Bengaluru

Work from Office

Naukri logo

In this Site Reliability Engineer role, you will work closely with entire IBM Cloud organization to maintain and operationally improve the IBM cloud infrastructure. You will focus on the following key responsibilities: Ability to respond promptly to production issues and alerts 24x7 Execute changes in the production environment through automation Implement and automate infrastructure solutions that support IBM Cloud products and services to reduce toil. Partner with other SRE teams and program managers to deliver mission-critical services to IBM Cloud Build new tools to improve automated resolution of production issues Monitor, respond promptly to production alerts, Execute changes in Production through automation Support the compliance and security integrity of the environment Continually improve systems and processes regarding automation and monitoring. Required education Bachelor's Degree Preferred education Master's Degree Required technical and professional expertise Excellent written and verbal communication skills. Minimum 5+ years experience in handling large production systems environment Must be extremely comfortable using and navigating within a Linux environment Ability to do low level debugging and problem analysis by examining logs and running Unix commands Must be efficient in writing and debugging scripts 3-5+ years of experience in Virtualization Technologies and Automation / Configuration Managements Automation and configuration management tools/solutionsAnsible, Python, bash, Terraform, GoLang etc. (at least one) Virtualization technologiesCitrix Xen Hypervisor (Preferred), KVM(also preferred), libvirt, VMware vSphere, etc. (at least one) Monitoring technologiesZabbix, Sysdig, Grafana, Nagios, Splunk, etc. (at least one) Working knowledge with Container technologiesKubernetes, Docker, etc. Flexibility to work on shifts to handle production systems Preferred technical and professional experience Good experience inPublic cloud platforms,Kubernetes clusters and Strong Linux skills for managing services across microservices platform, good SRE knowledge in Cloud Compute, Storage and Network services.

Posted 3 weeks ago

Apply

4.0 - 9.0 years

17 - 22 Lacs

Bengaluru

Work from Office

Naukri logo

Job Summary: We are seeking a highly skilled Site Reliability Engineer (SRE) with experience to join our team in Bangalore. The ideal candidate will excel in implementing SRE principles to foster a culture of reliability, automation, and monitoring across our software engineering projects. This role is pivotal in ensuring the effective design, development, testing, and support of applications and systems, particularly within cloud environments. Software Requirements: Required Proficiency: Programming LanguagesTypeScript, Node.js Cloud EnvironmentsAWS (ECS Fargate, Vault, Lambda services, Artifactory) CI/CD ToolsGitHub Actions, JFrog Artifactory, Sysdig, Octopus, Terraform Observability ToolsObStack, Prometheus, Grafana, PagerDuty, Observe Infrastructure as Code (IaC) ToolsCloudFormation, Terraform Preferred Proficiency: Familiarity with additional programming languages or frameworks Experience with cloud platforms other than AWS Overall Responsibilities: Partner with senior stakeholders to lead a culture focused on data-driven reliability, monitoring, and automation in alignment with SRE principles. Design, develop, test, and support applications and systems, emphasizing managing and scaling distributed systems across cloud environments. Create and develop tools essential for the operational management and security of software applications and systems. Identify technology limitations and deficiencies in existing systems and implement scalable improvements. Drive automation efforts and enhance application monitoring capabilities. Review code developed by other engineers to ensure adherence to best practices. Thrive in incident response environments, conducting post-mortem analyses and designing secure solutions. Measure and optimize system performance, addressing customer needs and innovating for continuous improvement. Technical Skills (By Category): Programming Languages: Required: TypeScript, Node.js Cloud Technologies: Required: AWS (ECS Fargate, Lambda, Vault, Artifactory) Development Tools and Methodologies: Required: GitHub Actions, JFrog Artifactory, Sysdig, Octopus, Terraform Observability Tools: Required: ObStack, Prometheus, Grafana, PagerDuty, Observe Infrastructure as Code (IaC): Required: CloudFormation, Terraform Experience Requirements: 7 to 10 years of experience in software engineering and SRE practices. Experience in applying SRE practices in large organizations. Familiarity with modern software development practices and DevSecOps environments. Day-to-Day Activities: Collaborate with stakeholders to understand business needs and implement SRE practices. Lead cross-functional teams in enhancing system reliability and performance. Develop and maintain operational management tools for applications. Conduct regular code reviews and ensure adherence to best practices. Participate in incident response and post-mortem analysis to improve system resilience. Qualifications: Required: Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field. Commitment to continuous professional development through industry certifications and training. Professional Competencies: Strong critical thinking and problem-solving skills. Excellent leadership and teamwork abilities. Effective communication and stakeholder management skills. Adaptability and a learning-oriented mindset. Innovative thinking to drive continuous improvement. Strong time and priority management skills.

Posted 3 weeks ago

Apply

4.0 - 8.0 years

10 - 15 Lacs

Chennai, Bengaluru

Work from Office

Naukri logo

Key Responsibilities: Design and develop performance test plans, test scripts, and test scenarios based on business requirements and technical specifications. Execute performance tests using industry-standard tools (e.g., JMeter, Blazemeter) to assess system performance, scalability, and reliability. Analyze test results and identify performance bottlenecks, providing detailed reports and recommendations for improvement. Collaborate with development, QA, and operations teams to troubleshoot and resolve performance issues. Maintain and update performance testing documentation, including test plans, test cases, and test results. Stay up-to-date with the latest performance testing tools, trends, and best practices. Implementation of AI in performance testing Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. Proven experience as a Performance Tester or similar role, with a strong understanding of performance testing methodologies and tools. Proficiency in using performance testing tools such as JMeter, Blazemeter. Strong analytical and problem-solving skills, with the ability to interpret complex data and provide actionable insights. Experience with monitoring and profiling tools (e.g., Dynatrace, Datadog, Graffana) Excellent communication and collaboration skills, with the ability to work effectively in a team environment. Strong attention to detail and a commitment to delivering high-quality results. Preferred Qualifications: Experience with cloud-based performance testing and monitoring. Knowledge of CI/CD pipelines and integration of performance testing into the development lifecycle. Certification in performance testing or related areas is a plus. Apply Now! also you can share your updated resume on bheemashankar.nashi@photon.com

Posted 3 weeks ago

Apply

1.0 - 5.0 years

8 - 15 Lacs

Bengaluru

Work from Office

Naukri logo

Junior DevOps Engineer / DevOps Engineer Location: Bengaluru South, Karnataka, India Experience: 1.53 Years Compensation: 815 LPA Employment Type: Full-Time | Work From Office Only ________________________________________ Are you an aspiring DevOps professional ready to work on a transformative platform? Join a purpose-led team building India’s most disruptive ecosystem at the intersection of technology, property, and sustainability. This role is ideal for engineers who are eager to learn, automate, and contribute to building reliable, scalable, and secure infrastructure. Key Responsibilities Assist in designing, implementing, and managing CI/CD pipelines using tools like Jenkins or GitLab CI to automate build, test, and deployment processes. Support the deployment and management of cloud infrastructure, primarily on AWS, with exposure to Azure or GCP. Contribute to infrastructure as code practices using Terraform, CloudFormation, or Ansible. Participate in maintaining and operating containerized applications using Docker and Kubernetes. Implement and manage monitoring and logging solutions using Grafana, Loki, Prometheus, or ELK stack. Collaborate with engineering and QA teams to streamline release pipelines, ensuring high availability and performance. Develop basic automation scripts in Python or Bash to optimize and streamline operational tasks. Gain exposure to serverless and event-driven architectures under guidance from senior engineers. Troubleshoot infrastructure issues and contribute to system security and performance optimization. Requirements 1.5 to 3 years of experience in DevOps, SRE, or related infrastructure roles. Solid understanding of cloud environments (AWS preferred; Azure/GCP a plus). Basic to intermediate scripting knowledge in Python or Bash. Familiarity with CI/CD concepts and tools such as Jenkins, GitLab CI, etc. Working knowledge of Docker and introductory experience with Kubernetes. Exposure to monitoring and logging stacks (Grafana, Loki, Prometheus, ELK). Understanding of infrastructure as code using tools like Terraform or Ansible. Familiarity with networking, DNS, firewalls, and system security practices. Strong problem-solving skills and a learning mindset. Preferred Qualifications Certifications in AWS, Azure, or GCP. Exposure to serverless architectures and event-driven systems. Experience with additional monitoring tools or scripting languages. Familiarity with geospatial systems, virtual mapping, or sustainability-oriented platforms. Passion for eco-conscious technology and impact-driven development. Why You Should Join Contribute to a next-gen PropTech platform promoting sustainable and inclusive land ownership. Work closely with senior engineers committed to mentorship and ecosystem building. Join a team where your ideas are valued, your skills are sharpened, and your work has real-world impact. Be part of a vibrant, office-first culture that encourages innovation, collaboration, and growth.

Posted 3 weeks ago

Apply

7.0 - 12.0 years

20 - 25 Lacs

Hyderabad, Pune, Bengaluru

Work from Office

Naukri logo

Hi, Wishes from GSN!!! Pleasure connecting with you!!! We been into Corporate Search Services for Identifying & Bringing in Stellar Talented Professionals for our reputed IT / Non-IT clients in India. We have been successfully providing results to various potential needs of our clients for the last 20 years. At present, GSN is SRE Production Support hiring for one of our leading MNC client. PFB the details for your better understanding: Experience: 6+ Yrs Budget: 15LPA- 25LPA Work Location: BLR/HYD/PUNE Mode: WFO (5 Days in Office) Work Timing : 24/7 (cab facility and shift allowance will be provided) Whom we look for? We are looking for an experienced SRE (Site reliability Engineer) Should have worked in both Application Support(Java/.Net) Experience in L2 or L3 application support ( Alert Configuration + Dashboard Creation ) Experience with Release Management and Production Deployment Experience in Splunk Experience with Grafana If interested, kindly APPLY for IMMEDIATE response. Thanks & Rgds KAVIYA | GSN | Kaviya@gsnhr.net |Google Reviews: https://g.co/kgs/UAsF9W

Posted 3 weeks ago

Apply

7.0 - 10.0 years

15 - 20 Lacs

Hyderabad, Pune, Bengaluru

Work from Office

Naukri logo

Urgent Hiring | SRE L2 Support | 5 Days WFO | Immediate Joiners Only Job Details: Experience: 7-12 Years Location: Work From Office all 5 days Work Days: 5 Days/Week Joining: Immediate Only Key Responsibilities: SRE + Application Support(Java/.Net) + Release Management + Production Deployment + L2 Support(Alert Configuration + Dashboard Creation) + Splunk + Grafana KNOWLEDGE JD : Experience in supporting Large-Scale distributed systems involving multi-data center/PODS, load balancers, databases, Middleware, and multiple backend services, including microservices. Hands-on experience in diagnosing and resolving production issues, including Performance degradation, intermittent issues, log analysis, network failures, database failures, and code errors etc. Experience in implementing standard maintenance activities like DR Failover & Testing, security patching, Service Account & Certificate renewals. Experience in Production Deployment using CI/CD pipelines Splunk Query Skills: Ability to write effective Splunk queries for data analysis and monitoring Thanks & Rgds GSN HR || Email :Shobana@gsnhr.net || Web : www.gsnhr.net Google Reviews : https://g.co/kgs/UAsF9W

Posted 3 weeks ago

Apply

5.0 - 7.0 years

3 - 7 Lacs

Pune

Remote

Naukri logo

We are seeking a Grafana Implementation Expert with deep expertise in Grafana and Prometheus, focusing on core development and customization rather than SRE or DevOps responsibilities. This role requires a specialist in monitoring tools, responsible for designing, developing, and optimizing Grafana dashboards, plugins, and data sources to provide real-time observability and analytics. Key Responsibilities : - Develop, customize, and optimize Grafana dashboards with advanced visualizations, queries, and alerting mechanisms.- Integrate Grafana with Prometheus and other data sources (i.e. Loki, InfluxDB, Elasticsearch, MySQL, PostgreSQL, OpenTelemetry).- Extend Grafana capabilities by developing custom plugins, panels, and data sources using JavaScript, TypeScript, React, and Go.- Optimize Prometheus queries (PromQL) and storage solutions to ensure efficient data retrieval and visualization.- Automate dashboard provisioning using JSON, Terraform, or Grafana APIs for seamless deployment across environments.- Work closely with engineering teams to translate monitoring requirements into scalable and maintainable solutions.- Troubleshoot and enhance Grafana performance, including load balancing, scaling, and security hardening.- Implement advanced alerting mechanisms using Alertmanager, Grafana Alerts, and webhook integrations.- Stay updated on Grafana ecosystem advancements and contribute to best practices in observability tooling.- Document configurations, implementation guidelines, and best practices for internal stakeholders. Required Skills & Experience : - 5+ years of experience in monitoring and observability tools with a strong focus on Grafana and Prometheus.- Expertise in Grafana internals, including API usage, dashboard templating, and custom plugin development.- Strong hands-on experience with Prometheus, including metric collection, relabeling, and PromQL queries.- Proficiency in JavaScript, TypeScript, React, and Go for Grafana plugin and dashboard development.- Familiarity with infrastructure monitoring, including Kubernetes, cloud services (AWS, GCP, Azure), and system-level metrics. - Experience with time-series databases and log aggregation tools (i.e., Loki, Elasticsearch, InfluxDB). - Knowledge of security best practices in Grafana, including authentication, RBAC, and API security.- Experience with automation and infrastructure-as-code (IaC) for monitoring stack deployment.- Strong problem-solving skills with the ability to debug and optimize dashboards and alerting configurations.- Excellent communication and documentation skills to collaborate with cross-functional teams. Preferred Qualifications : - Grafana Certified Observability Engineer or equivalent certifications.- Experience contributing to open-source Grafana projects or plugin development.- Knowledge of distributed tracing tools like Jaeger or Zipkin.- Familiarity with service meshes (Istio, Linkerd) and their monitoring strategies.- This is a high-impact role focused on developing and enhancing Grafana-based monitoring solutions for enterprise-grade observability

Posted 3 weeks ago

Apply

3.0 - 6.0 years

4 - 8 Lacs

Karnataka

Work from Office

Naukri logo

Key Responsibilities : - Design, develop, and maintain backend services and automation tools using Golang - Build scalable and efficient microservices, RESTful APIs, and background jobs - Automate repetitive tasks and system processes across CI/CD, deployments, and data pipelines - Optimize code and systems for performance, reliability, and scalability - Collaborate with DevOps, QA, and other engineering teams to streamline operations and workflows - Write scripts and automation for provisioning, monitoring, and self-healing infrastructure - Maintain technical documentation for developed services, APIs, and scripts - Debug and troubleshoot issues across services and systems - Participate in code reviews, testing, and continuous integration activities - Research and implement tools and frameworks to improve development and automation efficiency Required Technical Skills : Programming Languages : - Strong proficiency in Go (Golang) - Familiarity with Python, Bash, or Shell scripting is a plus Automation & DevOps : - Hands-on experience with CI/CD pipelines (e., GitHub Actions, GitLab CI, Jenkins) - Proficiency in writing automation scripts and job schedulers - Familiarity with Ansible, Terraform, or other automation tools is a plus API Development : - RESTful API design, development, testing, and documentation - JSON, gRPC, and protocol buffers experience is a bonus Database Technologies : - Experience with both SQL (PostgreSQL, MySQL) and NoSQL (MongoDB, Redis) databases - Understanding of database schema design and query optimization Cloud & Containers : - Hands-on experience with Docker, Kubernetes, or other container orchestration tools - Familiarity with cloud platforms like AWS, GCP, or Azure Monitoring & Logging : - Working knowledge of tools like Prometheus, Grafana, ELK Stack, or Splunk Version Control : - Proficient in Git and Git workflows Preferred Qualifications : - Bachelors degree in Computer Science, Engineering, or related field - 36 years of backend development experience, with 2+ years in Golang - Experience working in Agile/Scrum teams - Exposure to event-driven architecture and message brokers like Kafka, RabbitMQ, or NATS Soft Skills : - Strong problem-solving and debugging skills - Good communication and documentation habits - Ability to work independently and within a team - Strong attention to detail and proactive attitude

Posted 3 weeks ago

Apply

4.0 - 8.0 years

13 - 17 Lacs

Bengaluru

Work from Office

Naukri logo

Roles & Responsibilities : - Working closely with the CTO and members of technical staff to meet deadlines. - Working with an agile team to setup and configure GitOps (CI/CD) based pipelines on GitLab - Create and deploy Edge AIoT pipelines using AWS Greengrass or Azure IoT - Design and develop secure cloud system architectures in accordance with enterprise standards - Package and automate deployment of releases using Helm charts - Analyze and optimize resource consumption of deployments - Integrate with Prometheus, Grafana, Kibana etc. for application monitoring - Adhering to best practices to deliver secure and robust solutions Requirements : - Experience with Kubernetes and AWS - Knowledge of cloud architecture concepts (IaaS, PaaS, SaaS) - Knowledge of Docker and Linux bash scripting - Strong desire to expand knowledge in modern cloud architectures - Knowledge of System Security Concepts (SAST, DAST, Penetration Testing, Vulnerability analysis) - Familiarity with version control concepts (Git)

Posted 3 weeks ago

Apply

5.0 - 10.0 years

13 - 22 Lacs

Hyderabad

Work from Office

Naukri logo

Hello Everyone, We are looking for the below requirement: Looking for the banking domain project/Trading Platform/ Investment Banking or Capital Marketing projects. Key Responsibilities: Maintain and enhance platform infrastructure across Linux and Windows environments Develop scripts to automate system monitoring, deployment, and recovery processes Troubleshoot and resolve environment-level issues impacting application performance Build and manage CI/CD pipelines using tools like Jenkins, Azure DevOps, or GitHub Actions Collaborate with development, support, and cloud teams to ensure high platform availability Support and automate tasks like patching, environment readiness, and DR test setups Work with DBAs, application teams, and product vendors to resolve infra-related performance bottlenecks Document processes, create knowledge articles, and ensure knowledge continuity Mandatory Skills: 5-9 years of experience in infrastructure/platform engineering Strong hands-on skills in Windows, Linux, Bash scripting and Powershell Experience with CI/CD pipelines and deployment automation Proficiency in tools such as Ansible, Jenkins, Azure DevOps, Git Experience with log aggregation and monitoring (e.g., ELK, Grafana, Prometheus) Comfortable supporting enterprise-grade applications in a financial services environment Preferred Skills: Exposure to Cloud platforms like AWS (especially EC2, S3, IAM, CloudWatch) Familiarity with application support tools and release pipelines SQL knowledge and ability to work with DB teams for performance tuning Prior experience working with geographically distributed teams. Interested candidates or references please drop cv: sireesha.r@thehirewings.com/ careers@thehirewings.com/9346429928/6304852810

Posted 3 weeks ago

Apply

12.0 - 16.0 years

40 - 45 Lacs

Hyderabad

Work from Office

Naukri logo

Overview The Grafana and Elastic Architect will maintain and optimize the observability platform, ensure cost-effective operations, define guardrails, and promote best practices. This role will oversee the platforms BAU support, manage vendors and partners, and collaborate closely with application owners to onboard applications. The Architect will also lead the deployment of AI Ops and other advanced features within Grafana and Elastic while working with other observability, ITSM, and platform architects. This position includes people management responsibilities and involves leading a team to achieve operational excellence. Responsibilities Key Responsibilities: 1. Platform Ownership & Cost Optimization Maintain and enhance the Grafana and Elastic platforms to ensure high availability and performance. Implement cost control mechanisms to optimize resource utilization across Observability platforms. Establish platform guardrails, best practices, and governance models. 2. BAU Support & Vendor/Partner Management Manage day-to-day operations, troubleshooting, and platform improvements. Engage and manage third-party vendors and partners to ensure SLA adherence and platform reliability. Work closely with procurement and finance teams to manage vendor contracts and renewals. 3. Application Onboarding & Collaboration Partner with application owners and engineering teams to onboard applications onto the Observability platform. Define standardized onboarding frameworks and processes for application teams. Ensure seamless integration with existing observability solutions like AppDynamics, ServiceNow ITOM, and other monitoring tools. 4. AI Ops & Advanced Features Implementation Deploy AI Ops capabilities within Grafana and Elastic to enhance proactive monitoring and anomaly detection. Implement automation and intelligent alerting to reduce MTTR and operational overhead. Stay updated with industry trends and recommend innovative AI-driven observability enhancements. 5. Cross-Functional Collaboration Work closely with architects of AppDynamics, ServiceNow, and other Observability platforms to ensure an integrated monitoring strategy. Align with ITSM, DevOps, and Cloud teams to create a holistic observability roadmap. Lead knowledge-sharing sessions and create technical documentation for the team. 6. People & Team Management Lead and managed a team responsible for Grafana and Elastic observability operations. Provide mentorship, coaching, and career development opportunities for team members. Define team goals, monitor performance, and drive continuous improvement in Observability practices. Foster a culture of collaboration, innovation, and accountability within the team. Qualifications Technical Expertise 12+ years of experience in IT Operations, Observability, or related fields. Strong expertise in Grafana and Elastic Stack (Elasticsearch, Logstash, Kibana). Experience in implementing AI Ops, machine learning, or automation within observability platforms. Proficiency in scripting and automation (Python, Ansible, Terraform) for Observability workloads. Hands-on experience with cloud-based Observability solutions, particularly in Azure environments. Familiarity with additional monitoring tools like AppDynamics, ServiceNow ITOM, SevOne, and ThousandEyes. Leadership & Collaboration Experience in managing vendors, contracts, and external partnerships. Strong stakeholder management skills and ability to work cross-functionally. Excellent communication and presentation skills. Ability to lead and mentor junior engineers in Observability best practices.

Posted 3 weeks ago

Apply

12.0 - 15.0 years

55 - 60 Lacs

Ahmedabad, Chennai, Bengaluru

Work from Office

Naukri logo

Dear Candidate, We are hiring an Observability Engineer to enhance monitoring, tracing, and system visibility. Key Responsibilities: Build and maintain observability dashboards. Integrate logging, metrics, and tracing. Help teams debug using distributed tracing and logs. Required Skills & Qualifications: Experience with OpenTelemetry, Grafana, Prometheus, Loki. Strong scripting and DevOps background. Understanding of distributed systems. Soft Skills: Strong troubleshooting and problem-solving skills. Ability to work independently and in a team. Excellent communication and documentation skills. Note: If interested, please share your updated resume and preferred time for a discussion. If shortlisted, our HR team will contact you. Srinivasa Reddy Kandi Delivery Manager Integra Technologies

Posted 3 weeks ago

Apply

4.0 - 7.0 years

10 - 15 Lacs

Noida

Work from Office

Naukri logo

As a Consultant in Automation domain, you will be responsible for delivering automation use cases enabled by AI and Cloud technologies. In this role, you play a crucial part in building the next-generation autonomous networks. You will develop efficient and scalable automation solutions, you will leverage your technical expertise, problem-solving abilities, and domain knowledge to drive innovation and efficiency. You have: Bachelor's degree in Computer Science, Engineering, or a related field preferred, with 8-10+ years of experience in automation or telecommunications. Understanding of telecom network architecture, including Core networks, OSS, and BSS ecosystems, along with industry frameworks like TM Forum Open APIs and eTOM. Practical experience in programming and scripting languages such as Python, Go, Java, or Bash, and automation tools like Terraform, Ansible, and Helm. Hands-on experience with CI/CD pipelines using Jenkins, GitLab CI, or ArgoCD, as well as containerization (Docker) and orchestration (Kubernetes, OpenShift). It would be nice if you also had: Exposure to agile development methodologies and cross-functional collaboration. Experience with real-time monitoring tools (Prometheus, ELK Stack, OpenTelemetry, Grafana) and AI/ML for predictive automation and network optimization is a plus. Familiarity with GitOps methodologies and automation best practices for telecom environments. Design, develop, test, and deploy automation scripts using languages such as Python, Go, Bash, or YAML. Automate the provisioning, configuration, and lifecycle management of network and cloud infrastructure. Design and maintain CI/CD pipelines using tools like Jenkins, GitLab CI/CD, ArgoCD, or Tekton. Automate continuous integration, testing, deployment, and rollback mechanisms for cloud-native services. Implement real-time monitoring, logging, and tracing using tools such as Prometheus, Grafana, ELK, and OpenTelemetry. Develop AI/ML-driven observability solutions for predictive analytics and proactive fault resolution, integrating AI/ML models to enable predictive scaling. Automate self-healing mechanisms to remediate network and application failures. Collaborate with DevOps and Network Engineers to align automation with business goals.

Posted 3 weeks ago

Apply

3.0 - 6.0 years

8 - 13 Lacs

Bengaluru

Work from Office

Naukri logo

As a vLab R&D Cloud Engineer, your job requires expertise in Cloud Computing Platforms, Linux and Networking. You have: 8+ years of relevant experience on deployment and troubleshooting of Infrastructure/Platforms especially OpenShift and ACM. Red Hat Certified OpenShift Administrator certification is must Prior troubleshooting experience on OpenStack, Kubernetes and OpenShift Platforms. Expert in software engineering practices like DevOps, Agile Methodologies, Continuous Integration and Test Automation. Practical experience with Kubernetes (K8s), podman and containerized infrastructure management. Expertise in Git, Gerrit, Jenkins, ArgoCD, Ansible, Python scripting for automation and deploying and maintaining common services like Kafka, Redis, Prometheus, Grafana, etc. Expertise in Layer 2/Layer 3 Data Networking. It would be nice if you also had: BE/BTech/MTech in Engineering Degree required. Good knowledge in troubleshooting Ceph and Openshift Data Foundation (ODF) issues and good knowledge of HP/Airframe/Dell NFVI x.x hardware. Good communication, organizational and problem-solving skills. Ability to identify and implement platform/process improvements, create new procedures and ability to work with a global team. Red Hat Certified Specialist in MultiCluster Management certification Learn to Deploy and maintain common cloud services platforms for Cloud and Network Services to meet security, performance, scalability, and reliability requirements. Collaborate with global cross-functional teams to design and implement solutions in a microservices architecture. Explore and implement best practices for continuous integration and continuous deployment (CI/CD). Contribute to short / mid-term decisions in own area and be part of high-performance team. Learn new platform as it evolves

Posted 3 weeks ago

Apply

Exploring Grafana Jobs in India

Grafana is a popular tool used for monitoring and visualizing metrics, logs, and other data. In India, the demand for Grafana professionals is on the rise as more companies are adopting this tool for their monitoring and analytics needs.

Top Hiring Locations in India

  1. Bangalore
  2. Hyderabad
  3. Pune
  4. Mumbai
  5. Delhi

Average Salary Range

The average salary range for Grafana professionals in India varies based on experience level: - Entry-level: ₹4-6 lakhs per annum - Mid-level: ₹8-12 lakhs per annum - Experienced: ₹15-20 lakhs per annum

Career Path

A typical career path in Grafana may include roles such as: 1. Junior Grafana Developer 2. Grafana Developer 3. Senior Grafana Developer 4. Grafana Tech Lead

Related Skills

In addition to Grafana expertise, professionals in this field often benefit from having knowledge or experience in: - Monitoring tools such as Prometheus - Data visualization tools like Tableau - Scripting languages (e.g., Python, Bash) - Understanding of databases (e.g., SQL, NoSQL)

Interview Questions

  • What is Grafana and how is it used? (basic)
  • Explain the difference between Grafana and Kibana. (basic)
  • How do you create a dashboard in Grafana? (medium)
  • What are plugins in Grafana and how can they be used? (medium)
  • How can you integrate Grafana with Prometheus for monitoring? (advanced)
  • Explain how alerting works in Grafana. (advanced)
  • How do you optimize queries in Grafana for better performance? (advanced)

Closing Remark

As the demand for Grafana professionals continues to grow in India, it is essential to stay updated with the latest trends and technologies in this field. Prepare thoroughly for interviews and showcase your skills confidently to land your dream job in Grafana. Good luck!

cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies