Jobs
Interviews

7 Alerting Tools Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

0.0 - 4.0 years

0 Lacs

nagpur, maharashtra

On-site

As a MySQL Database Administrator, you will be responsible for installing, configuring, and maintaining Microsoft SQL Server databases across development and production environments. Your role will involve ensuring high levels of performance, availability, sustainability, and security of the databases. You will be required to monitor database performance, troubleshoot database-related issues, and implement changes to optimize efficiency. Additionally, you will be responsible for performing regular data backups, recovery, and disaster recovery processes. Collaboration with application developers and business analysts to support database needs will be a crucial aspect of your responsibilities. You will manage database access, user roles, and permissions while also writing and maintaining scripts and automation tools for database maintenance. Participation in on-call rotation and providing after-hours support when necessary will be expected. The ideal candidate for this role should possess a Bachelor's degree in Computer Science, Information Technology, or a related field, along with more than 6 months of experience as a SQL Database Administrator. Proficiency in SQL Server Administration, T-SQL & Scripting, Database Performance Tuning, Backup and Recovery, Monitoring and Alerting Tools, as well as strong analytical and problem-solving skills are essential. Good communication and team collaboration skills are also required for this position. This is a full-time job with rotational work timings and a flexible work location. The work will be conducted in person at the specified location in Nagpur.,

Posted 2 days ago

Apply

3.0 - 7.0 years

0 Lacs

haryana

On-site

Job Description: You are a skilled and experienced Site Reliability Engineering (SRE) Consultant with over 7 years of experience. As an SRE Consultant, you will be responsible for implementing, maintaining, and enhancing the reliability, scalability, and performance of systems. Your role will involve collaborating closely with development teams to design and deploy robust and scalable solutions. Your responsibilities will include implementing best practices for reliability, scalability, and performance, collaborating with development teams to meet SRE standards, monitoring system performance, troubleshooting issues, implementing automation for process optimization, planning and executing system upgrades and migrations, providing on-call support for critical incidents, documenting processes and procedures, and staying updated on industry trends and best practices. To excel in this role, you should have a Bachelor's degree in Computer Science or a related field, at least 3 years of experience in Site Reliability Engineering or a related field, strong knowledge of cloud technologies and platforms such as AWS, GCP, and Azure, experience with monitoring and alerting tools like Prometheus, Grafana, and Datadog, proficiency in scripting and automation using tools like Python and Bash, strong problem-solving skills, attention to detail, excellent communication and teamwork skills, and the ability to work independently and collaborate effectively with cross-functional teams. Key Skills: SRE Engineer, Site Reliability, Resiliency, Cloud Technologies, AWS, GCP, Azure, Monitoring Tools, Automation, Problem Solving, Communication Skills, Teamwork, Computer Science, Reliability, Performance, Scalability, On-Call Support.,

Posted 3 days ago

Apply

2.0 - 5.0 years

5 - 10 Lacs

Hyderabad

Remote

Job Title: Network Engineer NOC Team Work Type : Rotational Shifts (including nights/weekends/on-call support as needed) Position Summary: We are looking for an enthusiastic and detail-oriented L1+ Network Engineer to join our Network Operations Center (NOC) team. The ideal candidate will play a key role in monitoring, triaging, and supporting enterprise network infrastructure, ensuring high availability and performance across all systems. This role is perfect for professionals with foundational networking knowledge and hands-on experience in troubleshooting and escalation within a 24/7 operational environment. Key Responsibilities: Proactively monitor network infrastructure using tools and dashboards (e.g. PingMon, SNMP-based monitoring, syslogs, and alerting platforms). Perform first-level (L1.5) network incident analysis, troubleshooting, and documentation. Execute basic network troubleshooting commands and tasks such as: ping, traceroute, Routing Management (Add/Delete/Change) Respond to alerts, triage issues, and perform Catch & Dispatch activities to appropriate Tier 2/3 teams or vendors. Document incidents, workarounds, and resolutions accurately in the ticketing system (e.g., ServiceNow). Escalate unresolved or critical issues in a timely manner with all necessary diagnostics. Support implementation of standard operating procedures (SOPs) and maintain process compliance. Assist with maintenance activities and participate in planned outage support. Coordinate with internal teams and vendors for timely incident resolution and follow-up. Perform basic configuration changes on network devices under supervision. Assist in maintaining and updating NOC documentation and network topology records. Required Skills & Experience: 1–3 years of hands-on experience in a NOC or network support role. Strong knowledge of networking fundamentals (TCP/IP, DNS, DHCP, VPN, LAN/WAN, VLAN). Familiarity with network troubleshooting tools and CLI commands (ping, traceroute, route configuration). Understanding of monitoring systems and alerting tools (PingMon, SolarWinds, Nagios, etc.). Experience with ticketing tools such as ServiceNow, JIRA or Remedy. Basic knowledge of routers, switches, firewalls, and wireless access points. Exposure to Cisco, Aruba, or similar network technologies. Strong verbal and written communication skills. Ability to remain calm and effective under pressure and during incidents. Willingness to work in a rotating shift and be part of 24/7 support. Nice to Have: CompTIA Network+ / CCNA / CCNP certification (or working toward it). Exposure to SD-WAN, cloud networking, or network automation tools. Scripting knowledge (Bash, Python) for automation is a plus. Working Conditions: Must be available for 24/7 shift rotations, including weekends and holidays. On-call support for incident escalations and major outages. Office and NOC environment with standard and extended monitoring hours.

Posted 5 days ago

Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

You will be joining our client's team as a Site Reliability Engineer, where your main responsibility will be to ensure the reliability and uptime of critical services. Your focus will include Kubernetes administration, CentOS servers, Java application support, incident management, and change management. The ideal candidate for this role will have strong experience with ArgoCD for Kubernetes management, Linux skills, basic scripting knowledge, and familiarity with modern monitoring, alerting, and automation tools. We are looking for a self-motivated individual with excellent communication skills, both oral and written, who can work effectively both independently and collaboratively. Your responsibilities will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will be conducting routine tasks for system and application maintenance and following SOPs to correct or prevent issues. Responding to and managing running incidents, including post-mortem meetings, root cause analysis, and timely resolution will also be part of your responsibilities. Additionally, you will be monitoring production systems, applications, and overall performance, using tools to detect abnormal behaviors in the software and collecting information to help developers understand the issues. Security checks, running meetings with business partners, writing and maintaining policy and procedure documents, writing scripts or code as necessary, and learning from post-mortems to prevent new incidents are also key aspects of the role. Technical skills required for this position include: - 5+ years of experience in a SaaS and Cloud environment - Administration of Kubernetes clusters, including management of applications using ArgoCD - Linux scripting to automate routine tasks and improve operational efficiency - Experience with database systems like MySQL and DB2 - Experience as a Linux (CentOS / RHEL) administrator - Understanding of change management procedures and enforcement of safe and compliant changes to production environments - Knowledge of on-call responsibilities and maintaining on-call management tools - Experience with managing deployments using Jenkins - Prior experience with monitoring tools like New Relic, Splunk, and Nagios - Experience with log aggregation tools such as Splunk, Loki, or Grafana - Strong scripting knowledge in one of Python, Ruby, Bash, Java, or GoLang - Experience with API programming and integrating tools like Jira, Slack, xMatters, or PagerDuty If you are a dedicated professional who thrives in a high-pressure environment and enjoys working on critical services, this opportunity could be a great fit for you.,

Posted 1 week ago

Apply

15.0 - 20.0 years

0 Lacs

karnataka

On-site

As a Deputy Manager for Global GM DEC UBIX APS at BNP Paribas India Solutions in Bangalore, your primary responsibility is to oversee the Application Production Support teams for multiple Transversal applications across regions such as APAC, EUR, and AMERICAS. You will serve as the main point of contact for Users, Support team, and management to ensure expected service levels are met. Additionally, you will drive governances, manage stakeholder expectations, and lead various Automation, Monitoring & Tooling initiatives across Transversal APS and other teams within CIB APS. Your key responsibilities include performing Application Stability initiatives, Service Management activities, incident, problem, and change management reviews, and driving IPC Improvements initiatives across Multiple APS Teams. You will also be involved in Hiring and Recruitment topics, generating and reporting Production KPIs, SLs, and Dashboards, maintaining Training Dashboards, and preparing presentations for governances and steering committees. Furthermore, you will be accountable for the maintenance of Business continuity Plans, IT continuity plans, coordinating BCP and Disaster recovery exercises for Transversal, and contributing to various technical and behavioral competencies. Strong project management skills with a technical background in Unix, Oracle, SQL, knowledge of Project management tools, ITIL, and domain expertise in Global Markets and/or Global Banking are essential for this role. The ideal candidate should possess 15-20 years of IT experience, strong analytical skills, experience in managing international teams, and the ability to work under pressure. Certifications such as PMP, Prince2, ITIL, Devops, Cloud, Kubernetes, and prior knowledge of Application Production Support and DevOPS methodology are desirable qualifications. Education Level required for this position is a Bachelor's Degree or equivalent, and the experience level should be at least 15 years. Strong behavioral skills like ability to deliver, creativity & innovation, collaboration, and organizational skills, along with transversal skills like process development, strategic thinking, skills development, performance indicators setup, and analytical ability are crucial for success in this role.,

Posted 1 week ago

Apply

1.0 - 3.0 years

3 - 7 Lacs

Noida

Work from Office

About the Role: As a TechOps Engineer you will troubleshoot, debug, evaluate and resolve customer impacting issues with a focus on detecting patterns and working with the engineering development and or product teams to eliminate defects. The position requires a combination of strong troubleshooting, technical, communication and problem solving skills. This job requires you to constantly hit the ground running and your ability to learn quickly and work on disparate and overlapping tasks will define your success. Key Responsibilities • Deployment of new releases , environments for applications. • Responding to emails and incident tickets, maintaining issue ownership. • Build and maintain highly scalable, large scale deployments globally • Co-Create and maintain architecture for 100% uptime. E.g. creating alternate connectivity. • Practice sustainable incident response/management and blameless post-mortems. • Monitor and maintain production environment stability. • Perform production support activities which involve the assignment of issues and issue analysis and resolution within the specified SLAs. • Coordinate with the Application Development Team to resolve issues on production. • Suggest fixes to complex issues by doing a thorough analysis of root cause and impact of the defect. • Provide daily support with a resolution of escalated tickets and act as a liaison to business and technical leads to ensure issues are resolved in a timely manner. • Technical hands-on troubleshooting, including parsing logs and following stack traces. • Efficiently do multi-tasking where the job holder will have to handle multiple customer requests from various sources. • Identifying and documenting technical problems, ensuring timely resolution. • Prioritize workload, providing timely and accurate resolutions. • Should be highly collaborative with the team, and other stakeholders. Experience and Skills: • Self-motivated, ability to do multitasking efficiently. • Database queries execution experience in any of DB (MySQL,Postgres /Mongo) • Basic Linux OS knowledge • Hands-on experience on Shell/UNIX commands. • Experience in Monitoring tools like Grafana, Logging tool like ELK. • Rest API working experience to execute curl, Analysing request and response, HTTP codes etc. • Knowledge on Incidents and escalation practices. • Ability to troubleshoot issues and able to handle different types of customer inquiries. • Should have worked in incident management tools like service now.

Posted 1 month ago

Apply

1.0 - 3.0 years

3 - 6 Lacs

Pune

Work from Office

About the Role: As a TechOps Engineer you will troubleshoot, debug, evaluate and resolve customer impacting issues with a focus on detecting patterns and working with the engineering development and or product teams to eliminate defects. The position requires a combination of strong troubleshooting, technical, communication and problem solving skills. This job requires you to constantly hit the ground running and your ability to learn quickly and work on disparate and overlapping tasks will define your success. Key Responsibilities • Deployment of new releases , environments for applications. • Responding to emails and incident tickets, maintaining issue ownership. • Build and maintain highly scalable, large scale deployments globally • Co-Create and maintain architecture for 100% uptime. E.g. creating alternate connectivity. • Practice sustainable incident response/management and blameless post-mortems. • Monitor and maintain production environment stability. • Perform production support activities which involve the assignment of issues and issue analysis and resolution within the specified SLAs. • Coordinate with the Application Development Team to resolve issues on production. • Suggest fixes to complex issues by doing a thorough analysis of root cause and impact of the defect. • Provide daily support with a resolution of escalated tickets and act as a liaison to business and technical leads to ensure issues are resolved in a timely manner. • Technical hands-on troubleshooting, including parsing logs and following stack traces. • Efficiently do multi-tasking where the job holder will have to handle multiple customer requests from various sources. • Identifying and documenting technical problems, ensuring timely resolution. • Prioritize workload, providing timely and accurate resolutions. • Should be highly collaborative with the team, and other stakeholders. Experience and Skills: • Self-motivated, ability to do multitasking efficiently. • Database queries execution experience in any of DB (MySQL,Postgres /Mongo) • Basic Linux OS knowledge • Hands-on experience on Shell/UNIX commands. • Experience in Monitoring tools like Grafana, Logging tool like ELK. • Rest API working experience to execute curl, Analysing request and response, HTTP codes etc. • Knowledge on Incidents and escalation practices. • Ability to troubleshoot issues and able to handle different types of customer inquiries. • Should have worked in incident management tools like service now.

Posted 2 months ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies