
618 Zabbix Jobs - Page 14

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

0 years

1 - 3 Lacs

Mohali

On-site

Job Summary: We are looking for an experienced Server and Network Administrator to manage, maintain, and optimize our company’s IT infrastructure. This role involves overseeing servers, network devices, and security systems to ensure high availability, performance, and security across the organization. Key Responsibilities: Install, configure, and maintain physical and virtual servers (Windows/Linux). Manage and monitor network infrastructure including routers, switches, firewalls, and VPNs. Ensure server and network security through proper configurations, updates, and patches. Monitor system performance and troubleshoot hardware, software, and network issues. Manage backups, disaster recovery plans, and data integrity. Plan and implement network upgrades and expansions. Maintain documentation of network configurations, server setups, and IT procedures. Collaborate with IT team to design and implement IT infrastructure solutions. Respond to and resolve incidents affecting servers and network services promptly. Administer Active Directory, DNS, DHCP, and other network services. Manage user access and permissions to systems and network resources. Ensure compliance with company policies and industry best practices. Provide technical support and training to staff related to network and server systems. Required Skills and Qualifications: Proven experience as a Server Administrator, Network Administrator, or similar role. Strong knowledge of Windows Server and Linux operating systems. Hands-on experience with network devices (Cisco, Juniper, etc.). Proficiency with Active Directory, DNS, DHCP, and group policy management. Understanding of network protocols (TCP/IP, VLAN, VPN, etc.). Experience with virtualization technologies (VMware, Hyper-V). Familiarity with cloud services (AWS, Azure) is a plus. Knowledge of firewall and security best practices. Strong problem-solving skills and ability to work under pressure. Good communication and documentation skills. 
Relevant certifications (e.g., CompTIA Network+, Cisco CCNA, Microsoft MCSE) preferred. Preferred Qualifications: Experience managing enterprise-grade IT infrastructure. Familiarity with automation tools and scripting (PowerShell, Bash). Knowledge of monitoring tools (Nagios, Zabbix, SolarWinds). Understanding of compliance standards (ISO 27001, GDPR, PCI-DSS). What We Offer: Competitive salary and benefits. Professional development opportunities. Dynamic and collaborative work environment. Job Type: Full-time Pay: ₹10,000.00 - ₹30,000.00 per month Schedule: Day shift Work Location: In person

Posted 1 month ago


5.0 - 8.0 years

4 - 8 Lacs

Bengaluru

Work from Office

Bachelor’s degree in Network and/or Telecoms Engineering, with 5 to 8 years of computer network and deployment experience. Network design, systems engineering, and network security experience is mandatory. Hands-on experience with network management tools (Zabbix, SolarWinds, etc.) and traffic analysis tools (e.g., Wireshark, iperf). CCNA/CCNP is preferable. Roles and Responsibilities: Under general supervision, develops, codes, tests, and debugs new software or enhancements to existing software for customers. Requires a good understanding of business applications. Works with technical staff to understand problems with software and resolve them. Resolves customer complaints about software and responds to suggestions for improvements and enhancements from customers. May assist in development of software user manuals. Demonstrates software. Note: If the incumbent is responsible for the development of software for internal use, please match to a position in the Application Development sub-family grouping.

Posted 1 month ago


10.0 - 15.0 years

0 Lacs

India

On-site

Company Description: Extreme Compute is India's first secure cloud service provider that combines high-speed computing with banking-grade security, storage solutions, multi-cloud, disaster recovery, and APM. Our integrated solutions eliminate the need for separate procurement and integration of each element. At Extreme Compute, we address the growing need for enterprise-grade cloud solutions that offer flexibility, scalability, simplicity, and vendor independence. Role Description: We are looking to hire a seasoned Ansible Automation Expert with 10-15 years of experience in Linux infrastructure and automation. The ideal candidate will have deep command over Ansible, strong scripting ability, and a practical understanding of OpenSCAP or related compliance automation tools. This is a strategic role driving automation-first infrastructure and ensuring secure, scalable deployment practices. Key Responsibilities: Design, build, and maintain Ansible playbooks, roles, and modules for infrastructure provisioning, configuration management, and application deployment. Lead infrastructure-as-code (IaC) practices across multi-environment setups (dev/test/prod). Collaborate with security teams to integrate OpenSCAP or other SCAP tools for compliance automation. Ensure idempotency, scalability, and reusability in all automation artifacts. Automate Linux system hardening, patching, and audit logging as per industry benchmarks. Provide technical leadership and mentorship to junior engineers on Ansible and automation best practices. Work closely with DevOps, Security, and Cloud teams for continuous integration and delivery. Required Skills: 10-15 years of overall IT infrastructure experience with a focus on Linux (RHEL/SLES/Ubuntu). Proven expertise in Ansible Core & Tower/AWX, including dynamic inventories, roles, conditionals, Jinja2 templating, and error handling.
Experience in automating compliance/security policies using tools like OpenSCAP, SCAP Security Guide, or CIS Benchmarks. Strong knowledge of YAML, Bash/Shell scripting, and secure secrets handling (Ansible Vault, HashiCorp Vault). Deep understanding of CI/CD pipelines, GitOps workflows, and integration with Jenkins, GitLab, or similar tools. Solid grasp of RBAC, MFA, audit trails, and secure SSH practices in automation. Preferred skills (Good-to-have): Certifications like Red Hat Certified Specialist in Ansible Automation or RHCE. Hands-on with cloud platforms (AWS, Azure, GCP) and hybrid deployments. Familiarity with containerized environments (Docker, Kubernetes) and automating them with Ansible. Experience with monitoring, logging, and backup tools (Zabbix, ELK, Restic, etc.). Why Join Us: Work with cutting-edge automation and compliance tooling. Be part of a security-first, innovation-led infrastructure team. Opportunity to lead and shape automation practices in a high-impact role.

Posted 1 month ago


2.0 - 4.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Sygitech Solutions is a team of dedicated solutions consultants helping businesses transform data into profit by linking technology to business goals. Whether businesses require reliable software solutions or custom application development services, Sygitech is a full-service technology provider. We offer customized IT solutions and services that reduce costs and improve agility for enterprises. We have worked with renowned brands and small businesses, receiving exceptional feedback and reports of increased conversion rates. Role Overview We are looking for an experienced Linux – DevOps Engineer to join our team. The ideal candidate will have strong expertise in Linux systems, DevOps practices, and cloud environments (AWS, GCP, Azure). The role involves troubleshooting system issues, optimizing performance, managing infrastructure, and automating processes to ensure seamless IT operations. Key Responsibilities: Infrastructure Management Understand client infrastructure and various applications running on the servers. Periodically review and provide infrastructure updates and recommendations. Configure, optimize, and maintain Linux-based environments (CentOS, RedHat, Ubuntu, etc.). Monitoring & Troubleshooting Monitor infrastructure and running services; take proactive action on alerts. Diagnose and resolve operating system-related issues, including system booting, system recovery, and performance tuning. Ensure high availability of cloud-based infrastructure. Cloud & DevOps Work with AWS services, including EC2, Auto Scaling, VPC, Load Balancer, CodeDeploy, CodePipeline, IAM, and security groups. Experience with GCP and Azure is a plus. Manage Kubernetes clusters and CI/CD pipelines for deployments. Automate and optimize infrastructure using Terraform, Ansible, or scripting. Security & Compliance Ensure adherence to security best practices and compliance requirements. Implement security policies, backup strategies, and disaster recovery procedures. 
Incident & Problem Management: Provide on-call support and resolve incidents within SLAs. Conduct Root Cause Analysis (RCA) for system failures and performance issues. Take a proactive approach to avoid downtime and optimize system health. Innovation and Knowledge Sharing: Work with the business development team to provide infrastructure insights for proposals. Continuously innovate and implement automation to reduce manual efforts. Share knowledge and mentor junior team members. Qualifications: Technical Expertise: Strong understanding of Linux OS (CentOS, RedHat, Ubuntu, etc.). Hands-on experience with monitoring tools (Nagios, Zabbix, New Relic, etc.). Good knowledge of performance monitoring tools (iostat, vmstat, etc.). Strong troubleshooting skills for OS, cloud infrastructure, and network issues. Experience in Apache, Nginx, Varnish, HAProxy, MySQL replication, MongoDB Replica configuration. Knowledge of DevOps tools (Terraform, Ansible, Jenkins, Docker, Kubernetes). Soft Skills: Ability to work in crisis situations and provide quick resolutions. Strong communication and collaboration skills. A team player with an analytical mindset. Education & Experience: Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent experience). 2-4 years of experience. Why Join Sygitech? Work with cutting-edge cloud and DevOps technologies. Competitive compensation and performance-based incentives. A collaborative culture focused on innovation and growth. Opportunities for professional development and career advancement.

Posted 1 month ago


4.0 years

0 Lacs

Mumbai Metropolitan Region

On-site

We are seeking a highly skilled and proactive System Administrator with over 4 years of hands-on experience in managing IT infrastructure, system performance, and security across multi-platform environments. The ideal candidate should have strong expertise in Windows and/or Linux systems, network configurations, cloud services, and troubleshooting enterprise-level issues. Key Responsibilities: Manage, configure, and monitor Windows and Linux servers (on-premises and/or cloud). Maintain and upgrade system software and firmware on a regular basis. Apply OS patches and upgrades regularly, and upgrade administrative tools and utilities. Monitor system performance, availability, and capacity planning. Administer cloud environments (AWS / Azure / GCP). Manage virtual environments (VMware, Hyper-V, or similar platforms). Implement backup and disaster recovery plans using cloud and local tools. Ensure system security through firewalls, antivirus, patch management, and access controls. Conduct regular audits and vulnerability scans. Assist in compliance efforts (e.g., ISO, GDPR, SOC2, HIPAA depending on industry). Troubleshoot and resolve network issues including DNS, DHCP, VPN, routing, and firewalls. Manage network hardware (switches, routers, access points, etc.). Collaborate with Network Engineers on LAN/WAN deployments and performance tuning. Provide Level 2/3 support to internal users for hardware, software, and connectivity issues. Manage user accounts, permissions, and access rights in Active Directory or similar identity systems. Train and support junior team members and document processes. Use system monitoring tools (Nagios, Zabbix, SolarWinds, etc.) to track performance and incidents. Generate reports for uptime, usage, and anomalies. Maintain logs for audit and Skills & Qualifications: Bachelor's degree in Computer Science, IT, or a related field (or equivalent experience). 4+ years of experience in System Administration.
Strong knowledge of Windows Server and/or Linux OS administration. Experience with cloud platforms (AWS, Azure, GCP). Proficiency in scripting (PowerShell, Bash, or Python). Working knowledge of networking protocols and tools (TCP/IP, DNS, DHCP, VPN). Hands-on with Active Directory, Group Policy, DNS, DHCP, WSUS, etc. Familiarity with backup and recovery software and DR strategies. Solid understanding of ITIL processes (preferred). Nice To Have: Relevant certifications (MCSA/MCSE, RHCSA/RHCE, AWS/Azure Associate, CompTIA Security+). Experience with containerization (Docker, Kubernetes). Familiarity with DevOps tools (Ansible, Jenkins, Terraform) is a plus. Exposure to ITSM tools like Jira, ServiceNow, etc.

Posted 1 month ago


4.0 years

0 Lacs

Sadar, Uttar Pradesh, India

On-site

About The Role We're looking for a highly experienced DevOps Engineer with a strong foundation in Linux, Kubernetes, AWS, and scripting. You will play a crucial role in automating, scaling, and securing infrastructure for high-traffic systems, particularly in FinTech and banking environments. The ideal candidate brings a hands-on mindset, excellent troubleshooting skills, and a passion for system optimization and automation. What You'll Be Doing Manage and maintain high-traffic production systems in cloud and on-premise environments Deploy, configure, and monitor infrastructure using tools like Ansible, Chef, or Puppet Administer Kubernetes clusters and Docker containers for microservice deployments Implement and support edge caching systems using Redis and Aerospike Design scalable, secure cloud architectures on AWS and OpenStack Automate system processes with Bash, Python, or other scripting languages Ensure high availability and performance through HA/load balancing strategies Maintain and optimize Linux-based infrastructure (Ubuntu, Fedora, CentOS) Work with application stacks including LAMP, Nginx, HAProxy, OpenResty Configure and troubleshoot DNS, TCP/IP, firewalls, and network protocols Manage databases including MySQL, MongoDB, Cassandra, and SQL-based systems Monitor infrastructure health using tools such as Nagios, Zabbix, Cacti, and Ganglia Support version control and CI/CD pipelines using Git/SVN Develop infrastructure code and automation scripts for seamless deployments Your Toolkit Core Skills : 4+ years of hands-on DevOps or system engineering experience In-depth Linux/Unix administration experience Strong expertise with Kubernetes (must-have) and Docker Solid experience with AWS services and OpenStack cloud infrastructure Proficiency in at least two scripting/programming languages (Python, Bash, Perl, Ruby, or PHP) Understanding of Linux kernel subsystems (memory, storage, network) Networking fundamentals : DNS, TCP/UDP, routing, load balancing 
Experience with automation/configuration management tools (Ansible, Chef, Puppet) Familiarity with monitoring tools (Nagios, Zabbix, Cacti, Ganglia) Database knowledge: MySQL, MongoDB, Cassandra, SQL/RDBMS Bonus Points Experience managing infrastructure in FinTech or banking domains Certifications: AWS Certified DevOps Engineer, Google Professional DevOps Engineer Certifications in tools like Certified Kubernetes Administrator or Terraform Associate Experience with edge caching technologies and high-performance environments Strong debugging, analytical, and problem-solving skills Why Join Us Work on mission-critical systems that power high-growth businesses Join a collaborative, forward-thinking DevOps culture Opportunity to implement and own end-to-end infrastructure strategies Competitive compensation and flexible work environment Contribute to a scalable, secure, and modern cloud-first engineering practice

Posted 1 month ago


7.0 years

0 Lacs

Bhubaneswar, Odisha, India

Remote

About ServerHub: At ServerHub, we power businesses with high-performance cloud and hosting solutions. Our mission is to provide customers with reliable, scalable, and secure infrastructure worldwide. As an L3 Linux System Engineer, you will be at the forefront of our operations, ensuring our hosting platforms are optimized, secure, and always online. Job Responsibilities: 🔹 Escalation & Troubleshooting: Act as the final escalation point (L3) for complex server, hosting, and network-related issues Diagnose and resolve critical system failures, network outages, and performance issues 🔹 Linux Server Administration: Manage, optimize, and secure Linux-based hosting environments (CentOS, Ubuntu, RHEL) Administer and fine-tune web servers (Apache, Nginx, LiteSpeed), databases (MySQL, PostgreSQL), and caching layers (Redis, Memcached) Build and deploy servers from the ground up, ensuring optimal configurations for performance and security 🔹 Automation & DevOps: Develop and maintain automation scripts (Bash, Python, Perl) for server provisioning and configuration management Utilize Ansible, Terraform, or similar tools for automating infrastructure deployments 🔹 Cloud & Virtualization: Deploy and manage KVM, OpenStack, VMware, or containerized environments (Docker, Kubernetes) Support cloud-based hosting solutions (AWS, Google Cloud, Azure) 🔹 Security & Compliance: Implement security best practices, including firewall rules, SELinux/AppArmor, IDS/IPS Perform vulnerability assessments and patch management to secure customer environments 🔹 Monitoring & Incident Response: Set up and manage monitoring tools like Prometheus, Grafana, Zabbix, Nagios Participate in 24/7 on-call rotation for urgent system issues 🔹 Collaboration & Mentoring: Work closely with NOC, DevOps, and Engineering teams to ensure smooth operations Provide guidance and mentorship to L1 & L2 support engineers Requirements: ✔ 7+ years of hands-on Linux system administration experience in a web hosting or cloud 
environment. ✔ Expert knowledge of web hosting technologies (cPanel, Plesk, WHM, LAMP/LEMP stacks). ✔ Strong scripting ability in Perl, Bash, and Python. ✔ Experience with configuration management tools (Ansible, Puppet, Chef). ✔ Networking expertise – understanding of TCP/IP, DNS, VPN, Firewalls, Load Balancing. ✔ Familiarity with RAID, SAN, NAS, and distributed storage systems. ✔ Experience working in a 24/7 production environment with on-call duties. ✔ Ability to build and configure servers from the ground up. ✔ Certifications such as RHCE, AWS Certified SysOps Administrator, or Kubernetes (CKA) are a plus. What ServerHub Offers: 🚀 A fast-paced, innovative environment in a growing cloud hosting company. 💻 Cutting-edge technologies and challenging projects. 📈 Career growth opportunities and professional development. 🎯 Paid leaves and fully remote set-up 🔹 Join ServerHub and be part of a team that keeps the internet running! Apply today!

Posted 1 month ago


5.0 - 8.0 years

5 - 9 Lacs

Bengaluru

Work from Office

Network design, systems engineering, and network security experience is mandatory. Hands-on experience with network management tools (Zabbix, SolarWinds, etc.) and traffic analysis tools (e.g., Wireshark, iperf). CCNA/CCNP is preferable. Work Experience: Bachelor’s degree in Network and/or Telecoms Engineering, with 5 to 8 years of computer network and deployment experience.

Posted 1 month ago


4.0 years

0 - 0 Lacs

Gurgaon

On-site

Job Title: DevOps Engineer Experience: Minimum 4 years Salary: upto 38000/- per month Shift: 10 AM to 7 PM (Monday to Friday) Location: Magnum Galaxy Tower-1, Sector-58, Gurgaon - 122011 Job Responsibilities and Required Skills Manage and configure physical, dedicated, VPS, or VDS servers (e.g., DigitalOcean, Bluehost, Hostinger). Install and configure LAMP stack (Linux, Apache, MySQL, PHP). Perform server performance tuning and optimization. Set up and manage GitLab with Jenkins for CI/CD pipelines. Configure load balancers for efficient traffic distribution. Handle database setup and replication (e.g., MySQL, PostgreSQL). Manage site management platforms (e.g., cPanel, CWP, Webmin). Implement server security measures and continuous data protection. Automate server tasks using tools like Ansible, Puppet, or Bash scripting. Configure monitoring systems (e.g., Nagios, Zabbix, Prometheus). Manage network settings, including Squid, OpenVPN, and proxy configurations. Set up and manage server firewalls (e.g., iptables, UFW). Install and configure applications like WordPress, Laravel, and Magento. Demonstrate expertise in server setup, maintenance, and troubleshooting. Knowledge of containerization tools like Docker for application deployment. Familiarity with backup and disaster recovery solutions. Experience with log management and analysis tools (e.g., ELK Stack, Splunk). Requirements 4+ years of experience with physical/dedicated/VPS server environments. Strong knowledge of Linux-based systems and server management tools. Proficiency in automation, monitoring, and security best practices. Ability to work independently and collaboratively to ensure reliable server operations. 
Eligible candidates can send updated resumes to hr@cosmoindia.in or via WhatsApp 9953690702 Job Types: Full-time, Permanent Pay: ₹30,000.00 - ₹38,000.00 per month Benefits: Leave encashment Paid time off Schedule: Day shift Supplemental Pay: Yearly bonus Ability to commute/relocate: Gurugram, Haryana: Reliably commute or planning to relocate before starting work (Required) Application Question(s): Are you available for interview in person? Education: Bachelor's (Required) Experience: DevOps: 4 years (Required) GitLab: 4 years (Required) Jenkins: 4 years (Required) LAMP stack: 4 years (Required) VPS or VDS servers: 4 years (Required) Physical servers: 4 years (Required) Dedicated servers: 4 years (Required) Linux: 4 years (Required) Location: Gurugram, Haryana (Required) Work Location: In person Speak with the employer +91 9953692702 Expected Start Date: 01/07/2025

Posted 1 month ago


5.0 years

0 Lacs

Ahmedabad

On-site

Job Summary: We are seeking a highly skilled Infrastructure Monitoring Engineer to join our dynamic IT operations team. This role focuses on proactive monitoring, incident management, and performance optimization of our critical infrastructure systems, ensuring high availability and reliability. The ideal candidate will have strong technical expertise, problem-solving skills, and a proactive approach to infrastructure monitoring. Key Responsibilities: Must Have Skills: Windows and Linux knowledge along with at least 5 Years of experience in monitoring infrastructure devices. Working experience of Logic Monitor/SolarWinds. Good To Have Skills: Zabbix/Nagios/Nagios XI tool experience or scripting knowledge Monitoring & Incident Management: Monitor infrastructure components (servers, networks, databases, cloud environments) using industry-standard tools. Identify, diagnose, and resolve infrastructure issues efficiently. Escalate complex issues to L3 or appropriate teams while maintaining clear communication. Vendor co-ordination Performance Tuning & Optimization: Analyze system performance metrics and recommend improvements. Implement proactive measures to prevent recurring issues. Tool Management: Manage and configure monitoring tools such as Logic Monitor, SolarWinds, Zabbix, Nagios or similar. Customize alerts and dashboards to optimize incident detection. Monitoring Tool Integration with ServiceNow and other ITSM Tool Documentation & Reporting: Maintain detailed documentation of incidents, procedures, and system configurations. Provide regular reports on infrastructure health, incidents, and system performance. Collaboration & Communication: Work closely with Windows, Linux, DevOps, Network, and Security teams to ensure seamless operations. Participate in root cause analysis (RCA) for major incidents and suggest preventive actions. Candidate Requirements: Education: Bachelor’s degree in computer science, Information Technology, or a related field. 
Experience: 3-5 years of experience in infrastructure monitoring, IT operations, or a similar role. Technical Proficiency: Strong knowledge of Linux/Unix and Windows operating systems. Familiarity with cloud platforms (AWS, Azure, GCP) is a plus. Experience with scripting languages like Python, Bash, or PowerShell for automation. Understanding of networking concepts, TCP/IP, DNS, DHCP, VPNs, etc. Proficiency with monitoring tools (Logic Monitor, SolarWinds, Zabbix, Nagios, etc.). Certifications: Certification in any infrastructure monitoring tool is an added advantage. Job Types: Full-time, Permanent. Benefits: Provident Fund. Schedule: Rotational shift. Ability to commute/relocate: Ahmedabad, Gujarat: Reliably commute or planning to relocate before starting work (Preferred). Application Question(s): Notice Period? Experience: monitoring infrastructure: 5 years (Required). Work Location: In person.

Posted 1 month ago


5.0 years

16 - 18 Lacs

Vadodara

On-site

Experience 5+ Infrastructure Monitoring Engineer Location: Ahmedabad/Vadodara Job Type: Full Time / Onsite Department: IT Infrastructure Shift: Rotational Shift Job Summary: We are seeking a highly skilled Infrastructure Monitoring Engineer to join our dynamic IT operations team. This role focuses on proactive monitoring, incident management, and performance optimization of our critical infrastructure systems, ensuring high availability and reliability. The ideal candidate will have strong technical expertise, problem-solving skills, and a proactive approach to infrastructure monitoring. Key Responsibilities: Must Have Skills: Windows and Linux knowledge along with at least 5 Years of experience in monitoring infrastructure devices. Working experience of Logic Monitor/SolarWinds. Good To Have Skills: Zabbix/Nagios/Nagios XI tool experience or scripting knowledge Monitoring & Incident Management: Monitor infrastructure components (servers, networks, databases, cloud environments) using industry-standard tools. Identify, diagnose, and resolve infrastructure issues efficiently. Escalate complex issues to L3 or appropriate teams while maintaining clear communication. Vendor co-ordination Performance Tuning & Optimization: Analyze system performance metrics and recommend improvements. Implement proactive measures to prevent recurring issues. Tool Management: Manage and configure monitoring tools such as Logic Monitor, SolarWinds, Zabbix, Nagios or similar. Customize alerts and dashboards to optimize incident detection. Monitoring Tool Integration with ServiceNow and other ITSM Tool Documentation & Reporting: Maintain detailed documentation of incidents, procedures, and system configurations. Provide regular reports on infrastructure health, incidents, and system performance. Collaboration & Communication: Work closely with Windows, Linux, DevOps, Network, and Security teams to ensure seamless operations. 
Participate in root cause analysis (RCA) for major incidents and suggest preventive actions. Candidate Requirements: Education: Bachelor’s degree in computer science, Information Technology, or a related field. Experience: 3-5 years of experience in infrastructure monitoring, IT operations, or a similar role. Technical Proficiency: Strong knowledge of Linux/Unix and Windows operating systems. Familiarity with cloud platforms (AWS, Azure, GCP) is a plus. Experience with scripting languages like Python, Bash, or PowerShell for automation. Understanding of networking concepts, TCP/IP, DNS, DHCP, VPNs, etc. Proficiency with monitoring tools (Logic Monitor, SolarWinds, Zabbix, Nagios, etc.). Certifications: Certification in any infrastructure monitoring tool is an added advantage. Job Category: Infrastructure Monitoring Engineer. Job Type: Full-time. Pay: ₹1,600,000.00 - ₹1,800,000.00 per year. Work Location: In person.

Posted 1 month ago


3.0 - 6.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Role Overview and Responsibilities Dhruva Space is seeking a highly skilled and experienced Network Security Engineer to join our dynamic and innovative team. As a leading space technology company, we specialize in developing cutting-edge solutions for the evolving needs of the space industry. The Network Security Engineer will play a pivotal role in designing, implementing, and maintaining our network and security infrastructure, ensuring seamless performance, robust reliability, and comprehensive protection against potential threats. This position demands technical expertise, a proactive approach to problem-solving, and a commitment to upholding the highest standards of cybersecurity within our operations. Key responsibilities include but are not limited to: Design, implement, and maintain complex network infrastructures, including LAN, WAN, and wireless networks. Configure and manage network devices such as routers, switches, firewalls, and load balancers. Troubleshoot network issues and implement solutions to minimize downtime. Deploy and manage network security solutions, including firewalls, intrusion detection/prevention systems (IDS/IPS), and VPNs. Conduct security assessments and vulnerability scans to identify and mitigate potential threats. Develop and enforce security policies, procedures, and best practices. Configure and maintain network protocols (TCP/IP, DHCP, DNS, VLANs) and secure file transfer protocols (SFTP, SSH). Monitor network performance and optimize resource utilization. Diagnose and resolve complex network and security issues. Provide technical support to end-users and stakeholders. Stay updated with the latest advancements in network and security technologies. Candidate Requirements: Diploma or Bachelor's degree in Computer Science, Engineering, or a related field. 3-6 years of hands-on experience in network and security engineering. Strong knowledge of network protocols (TCP/IP, UDP, HTTP, HTTPS, FTP, SFTP, VLAN, DHCP). 
Expertise in configuring and managing network devices (routers, switches, firewalls, load balancers). Proficiency in network security technologies (IDS/IPS, VPN, firewall management). Experience with network monitoring tools (Nagios, Zabbix, SolarWinds, or similar). Strong troubleshooting and problem-solving skills. Excellent communication and interpersonal skills. Certifications such as CCNA, CCNP (preferred). Experience with cloud-based network solutions (AWS, Azure, GCP) is an advantage. Familiarity with automation tools (Ansible, Puppet, Chef) is a plus.

Posted 1 month ago


5.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

L2 NOC Engineer – Noida Seeking skilled professionals with 2–5 years’ experience in advanced network support and 24x7 operations. If you're passionate about root cause analysis, incident escalation, and performance tuning across complex infrastructure, this is your arena. 🔍 Key Highlights • Hands-on with routers, switches, firewalls, VPNs • Deep dive into TCP/IP, BGP, OSPF, MPLS, VLANs • Tools like SolarWinds, Zabbix, PRTG, and scripting in Python/Bash • Strong command of security, compliance, and RCA documentation • Certifications: CCNP, ITIL, CEH (preferred) Join a dynamic environment that values ownership, precision, and continuous improvement. 📩 DM to apply or share with someone who fits the role!

Posted 1 month ago


4.0 years

0 Lacs

Gurugram, Haryana

On-site

Job Title: DevOps Engineer Experience: Minimum 4 years Salary: upto 38000/- per month Shift: 10 AM to 7 PM (Monday to Friday) Location: Magnum Galaxy Tower-1, Sector-58, Gurgaon - 122011 Job Responsibilities and Required Skills Manage and configure physical, dedicated, VPS, or VDS servers (e.g., DigitalOcean, Bluehost, Hostinger). Install and configure LAMP stack (Linux, Apache, MySQL, PHP). Perform server performance tuning and optimization. Set up and manage GitLab with Jenkins for CI/CD pipelines. Configure load balancers for efficient traffic distribution. Handle database setup and replication (e.g., MySQL, PostgreSQL). Manage site management platforms (e.g., cPanel, CWP, Webmin). Implement server security measures and continuous data protection. Automate server tasks using tools like Ansible, Puppet, or Bash scripting. Configure monitoring systems (e.g., Nagios, Zabbix, Prometheus). Manage network settings, including Squid, OpenVPN, and proxy configurations. Set up and manage server firewalls (e.g., iptables, UFW). Install and configure applications like WordPress, Laravel, and Magento. Demonstrate expertise in server setup, maintenance, and troubleshooting. Knowledge of containerization tools like Docker for application deployment. Familiarity with backup and disaster recovery solutions. Experience with log management and analysis tools (e.g., ELK Stack, Splunk). Requirements 4+ years of experience with physical/dedicated/VPS server environments. Strong knowledge of Linux-based systems and server management tools. Proficiency in automation, monitoring, and security best practices. Ability to work independently and collaboratively to ensure reliable server operations. 
Eligible candidates can send updated resumes to hr@cosmoindia.in or via WhatsApp 9953690702 Job Types: Full-time, Permanent Pay: ₹30,000.00 - ₹38,000.00 per month Benefits: Leave encashment Paid time off Schedule: Day shift Supplemental Pay: Yearly bonus Ability to commute/relocate: Gurugram, Haryana: Reliably commute or planning to relocate before starting work (Required) Application Question(s): Are you available for interview in person? Education: Bachelor's (Required) Experience: DevOps: 4 years (Required) GitLab: 4 years (Required) Jenkins: 4 years (Required) LAMP stack: 4 years (Required) VPS or VDS servers: 4 years (Required) Physical servers: 4 years (Required) Dedicated servers: 4 years (Required) Linux: 4 years (Required) Location: Gurugram, Haryana (Required) Work Location: In person Speak with the employer +91 9953692702 Expected Start Date: 01/07/2025

Posted 1 month ago

0.0 - 5.0 years

0 Lacs

Ahmedabad, Gujarat

On-site

Job Summary: We are seeking a highly skilled Infrastructure Monitoring Engineer to join our dynamic IT operations team. This role focuses on proactive monitoring, incident management, and performance optimization of our critical infrastructure systems, ensuring high availability and reliability. The ideal candidate will have strong technical expertise, problem-solving skills, and a proactive approach to infrastructure monitoring. Key Responsibilities: Must Have Skills: Windows and Linux knowledge along with at least 5 Years of experience in monitoring infrastructure devices. Working experience of Logic Monitor/SolarWinds. Good To Have Skills: Zabbix/Nagios/Nagios XI tool experience or scripting knowledge Monitoring & Incident Management: Monitor infrastructure components (servers, networks, databases, cloud environments) using industry-standard tools. Identify, diagnose, and resolve infrastructure issues efficiently. Escalate complex issues to L3 or appropriate teams while maintaining clear communication. Vendor co-ordination Performance Tuning & Optimization: Analyze system performance metrics and recommend improvements. Implement proactive measures to prevent recurring issues. Tool Management: Manage and configure monitoring tools such as Logic Monitor, SolarWinds, Zabbix, Nagios or similar. Customize alerts and dashboards to optimize incident detection. Monitoring Tool Integration with ServiceNow and other ITSM Tool Documentation & Reporting: Maintain detailed documentation of incidents, procedures, and system configurations. Provide regular reports on infrastructure health, incidents, and system performance. Collaboration & Communication: Work closely with Windows, Linux, DevOps, Network, and Security teams to ensure seamless operations. Participate in root cause analysis (RCA) for major incidents and suggest preventive actions. Candidate Requirements: Education: Bachelor’s degree in computer science, Information Technology, or a related field. 
Experience: 3-5 years of experience in infrastructure monitoring, IT operations, or a similar role. Technical Proficiency: Strong knowledge of Linux/Unix and Windows operating systems. Familiarity with cloud platforms (AWS, Azure, GCP) is a plus. Experience with scripting languages like Python, Bash, or PowerShell for automation. Understanding of networking concepts, TCP/IP, DNS, DHCP, VPNs, etc. Proficiency with monitoring tools (Logic Monitor, SolarWinds, Zabbix, Nagios etc.). Certifications: Any certification for Infrastructure monitoring tool will have an added advantage Job Types: Full-time, Permanent Benefits: Provident Fund Schedule: Rotational shift Ability to commute/relocate: Ahmedabad, Gujarat: Reliably commute or planning to relocate before starting work (Preferred) Application Question(s): Notice Period? Experience: monitoring infrastructure: 5 years (Required) Work Location: In person

Posted 1 month ago

5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Kyndryl Software Engineering Bengaluru, Karnataka, India Posted on Jun 18, 2025 Apply now Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward – always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The Role Are you passionate about solving complex problems? Do you thrive in a fast-paced environment? Then there’s a good chance you will love being a part of our Software Engineering – Development team at Kyndryl, where you will be able to see the immediate value of your work. Your Responsibilities Learning about existing processes and creating new ways to solve them with automation and/or updating processes Working independently and collaboratively on projects, and doing periodic demonstrations for stakeholders Evaluating new technologies and services for the environment Participating in a daily standup and team meetings Collaborating with the team on projects and day-to-day work Updating and writing documentation for our services and standard operating procedures Researching architectural performance and risks Establish monitoring around performance and risk vectors Develop automated issue response / prevention mechanisms Participating in an on-call rotation Your Future at Kyndryl The career path ahead is full of exciting opportunities to grow and advance within the job family. With dedication and hard work, you can climb the ladder to higher bands, achieving coveted positions such as Principal Engineer or Vice President of Software. These roles not only offer the chance to inspire and innovate, but also bring with them a sense of pride and accomplishment for having reached the pinnacle of your career in the software industry. Who You Are You’re good at what you do and possess the required experience to prove it. 
However, equally as important – you have a growth mindset; keen to drive your own personal and professional development. You are customer-focused – someone who prioritizes customer success in their work. And finally, you’re open and borderless – naturally inclusive in how you work with others. Required Technical And Professional Experience Minimum 5 years of experience Programming – Ruby and Python API and 3rd party integrations Solid understanding of infrastructure concepts, including server hardware (x86 and/or IBM Power), storage, containerization (Docker/Kubernetes), and cloud platforms (Azure/AWS/GCP) Version Control Systems (e.g., Git and GitHub/GitLab) CI/CD tools (e.g., GitHub Actions or Jenkins) Monitoring tools (e.g., Zabbix or New Relic) Visualization tools (e.g., Grafana or Power BI) Working with metrics technologies (RRDtool, Ganglia, InfluxDB) Configuration automation and deployment orchestration (e.g., Ansible, Puppet, Chef) Linux systems administration Strong scripting (UNIX shell, Bash) ability Strong experience wrangling Linux systems Deploying software using containers (Docker, Kubernetes) Experience with machine learning and AI is a plus Monitoring systems programmatically and taking action to resolve issues Working in an agile environment Solving problems using automation, tools, and scripts Systems and server lifecycle management Building and modifying collections of automation scripts and tools Break/fix and ticket-based work Preferred Technical And Professional Experience Bachelor's degree in computer science, related technical field, or equivalent practical experience Experience with DevOps tools and modern engineering practices Being You Diversity is a whole lot more than what we look like or where we come from, it’s how we think and who we are. We welcome people of all cultures, backgrounds, and experiences. 
But we’re not doing it single-handedly: Our Kyndryl Inclusion Networks are only one of many ways we create a workplace where all Kyndryls can find and provide support and advice. This dedication to welcoming everyone into our company means that Kyndryl gives you – and everyone next to you – the ability to bring your whole self to work, individually and collectively, and support the activation of our equitable culture. That’s the Kyndryl Way. What You Can Expect With state-of-the-art resources and Fortune 100 clients, every day is an opportunity to innovate, build new capabilities, new relationships, new processes, and new value. Kyndryl cares about your well-being and prides itself on offering benefits that give you choice, reflect the diversity of our employees and support you and your family through the moments that matter – wherever you are in your life journey. Our employee learning programs give you access to the best learning in the industry to receive certifications, including Microsoft, Google, Amazon, Skillsoft, and many more. Through our company-wide volunteering and giving platform, you can donate, start fundraisers, volunteer, and search over 2 million non-profit organizations. At Kyndryl, we invest heavily in you, we want you to succeed so that together, we will all succeed. Get Referred! If you know someone that works at Kyndryl, when asked ‘How Did You Hear About Us’ during the application process, select ‘Employee Referral’ and enter your contact's Kyndryl email address.

Posted 1 month ago

6.0 - 10.0 years

5 - 9 Lacs

Mumbai

Work from Office

We are looking for a Splunk Admin/Developer to help with the onboarding, deployment & support of Splunk Infrastructure & applications. This role will join Cognitive & Robotics (Automation Development Centre) team under EAF (Enterprise Automation Fabrics) Business unit that is responsible for managing the global Tools Splunk infrastructure. This is a strategic position and will be instrumental in the design, implementation, support, performance, and integrity of the Splunk ecosystem. You will work closely with multiple stakeholders and global partners. This is a multi-disciplinary role that will interact directly with developers and different functional IT, Security and Engineering teams to gather requirements, architect solutions and ensure the Splunk platform is leveraged as a key data collection. Primary Skills Splunk administration experience Managing Splunk on-premises core infrastructure Experience with Splunk App and addon installation & upgrades Splunk Knowledge Object Management experience Expertise in Splunk Search Language (SPL) Linux administration experience Premium Splunk apps IT Service Intelligence expertise System integration experience using web services (SOAP, REST, JSON) Experience with UNIX shell scripting or Python Secondary Skills Knowledge of APM/Monitoring tools like Zabbix, Centreon, etc. Problem-solving skills and ability to work independently Team mentoring and leadership skills Result-oriented mindset with strong prioritization skills Experience in a global support model with 24x7 functionalities

Posted 1 month ago

3.0 - 7.0 years

2 - 6 Lacs

Bengaluru

Work from Office

Zabbix + Python: Proficiency in the Zabbix tool, which includes: Create and configure hosts, host groups, templates, items, triggers, and actions within Zabbix to monitor various devices and servers. Install and configure Zabbix agent and proxy on VMs. Understanding of networking concepts and protocols (TCP, UDP, IP, ICMP, DNS, SNMP v2/v3, etc.) for integrations. Knowledge of SNMP MIB files and OIDs. Good knowledge of Zabbix API methods and Zabbix backend DB schema. Good troubleshooting and analytical skills. Proficiency in Python/Shell scripting to create utilities based on the Zabbix DB or API calls. Good knowledge of database technologies such as MySQL and MariaDB. Linux OS knowledge and cron job scheduling. Domain knowledge of network and infra devices. Experience in API-based monitoring. Primary Skills Perl/Shell Scripting Familiarity with other monitoring and logging tools like Grafana, Prometheus etc. Good understanding of containerized architecture like Docker, Kubernetes, OpenShift etc.

Posted 1 month ago

1.0 - 3.0 years

3 - 7 Lacs

Noida

Work from Office

About the Role: As a TechOps Engineer you will troubleshoot, debug, evaluate and resolve customer impacting issues with a focus on detecting patterns and working with the engineering development and/or product teams to eliminate defects. The position requires a combination of strong troubleshooting, technical, communication and problem solving skills. This job requires you to constantly hit the ground running and your ability to learn quickly and work on disparate and overlapping tasks will define your success. Key Responsibilities • Deployment of new releases and environments for applications. • Responding to emails and incident tickets, maintaining issue ownership. • Build and maintain highly scalable, large scale deployments globally • Co-Create and maintain architecture for 100% uptime. E.g. creating alternate connectivity. • Practice sustainable incident response/management and blameless post-mortems. • Monitor and maintain production environment stability. • Perform production support activities which involve the assignment of issues and issue analysis and resolution within the specified SLAs. • Coordinate with the Application Development Team to resolve issues on production. • Suggest fixes to complex issues by doing a thorough analysis of root cause and impact of the defect. • Provide daily support with a resolution of escalated tickets and act as a liaison to business and technical leads to ensure issues are resolved in a timely manner. • Technical hands-on troubleshooting, including parsing logs and following stack traces. • Multitask efficiently, handling multiple customer requests from various sources. • Identifying and documenting technical problems, ensuring timely resolution. • Prioritize workload, providing timely and accurate resolutions. • Should be highly collaborative with the team, and other stakeholders. Experience and Skills: • Self-motivated, with the ability to multitask efficiently. 
• Experience executing database queries in any of MySQL, Postgres, or MongoDB • Basic Linux OS knowledge • Hands-on experience with Shell/UNIX commands. • Experience with monitoring tools like Grafana and logging tools like ELK. • REST API working experience: executing curl, analysing requests and responses, HTTP codes etc. • Knowledge of incident and escalation practices. • Ability to troubleshoot issues and handle different types of customer inquiries. • Should have worked with incident management tools like ServiceNow.

Posted 1 month ago

5.0 years

0 - 0 Lacs

Hyderābād

On-site

Role: Senior Linux Admin Experience: 5-15 years Job Type: Full-time Location: Hyderabad, India Job Title: System Admin/Linux Admin - Linux CentOS / Rocky Linux / Alma Linux, Ansible · Good verbal and written communication skills to be able to clearly & effectively communicate with clients based out of the US / Europe / India. · Should possess good knowledge of Linux administration to be able to tackle L1 & L2 tickets. Hands-on and recent experience in installing, configuring, and maintaining RHEL or RHEL-based distros like CentOS / Rocky Linux / Alma Linux is a must. · Should possess good knowledge of & hands-on experience in setting up local file systems (ext4, XFS etc), shared file systems (NFS etc) and maintaining them, including troubleshooting all the issues related to them. Should possess good knowledge of Linux file permissions. · Should possess good knowledge of Linux networking, including setting up network interfaces, routing, firewall (iptables) for various scenarios and be able to troubleshoot any network issues at OS level. · Should have good knowledge of Linux / Unix user management, local / LDAP / NIS etc and how to integrate external user databases like AD with Linux. · Should possess good knowledge of & hands-on experience in Ansible & scripting (Bash or Python). · Should have hands-on experience with any monitoring tools like Zabbix, Prometheus, SolarWinds etc and ticketing tools like JIRA, GitHub / GitLab issues etc. · Should possess the ability to understand new concepts & technologies quickly and should also have good troubleshooting & analyzing skills to work with minimum supervision. · Should possess a basic understanding of version control systems like Git & Git repo hosting websites like GitHub. And should also have a basic understanding of DevOps concepts & DevOps tools like Jenkins / GitLab. 
· Should possess at least a basic knowledge & some exposure at workplace to at least one of the major cloud service providers (AWS / Azure / GCP). · Should be willing to explore & learn new technologies as per the projects' requirements and come up with suggestions / solutions for the issues at hand. · Should be willing to work flexible hours & contribute to the improvement of processes within the team and to be a team player. Job Type: Full-time Pay: ₹9,646.90 - ₹46,622.88 per month Experience: AWS: 5 years (Preferred) Redhat: 5 years (Preferred) Work Location: In person Speak with the employer +91 7799816682

Posted 1 month ago

3.0 years

0 Lacs

Navi Mumbai, Maharashtra, India

On-site

Senior Engineer - Global Technical Assistance Center Company: Alepo Technologies Inc. Department: GTAC (Global Technical Assistance Centre) Location: Navi Mumbai, India Employment Type: Full-time Experience Level: Senior Individual Contributor Years of Experience: 3-6 years Company Overview Alepo makes next-generation Gen AI transformation opportunities for telcos a reality, delivering advanced software solutions and services that enable communications service providers to accelerate revenue growth, market share, and business success on fixed and mobile networks. Alepo helps accelerate digital enablement for networks of all sizes, including leading service providers globally. Known as the go-to partner for all things data, Alepo’s innovations are highly scalable and cloud-agnostic, enabling digital-first customer experiences. Alepo is based in Austin, Texas, with a presence in all regions of the world. Alepo was founded by internet pioneers and has grown from powering some of the first ISPs, to some of the first LTE implementations, and now leading the drive to 5G. We maintain a unique project success record by combining our delivery and software development teams, who work together to meet your needs. We extensively utilize modern frameworks, microservices, open standards, and virtualization technologies. Coupled with a customer-first approach, we can facilitate complex projects, provide functionality that exceeds market standards, and remain competitively priced. Alepo is a proud member of TM Forum, collaborating with global telecom leaders to drive innovation, enable seamless interoperability, and accelerate digital transformation. Position Summary We are seeking a Senior Support Engineer to join our Global Technical Assistance Center team. The successful candidate will provide advanced technical support for telecommunications products, independently handle complex troubleshooting scenarios, and drive resolutions while maintaining exceptional service levels. 
This role requires deep technical expertise in telecommunications systems, scripting, and analytical problem-solving. Key Responsibilities Advanced Technical Support (70%) Provide senior-level technical support via phone, email, chat, and support portal Independently troubleshoot and resolve complex technical issues for telecommunications products, using AI tools. Perform advanced root cause analysis and incident management, using AI tools. Configure Alepo products in production, staging, and lab environments. Handle critical alerts and escalated tickets with minimal supervision. Install and deploy patches in coordination with R&D team following PAR guidelines. Achieve 90% closure rate for assigned tickets, alerts, and patch deployments within SLA Maintain maximum 3 wrong escalations to R&D Support annually Customer Relationship Management (20%) Create accurate incident reports and root cause analysis documents within SLA Manage third-party integration L1 issues independently Communicate and resolve vendor issues within SOW and OLA requirements Participating in customer service review meetings and drumbeat calls Generate comprehensive technical reports and documentation Technical Leadership and Automation (10%) Write automation scripts using prompt engineering, Shell, SQL, Java, Perl, Lua, and Bash Serve as Subject Matter Expert (SME) for minimum 2 Alepo product modules Perform a minimum of 2 AI based tasks which will help the GTAC team to evolve with customer success. Contribute a minimum of 24 knowledge base articles annually. Provide technical training and mentoring to junior team members. 
Install, configure, and customize open-source tools Required Qualifications Education Bachelor’s degree in computer science, Computer Engineering, Electronics, or Telecommunications from accredited institutions Experience Requirements 3-6 years’ experience in technical support or application support roles 2+ years of telecommunications industry experience Proven experience handling alerts, server health monitoring, and troubleshooting Demonstrated ability to handle L1 third-party integration issues independently Technical Skills - Must Have Programming and Scripting: Advanced Java programming and scripting AI Prompt Engineering Perl scripting proficiency Lua scripting experience Bash scripting expertise Shell script development and automation SQL script writing and optimization Database Technologies: MySQL database administration and troubleshooting Oracle database management and optimization Relational database concepts and performance tuning Database monitoring and maintenance Operating Systems: Linux system administration Linux command line proficiency System monitoring and troubleshooting Performance optimization and tuning Telecommunications Technologies: OSS/BSS systems expertise Advanced networking concepts Telecommunications protocols and standards 4G LTE, 2G/3G, WiMAX, WiFi technologies Radius and Diameter protocol knowledge AAA authentication systems CDR processing and analysis Monitoring and Management Tools: ICINGA monitoring system expertise ZABBIX monitoring and reporting PRTG traffic analysis collected system statistics monitoring JIRA issue tracking and project management Trouble ticketing system administration Standards and Processes: ISO 20000 standard implementation SLA management and compliance Incident management processes Change management procedures Service restoration protocols Advanced Technical Competencies Subject Matter Expert (SME) level knowledge in minimum 2 Alepo product modules Business configuration and system customization Log 
analysis and performance troubleshooting Vendor management and third-party integration Production deployment and patch management Essential Soft Skills Excellent verbal and written communication skills (80% minimum on internal assessment) Advanced analytical and problem-solving abilities Strong customer service orientation Leadership and mentoring capabilities Ability to work in 24x7 shift environment Cross-functional collaboration and teamwork Adaptability and resourcefulness in dynamic environments Preferred Qualifications Advanced Experience Onsite customer engagement with successful closure and sign-off Positive stakeholder feedback on hand-holding assignments Proactive issue identification from monitoring alerts Experience with telecommunications carrier environments Multi-vendor integration project experience Technical Certifications Telecommunications industry certifications Database administration certifications Linux system administration certifications ITIL or ISO 20000 certifications Performance Metrics and Success Indicators SLA and Quality Metrics 90% closure rate for assigned tickets, alerts, and patches within SLA Maximum 3 wrong escalations to R&D Support annually 100% accuracy in severity and priority assignment for critical issues 80% minimum score on communication skills assessment Zero complaints on shift handover processes Knowledge Management Minimum 24 knowledge base contributions annually Successful completion of onsite assignments with positive feedback SME certification for minimum 2 Alepo product modules Active participation in 30% of customer service review meetings Technical Excellence Demonstrated proficiency in automation script development Successful completion of patch deployments and production activities Effective vendor relationship management for third-party integrations Continuous improvement contributions to support processes Career Development Opportunities Technical leadership roles within GTAC organization Specialization in 
emerging telecommunications technologies Cross-functional project leadership opportunities International assignment and customer engagement roles Professional certification and training programs Compensation and Benefits Competitive salary package commensurate with experience Comprehensive health and medical insurance Professional development and certification support Performance-based incentives and recognition programs Flexible work arrangements and shift differentials

Posted 1 month ago

5.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Role Overview We are looking for an experienced and proactive Problem Manager to manage the problem management process in a large, high-tech enterprise. The Problem Manager will be responsible for identifying, analyzing, and resolving recurring issues by conducting Root Cause Analysis (RCA), implementing long-term fixes, and delivering training to improve operational excellence. This role also involves continuously improving the problem management process, deploying best practices across the organization, and collaborating with global teams to drive service reliability and stability. This role closely collaborates with the Problem Management Process Owner. Key Responsibilities Problem Management Process Manage the end-to-end problem management process, ensuring all problems are logged, investigated, and resolved. Establish and maintain policies and procedures for effective problem management, adhering to ITIL/ITSM best practices. Collaborate with incident and change management teams to ensure a seamless flow of information and resolution. Root Cause Analysis (RCA) Lead Root Cause Analysis (RCA) for major incidents and recurring issues to identify underlying causes. Drive the implementation of permanent solutions to prevent future occurrences of known issues. Ensure accurate and timely documentation of RCA findings, action plans, and resolutions in the problem management system. Follow up on RCA action items to ensure successful completion and closure. Training and Knowledge Sharing Develop and deliver training programs for teams to promote awareness of the problem management process and RCA methodologies. Provide coaching to technical teams on identifying and addressing recurring issues effectively. Build and maintain a knowledge base of known problems, workarounds, and solutions. Proactive Problem Identification and Prevention Analyze incident trends, system performance data, and other inputs to identify potential problems proactively. 
Work with operations and other stakeholders to implement preventative measures and improve system reliability. Develop metrics and reports to track the effectiveness of problem management efforts and identify areas for improvement. Process Improvement & Deployment Continuously evaluate and improve the problem management process to increase efficiency and effectiveness. Ensure alignment of the problem management process with business objectives and operational needs. Collaboration & Communication Collaborate with cross-functional teams, including operations and the global technical service desk, to address complex problems. Act as a central point of contact for problem management-related inquiries and escalations. Provide regular updates and reports to leadership on problem trends, root causes, and resolution progress. Qualifications Required: Proven experience (5+ years) in problem management in a large-scale high-tech enterprise environment. Strong understanding of ITIL/ITSM frameworks, with expertise in the problem management process. Demonstrated experience conducting Root Cause Analysis (RCA) and implementing long-term fixes. Familiarity with ITSM tools (e.g., ServiceNow) and data analysis tools. Excellent analytical, problem-solving, and decision-making skills. Exceptional communication and presentation skills for interacting with technical and non-technical stakeholders. Preferred ITIL v4 Certification (Intermediate or higher). Experience in automation and predictive analysis for proactive problem management. Knowledge of monitoring tools (e.g., Splunk, SolarWinds, Zabbix) and incident management systems. Experience working in agile or DevOps environments. Key Attributes Proactive and Analytical: Anticipates issues, identifies patterns, and takes initiative to address recurring problems. Collaborative Manager: Works effectively with diverse teams and drives accountability for resolving problems. 
Detail-Oriented: Ensures thorough documentation and follow-up on RCA findings. Continuous Improver: Strives to enhance processes and share knowledge across the organization. What We Offer A challenging and impactful role in a global high-tech enterprise. Opportunities to drive meaningful improvements in service reliability and operational efficiency. Competitive compensation and benefits package. Access to professional development and certification opportunities. If you’re passionate about solving complex problems, improving processes, and driving operational excellence, we invite you to join our team and make a significant impact! More information about NXP in India...

Posted 1 month ago

5.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Role Overview: We are seeking a proactive and experienced (Major) Incident Manager to oversee and manage the end-to-end incident management process in a dynamic, large-scale high-tech enterprise environment. The Incident Manager, together with a team of ITSM experts, will be responsible for handling major incidents, ensuring swift resolution, identifying root causes, and driving continuous improvements to minimize service disruptions and optimize response processes. This role demands excellent coordination skills, the ability to work under pressure, and a strong commitment to 24/7 incident resolution and process improvement.
Key Responsibilities
Incident Management: Manage the incident management lifecycle, from identification to resolution, ensuring adherence to SLAs and minimizing business impact. Manage major incidents (P1/P2) with urgency, coordinating cross-functional teams to restore services as quickly as possible. Act as the central point of communication for all stakeholders during incidents, providing regular updates on status, impact, and resolution timelines. Ensure accurate documentation of incidents, including root cause analysis (RCA) follow-up and post-incident reports.
24/7 Coverage: Together with the Operations Command Center team, provide 24/7 support for incident response, including on-call responsibilities as part of a rotational schedule. Proactively monitor high-priority services and potential risks, taking preventative action where necessary. Develop and maintain escalation procedures to ensure critical incidents receive appropriate attention.
Process Optimization & Improvement: Continuously analyze the incident management process to identify opportunities for efficiency, speed, and accuracy improvements. Collaborate with problem management teams to address recurring incidents and implement permanent solutions. Deploy process enhancements to improve metrics such as First Time Resolution and MTTR, along with KPIs and dashboards to measure incident management performance.
Collaboration & Leadership: Foster strong relationships with internal teams (Global Technical ServiceDesk, Level 2 operations, Project teams, etc.) and external vendors to ensure streamlined communication during incidents. Drive incident-related meetings, including war rooms, service reviews, and RCA sessions. Train and mentor Operations Command Center team members and stakeholders on incident management best practices.
Qualifications
Required: Proven experience (5+ years) in incident management within a large-scale, high-tech enterprise environment. Strong understanding of ITIL/ITSM frameworks and processes. Experience managing major incidents (P1/P2) and coordinating resolution efforts across multiple teams. Familiarity with monitoring tools (e.g., Splunk, SolarWinds, Zabbix) and ticketing systems (e.g., ServiceNow, Jira). Strong leadership, decision-making, and problem-solving skills, with the ability to remain calm under pressure. Exceptional communication skills for liaising with both technical and non-technical stakeholders.
Preferred: ITIL v4 Certification (Foundation or higher). Experience with cloud environments (AWS, Azure) and DevOps methodologies. Understanding of automation tools and processes for proactive incident management.
Key Attributes: Proactive Mindset: Anticipates and addresses potential issues before they escalate. Analytical Thinker: Identifies patterns in incidents and proposes systemic improvements. Team Player: Works collaboratively with diverse teams to achieve swift resolutions. Customer-Focused: Prioritizes service availability and business continuity.
What We Offer: A dynamic, fast-paced work environment in a leading high-tech enterprise. Opportunities for professional growth and certifications. Competitive salary and benefits package. Work-life balance with rotational shifts and on-call support schedules.
If you are passionate about driving efficient incident resolution and continuous improvement in a 24/7 operational environment, we invite you to apply and become a key part of our team! More information about NXP in India...

Posted 1 month ago

2.0 years

0 Lacs

Chennai, Tamil Nadu, India

Remote

Mizuho Global Services India Pvt. Ltd.
Mizuho Global Services Pvt Ltd (MGS) is a subsidiary of Mizuho Bank, Ltd, one of the largest banks, or so-called ‘Mega Banks’, of Japan. MGS was established in 2020 as part of Mizuho’s long-term strategy of creating a captive global processing centre for remotely handling banking and IT-related operations of Mizuho Bank’s domestic and overseas offices and Mizuho’s group companies across the globe. At Mizuho we are committed to a culture that is driven by ethical values and supports diversity in all its forms for its talent pool. MGS’s development is guided by its three key pillars, Mutual Respect, Discipline and Transparency, which are set as the baseline of every process and operation carried out at MGS. Know more about MGS: https://www.mizuhogroup.com/asia-pacific/mizuho-global-services
What’s in it for you?
o Immense exposure and learning
o Excellent career growth
o Company of highly passionate leaders and mentors
o Ability to build things from scratch
Position: Monitoring Analyst - Officer 1
Location: Chennai
We are seeking a proactive and detail-oriented L0 Monitoring Analyst to provide 24x7 monitoring and first-level support for IT infrastructure, applications and services. This role serves as the first line of defense in identifying and escalating system issues, ensuring the smooth and uninterrupted functioning of business-critical systems.
Key Responsibilities:
· Perform real-time monitoring of servers, network devices, applications, jobs and services using monitoring tools (e.g. JP1, Solarwinds, Nagios, Zabbix, SCOM, Appdynamics, Dynatrace, OpsRamp).
· Identify, acknowledge and log alerts, events and anomalies
· Escalate incidents to L1/L2 support teams based on defined SOPs and severity
· Perform basic health checks and validations as per runbooks
· Track and update incident tickets in ITSM tools
· Perform initial triage of alerts (e.g. service restarts, disk clean-up commands if permitted).
· Document incidents, actions taken, and resolution/escalation steps
· Communicate with on-call engineers and shift leads for follow-up or escalations
· Ensure monitoring dashboards and tools are operational and report any issues
· Provide shift-wise handover updates and contribute to incident reviews
Required Skills:
· Basic understanding of IT infrastructure (servers, OS, networks, applications)
· Hands-on experience with monitoring tools and ticketing systems is a must.
· Good communication skills and ability to follow SOPs
· Ability to work 24x7 rotational shifts (including weekends and holidays)
· Strong attention to detail and sense of urgency
· Requires candidates to Work from Office.
Preferred Qualifications:
· 2-4 years of experience in IT monitoring or helpdesk roles
· Exposure to ITIL processes (Incident, Event, Change Management)
· Certification in basic IT/Networking (e.g. CompTIA A+, ITIL Foundation, CCNA) is a plus.
Address (Chennai location): Mizuho Global Services India Private Limited, 16th Floor, Tower-B Brigade, World Trade Centre, 142, Rajiv Gandhi Salai, OMR, Perungudi, Chennai, Tamil Nadu 600096.

Posted 1 month ago

2.0 years

0 Lacs

Mumbai Metropolitan Region

On-site

Job Description
Candidate Specification: Candidate with a minimum of 2 years of experience in IT monitoring or helpdesk roles, with mandatory skills in monitoring tools and network operations center work. Candidate must have a basic understanding of IT infrastructure (servers, OS, networks, applications). Hands-on experience with monitoring tools and ticketing systems is a must. Ability to work 24x7 rotational shifts. Requires candidates to Work from Office.
Job Description: Perform real-time monitoring of servers, network devices, applications, jobs and services using monitoring tools (e.g. JP1, Solarwinds, Nagios, Zabbix, SCOM, Appdynamics, Dynatrace, OpsRamp). Identify, acknowledge and log alerts, events and anomalies. Escalate incidents to L1/L2 support teams based on defined SOPs and severity. Perform basic health checks and validations as per runbooks. Track and update incident tickets in ITSM tools. Perform initial triage of alerts (e.g. service restarts, disk clean-up commands if permitted). Document incidents, actions taken, and resolution/escalation steps. Communicate with on-call engineers and shift leads for follow-up or escalations. Ensure monitoring dashboards and tools are operational and report any issues. Provide shift-wise handover updates and contribute to incident reviews.
Skills Required
Role: L0 Monitoring Analyst - Chennai/Mumbai
Industry Type: ITES/BPO/KPO
Functional Area:
Required Education: B. Tech
Employment Type: Full Time, Permanent
Key Skills: HELP DESK, NETWORKING
Other Information
Job Code: GO/JC/328/2025
Recruiter Name: Ackshaya

Posted 1 month ago


Featured Companies