1087 jobs matched
4.0 - 5.0 years
8 - 12 Lacs
gurugram
Work from Office
Position Overview : We are seeking an SRE to join our high-impact platform engineering team. You will maintain SLAs for real-time services deployed across hybrid clouds and Kubernetes clusters, contributing to automation, observability, and availability goals. Roles and Responsibilities : - Monitor application and infrastructure metrics; build dashboards and alerts (Prometheus, Grafana, ELK). - Automate health checks, incident remediation, and reliability guardrails. - Manage on-call rotations, conduct root cause analysis, and implement postmortem action plans. - Define and track SLOs, SLIs, and error budgets. - Use chaos engineering and resilience testing to ensure fault tolerance. Must Have Skills : - 4 - 5 years of experience in managing production-grade Kubernetes clusters and cloud-native platforms. - Proficiency in Linux system internals, containers, and networking. - Scripting/automation expertise in Python/Go/Shell. - Familiarity with incident management, runbooks, and observability standards. - Exposure to service discovery, DNS routing, and load balancing is a bonus. Qualification : BE/BTech/MCA/ME/MTech/MS in Computer Science or a related technical field or equivalent practical experience.
Posted 8 hours ago
3.0 - 6.0 years
5 - 8 Lacs
noida, chennai, gurugram
Work from Office
Develop and deploy Java-based applications on Google Cloud Platform. Design and implement cloud-native solutions using GCP services (Cloud Functions, Pub/Sub, Cloud Storage, BigQuery, GKE, etc.). Optimize application performance and scalability. Write clean, testable, and efficient Java code using frameworks such as Spring Boot. Integrate third-party APIs and GCP-native services. Collaborate with DevOps to automate CI/CD pipelines and deployments. Monitor and troubleshoot production issues. Requirements and Skills: Strong programming experience in Java (8/11/17). Experience with Spring Boot or similar frameworks. Solid hands-on experience with Google Cloud Platform (GCP). Experience with GCP services like Cloud Functions, Pub/Sub, Cloud Storage, BigQuery, Cloud Run, and GKE. Familiarity with microservices architecture and REST APIs. Proficient in using CI/CD tools like Jenkins, GitLab CI, or Cloud Build. Experience with containerization (Docker) and orchestration (Kubernetes). Strong understanding of Agile methodologies. Preferred Qualifications: GCP Certification (e.g., Associate Cloud Engineer, Professional Cloud Developer). Experience with infrastructure as code (e.g., Terraform, Deployment Manager). Familiarity with logging/monitoring tools (e.g., Stackdriver, Prometheus). Experience with other backend technologies (e.g., Node.js, Python) is a plus. Soft Skills: Strong analytical and problem-solving skills. Excellent communication and collaboration skills. Ability to work independently and in a team-oriented environment.
Posted 9 hours ago
8.0 - 12.0 years
14 - 16 Lacs
bengaluru
Work from Office
Job Description: 8-12 years of experience in .NET technologies. Hands-on service design, schema design, and application integration design. Hands-on software development using C# and .NET Core. Use of multiple cloud-native database platforms including DynamoDB, SQL, ElastiCache, and others. Hands-on application design for high availability and resiliency. Hands-on problem resolution across a multi-vendor ecosystem. Conduct code reviews and peer reviews. Unit testing and unit test automation, defect resolution, and software optimization. Actively engaged with Client IT and Client Business during daily work sessions. Code deployment using CI/CD processes. Contribute to each step of the development process from ideation to implementation to release, including rapid prototyping, running A/B tests, continuous integration, automated testing, and continuous delivery. Understand business requirements and technical limitations. Ability to learn new technologies and influence the team and leadership to constantly implement modern solutions. Experience in using the Elasticsearch, Logstash, Kibana (ELK) stack for logging and analytics. Experience in container orchestration using Kubernetes. Knowledge and experience working with public cloud AWS services. Knowledge of cloud architecture and design patterns. Ability to prepare documentation for microservices. Monitoring tools such as Datadog, Logstash. Excellent communication skills. Airline industry knowledge is preferred but not required. Recruitment fraud is a scheme in which fictitious job opportunities are offered to job seekers, typically through online services such as false websites, or through unsolicited emails claiming to be from the company. These emails may request recipients to provide personal information or to make payments as part of their illegitimate recruiting process. 
DXC does not make offers of employment via social media networks, never asks applicants for any money or payments at any point in the recruitment process, and never asks a job seeker to purchase IT or other equipment on its behalf. More information on employment scams is available here.
Posted 10 hours ago
4.0 - 7.0 years
9 - 10 Lacs
bengaluru
Work from Office
Position: Performance Engineer. Experience: 4 - 5 Years. Job Overview: The candidate will be responsible for conducting performance testing of web applications and APIs, including requirements gathering, test planning, scripting and execution, performance analysis, and reporting. Required skills: 4+ years of experience in performance testing. Strong scripting skills with performance testing tools like LoadRunner, JMeter, etc. Good experience in developing scripts in LoadRunner using different protocols. Performing test runs in Performance Center and analyzing the results. Experience in cloud performance testing, monitoring, and log analysis. Experience with performance monitoring tools like Dynatrace. Web application and web service performance analysis, scripting, and testing. Good understanding of and experience working in an Agile methodology. Strong written and verbal communication skills. Good to know: JIRA, Confluence, GitLab.
Posted 10 hours ago
7.0 - 12.0 years
20 - 35 Lacs
bengaluru
Work from Office
Job Summary: We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving automation, monitoring, and incident response strategies. This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems. Experience: 7-12 years. Key Responsibilities: Reliability & Performance: Lead efforts to maintain high availability and reliability of critical services. Define and monitor SLIs, SLOs, and SLAs to ensure business requirements are met. Proactively identify and resolve performance bottlenecks and system inefficiencies. Incident Management & Response: Establish and improve incident management processes and on-call rotations. Lead incident response and root cause analysis for high-priority outages. Drive post-incident reviews and ensure actionable insights are implemented. Automation & Tooling: Develop and implement automated solutions to reduce manual operational tasks. Enhance system observability through metrics, logging, and distributed tracing tools (e.g., Prometheus, Grafana, Elastic APM). Optimize CI/CD pipelines for seamless deployments. Collaboration: Partner with software engineering teams to improve the reliability of applications and infrastructure. Work closely with product/engineering teams to design scalable and robust systems. Ensure seamless integration of monitoring and alerting systems across teams. Leadership & Team Building: Manage, mentor, and grow a team of SREs. Promote SRE best practices and foster a culture of reliability and performance across the organization. Drive performance reviews, skills development, and career progression for team members. 
Capacity Planning & Cost Optimization: Perform capacity planning and implement autoscaling solutions to handle traffic spikes. Optimize infrastructure and cloud costs while maintaining reliability and performance. Required Skills: Technical Expertise: Experience with cloud platforms (AWS/Azure/GCP) and Kubernetes. Hands-on knowledge of infrastructure-as-code tools like Terraform/Helm/Ansible. Proficiency in Java. Expertise in distributed systems, databases, and load balancing. Monitoring & Observability: Proficient with tools like Prometheus, Grafana, Elastic APM, or New Relic. Understanding of metrics-driven approaches for system monitoring and alerting. Automation & CI/CD: Hands-on experience with CI/CD pipelines (e.g., Jenkins, Azure Pipelines, etc.). Skilled in automation frameworks and tools for infrastructure and application deployments. Incident Management: Proven track record in handling incidents, post-mortems, and implementing solutions to prevent recurrence. Leadership & Communication Skills: Strong people management and leadership skills with the ability to inspire and motivate teams. Preferred Qualifications: Experience with database optimization, Kafka, or other messaging systems. Knowledge of autoscaling techniques. Previous experience in an SRE, DevOps, or infrastructure engineering leadership role. Understanding of compliance and security best practices in distributed systems.
Posted 11 hours ago
7.0 - 12.0 years
15 - 17 Lacs
noida, gurugram, delhi / ncr
Work from Office
Key Responsibilities: Manage and maintain Red Hat, CentOS, Oracle Linux, or Ubuntu systems across production and non-production environments. Expertise in troubleshooting, performance tuning, and security. Troubleshoot and resolve complex server, network, and application issues. Experience with enterprise monitoring tools (Zabbix, Nagios, etc.). Understanding of LVM, RAID, iSCSI, and networking concepts. Lead critical incident resolution and perform root cause analysis. Plan and perform kernel upgrades, OS patching, package management, and system hardening. Scripting and automation (Bash, Python, Ansible). Configure and manage system services like Apache/Nginx, SSH, FTP, DNS, NFS, LAMP, LDAP, and SMTP. Implement and monitor security compliance (e.g., CIS hardening, SELinux, UFW, firewalld, auditd). Perform backup and disaster recovery planning and execution. Participate in capacity planning, performance tuning, and system audits. Document configurations, procedures, and change management records. Good To Have: Experience with cloud environments (AWS, OCI, or Azure). Administer virtualization platforms (e.g., VMware, KVM) and cloud platforms (AWS, OCI, Azure). Networking experience.
Posted 11 hours ago
5.0 - 10.0 years
15 - 30 Lacs
bengaluru, delhi / ncr, mumbai (all areas)
Hybrid
We're Hiring: L2 Production Support – Application & System Monitoring Roles (50+ Openings | 3–12 Yrs Experience | 2 Levels & Multiple Designations) Location: Mumbai / Pune / Chandigarh / Bengaluru / Gurugram / Noida / New Delhi (WFO, Hybrid, or Remote Based on Role) Industry: Fortune 500 Client Projects (Staffing via Hatchtra Innotech Pvt. Ltd.) Employment Type: Full-Time | Contract (C2H) Rapid hiring with 50+ open positions across two levels About us: We are a leading staffing and workforce solutions company, trusted by Fortune 500 organizations and global enterprises to build their technology teams. When you join us, you’ll work directly with our Fortune 500 client teams that power mission-critical systems worldwide. About the Role We are hiring skilled L2 Production Support professionals to join a high-performance operations team responsible for real-time monitoring, issue resolution, and stability of mission-critical enterprise systems. Whether you're a mid-level engineer or a senior technical lead, this is your chance to work on global platforms, contribute to uptime and automation, and collaborate with cross-functional teams in a high-stakes production environment. Open Positions & Designations Level 1 – L2 Production Support Engineer (3–8 Yrs) Designations: Support Engineer Senior Support Engineer Application Support Analyst Application Support Specialist Ideal For: Mid-level engineers who thrive on debugging, automation, and ensuring smooth system operations. Level 2 – L2 Production Support Lead (8+ Yrs) Designations: Technical Lead – Production Support Production Support Lead Ideal For: Senior professionals ready to own production stability, lead incidents, and guide technical teams in high-availability environments. 
Key Responsibilities (Role-Based) L2 Production Support Engineer (3–8 Yrs) Monitor production systems, resolve incidents in real time Write and optimize SQL queries for issue resolution Automate recurring UNIX/Linux tasks Use tools like Splunk, Grafana, and AppDynamics for diagnostics Investigate and close L2 tickets, ensuring SLA compliance Collaborate with Dev, QA, and Infra teams during releases and issues L2 Production Support Lead (8+ Yrs) Own the resolution of critical incidents and escalations Perform root cause analysis and implement long-term fixes Debug Java-based systems and analyze logs Lead and mentor junior support engineers Oversee production stability, uptime, and performance tuning Partner with global stakeholders to drive support strategy Skills & Tools Core Technologies SQL: Querying, tuning, data investigation UNIX/Linux: Shell scripting, cron jobs, automation Java: Debugging and log analysis (Lead Level) Monitoring Tools Splunk Grafana AppDynamics Methodologies & Practices ITIL (Incident, Change, Problem Management) Escalation Management On-call & 24x7 Production Support Root Cause & Impact Analysis Qualifications Bachelor’s degree in Computer Science, Engineering, or related IT field 3–12 years of relevant experience in Production or Application Support Proficiency in SQL, UNIX/Linux; Java experience for senior roles Exposure to monitoring tools and ticketing systems Strong communication, documentation, and cross-team collaboration skills Why Join Us? Be part of a high-impact, mission-critical operations team Work with global cross-functional engineering and support groups Hands-on with modern production tooling & enterprise systems Flexible work modes – WFO, Hybrid, or Fully Remote (role-based) How to Apply For quick consideration, please email your resume and include the desired position and experience level (e.g., “Data Engineer – Mid-Level”) in the subject line.
Posted 12 hours ago
10.0 - 14.0 years
0 Lacs
hyderabad, telangana
On-site
As a member of the Group Technology and Operations (T&O) team, your primary responsibility will be to design, implement, and maintain Elasticsearch clusters to ensure an efficient and resilient infrastructure for the bank. You will work towards optimizing Elasticsearch queries for efficient data retrieval and integrating Elasticsearch with other systems like Kibana, Logstash, or custom applications. Monitoring the performance and health of Elasticsearch clusters, troubleshooting and resolving related issues, and implementing security best practices will be key aspects of your role. Collaboration with data scientists, developers, and system administrators to enhance search and analytics capabilities, as well as maintaining backups and disaster recovery processes for Elasticsearch clusters, will be essential tasks. Additionally, you will be required to work on data ingestion pipelines using tools such as Logstash, Beats, or custom scripts, and provide documentation, training, and support to end users. To excel in this role, you should possess strong experience with Elasticsearch architecture and APIs, proficiency in query DSL and scripting for Elasticsearch, and familiarity with related tools in the Elastic Stack like Kibana, Logstash, and Beats. An understanding of distributed systems and cluster management, along with experience in performance tuning and scaling Elasticsearch clusters, will be beneficial. Knowledge of JSON, RESTful APIs, and scripting languages like Python and Bash is preferred, as well as familiarity with Linux system administration and cloud platforms such as AWS, GCP, and Azure. Experience with monitoring tools like Elastic APM, Grafana, or Prometheus, and in large-scale data environments, will be advantageous. Ideally, you should hold a Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field, or possess equivalent experience. 
A minimum of 10 years of experience working with Elasticsearch or search technologies is typically required for this role.
Posted 1 day ago
5.0 - 9.0 years
0 Lacs
maharashtra
On-site
As a Cloud Managed Services Engineer (L3) at NTT DATA, you will be responsible for providing a high level of managed services to clients by proactively identifying and resolving cloud-based incidents and problems. Your role involves ensuring zero missed service level agreement (SLA) conditions, managing tickets of high complexity, and providing resolutions to a diverse range of complex problems. You will use your considerable judgment and independent analysis skills within defined policies and practices to achieve client outcomes while coaching and mentoring junior team members. Key Responsibilities: - Configure, install, test, and ensure operational infrastructure at client sites. - Monitor infrastructure, respond to alerts, and apply necessary checks. - Identify problems and errors, log incidents with required level of detail, and escalate support calls. - Investigate and identify root causes of incidents and problems, escalating to third-party vendors if necessary. - Provide onsite technical support and field engineering services to clients. - Conduct monthly reviews of incidents and service requests, analyse for quality improvement. - Proactively identify work optimization opportunities, including automation possibilities. - Manage and implement projects within the technology domain as required. - Implement and deliver Disaster Recovery functions and tests. - Perform any other related tasks as needed. Knowledge and Attributes: - Effective communication and collaboration across different cultures and social groups. - Excellent planning skills considering possible changing circumstances. - Positive outlook, ability to work well under pressure, and willingness to work hard when necessary. - Active listening techniques and adaptability to changing circumstances. - Prioritizing clients, understanding their requirements, and ensuring a positive client experience. 
Academic Qualifications and Certifications: - Bachelor's degree in Information Technology/Computing or equivalent qualification. - Relevant certifications such as VMware Certified Professional, Microsoft Certified, AWS Certified, Veeam Certified Engineer, etc. - Certifications relevant to the services provided carry additional weightage. Required Experience: - Seasoned work experience in an Engineering function within a medium to large ICT organization. - Managed services experience, excellent working knowledge of ITIL processes, and experience working with vendors and third parties. - Experience managing platforms including Windows Server Administration, Linux Server Administration, Virtualization Administration, and more. Workplace Type: - On-site Working. NTT DATA is an Equal Opportunity Employer committed to diversity and inclusion in the workplace.
Posted 1 day ago
5.0 - 9.0 years
0 Lacs
hyderabad, telangana
On-site
You will be responsible for managing and maintaining network infrastructure in our Hyderabad office located in Nanakramguda. This role requires you to work in rotational shifts. We are looking for a Network Engineer L2 with a minimum of 6-8 years of experience in network engineering. The ideal candidate should have a minimum of 5 years of experience in network engineering and hold CCNP certification (required), with JNCIA/JNCIS certification being preferred. Your technical skills should include expertise in network architecture, routing protocols (BGP, OSPF, EIGRP), VLANs, firewalls, automation using Python and Bash, as well as familiarity with monitoring tools. In addition to technical skills, we value soft skills such as strong communication, troubleshooting abilities, the capacity to work effectively in a US-based team, adaptability, and leadership qualities. You should be comfortable working in rotational shifts, including 24/7 availability. A Bachelor's Degree in Computer Science or a related field is required for this position. If you are a quick learner and have the required experience and certifications, we encourage you to apply for this critical role in our team.
Posted 1 day ago
10.0 - 14.0 years
0 Lacs
pune, maharashtra
On-site
Hashlist is a platform dedicated to projects within the automotive industry, fostering supplier relationships with automotive companies and offering a comprehensive solution for individuals interested in pursuing a career in this sector. As a Platform Architect at Hashlist, you will be responsible for conceptualizing and implementing scalable architectures tailored for connected vehicle platforms and cloud-native solutions. Your role will involve leading technical discussions, establishing data standards, and ensuring compliance with industry regulations. By collaborating with cross-functional teams, you will drive the adoption of cloud technologies, enhance system performance, and facilitate enterprise integration. This is a full-time, permanent position with a planned start date in April-May, offering the flexibility to work from either India or Germany. Freelancers are also welcome to apply. Key Responsibilities: - Design and deploy scalable architectures for connected vehicle platforms and cloud-native applications. - Spearhead technical discussions to align global architecture principles with business objectives. - Define data standards for connected vehicle systems, prioritizing security, compliance, and adherence to regional regulations. - Execute proof-of-concept projects by embracing cloud-native and hybrid solutions, while optimizing cloud expenses. - Collaborate with diverse teams to incorporate IoT, cloud, and edge technologies into the automotive ecosystem. Required Qualifications: - Bachelor's degree in Information Technology, Software Engineering, or a related field. - Over 10 years of hands-on experience in managing large-scale distributed systems and cloud platforms. - Proficiency in Kafka, Spring Boot, microservices, REST APIs, SQL/NoSQL databases, and cloud architectures. - Background in connected vehicle systems, encompassing event-driven and domain-driven design principles. 
- Sound knowledge of CI/CD pipelines, test automation frameworks, and monitoring tools. To take the next step towards joining our team, simply click "Apply." We will carefully assess your application and if deemed suitable, you will gain access to the Hashlist network for potential consideration in this and other relevant projects.
Posted 1 day ago
6.0 - 10.0 years
0 Lacs
hyderabad, telangana
On-site
You should have at least 6 years of experience working as a PostgreSQL DBA or in a similar database administration role. You will need a strong understanding of PostgreSQL architecture, installation, configuration, and management. Proficiency in SQL, PL/pgSQL, and database optimization techniques is crucial for this role. Experience with PostgreSQL replication methods such as streaming and logical replication is required. You should also possess a robust knowledge of database security practices and be well-versed in data privacy laws. Familiarity with backup and recovery techniques and tools like pg_dump and pg_restore is essential. Additionally, experience with cloud database platforms like AWS RDS for PostgreSQL and Azure Database for PostgreSQL is preferred. You should be proficient in Linux/Unix systems administration and have a good grasp of scripting languages such as Bash and Python. Knowledge of PostgreSQL on Kubernetes or containerized environments, NoSQL databases, and other relational databases like MySQL and Oracle is considered a plus. Understanding automated deployment tools like Ansible and Chef, as well as experience with monitoring tools like pgAdmin, Prometheus, and Grafana, will be beneficial in this role.
Posted 1 day ago
3.0 - 7.0 years
0 Lacs
haryana
On-site
As a skilled Oracle RAC Database Administrator, you will be responsible for installing, configuring, and managing Oracle RAC nodes on the cluster. Your role will involve monitoring the cluster health and resource utilization across all nodes, as well as managing inter-node communication and network configuration. Additionally, you will play a key part in implementing failover and disaster recovery strategies for the RAC environment. Your day-to-day responsibilities will include performing standard DBA tasks such as creating users, tablespaces, indexes, and views, and managing database objects. You will be expected to optimize database performance by analyzing query plans, tuning SQL statements, and managing cache sizes. Ensuring database security by implementing security policies and managing user privileges will also be part of your core duties. In the event of issues related to RAC-specific features like Global Enqueue, node failures, and cluster synchronization, you will be the go-to person for diagnosing and resolving them. Your expertise in analyzing Oracle alert logs and trace files will be crucial in pinpointing problems and ensuring smooth operations. Moreover, setting up monitoring tools to track database performance, resource utilization, and cluster status will fall under your purview. You will configure alerts for critical events such as node failures, database errors, and performance thresholds to ensure proactive management of the RAC environment.
Posted 1 day ago
6.0 - 10.0 years
0 Lacs
karnataka
On-site
As a Senior Network DevOps Engineer at Rakuten India, you will leverage your 6+ years of experience to design, implement, and maintain robust network infrastructure solutions that support the organization's operational needs. Your role will involve collaborating with cross-functional teams to gather requirements, assess system limitations, and provide scalable network solutions. You will be responsible for automating network provisioning, configuration, and monitoring processes to enhance operational efficiency. Additionally, you will develop and maintain CI/CD pipelines for network infrastructure, ensuring seamless deployment and integration. Your expertise will be crucial in troubleshooting and resolving complex network automation-related issues to ensure high availability and optimal performance. You will also create and maintain monitoring systems to guarantee high availability and performance for software applications. To excel in this role, you should hold a Bachelor's degree in Computer Science, Information Technology, or a related field. Your proven experience as a Network DevOps Engineer, with a focus on designing and implementing scalable and secure network solutions, will be invaluable. Strong proficiency in automation tools and scripting languages such as Ansible, Python, and Jenkins for network configuration and deployment is essential. In-depth knowledge of networking protocols, security principles, and infrastructure as code (IaC) concepts is required. Experience or knowledge with cloud platforms like AWS, Azure, and GCP will be beneficial. Familiarity with Cisco ASR, Nexus, Catalyst, Juniper SRX, Cumulus automation, as well as monitoring tools like Grafana, Kentik, Elastic, and Kibana, will be advantageous. Furthermore, having relevant certifications such as CCNP, DevNet, or AWS Certified DevOps Engineer would be a plus. Excellent communication and collaboration skills are essential, enabling you to work effectively in a team environment. 
By staying abreast of industry trends, emerging technologies, and best practices, you will continuously improve network infrastructure to ensure the success of Rakuten India's operations.
Posted 1 day ago
5.0 - 9.0 years
0 Lacs
maharashtra
On-site
As an IT Infrastructure and Application Monitoring Services expert at Capgemini, you will play a crucial role in shaping your career while being supported by a collaborative global community. Your responsibilities will include integrating monitoring tools for event management and automation, demonstrating proficiency in ITIL Processes and Monitoring Concepts, and managing Budget, Project, and Vendor requirements. You will lead a team of 5-10 members and deliver monitoring solutions that meet customer requirements, offering consultation on the best possible solutions. With a minimum of 5 years of experience, you will be well-versed in Monitoring Tools such as Dynatrace SaaS, ScienceLogic, AppDynamics, New Relic, and Micro Focus tools. Additionally, you will possess basic knowledge of IT Infrastructure and Application Monitoring Services, understand ITSM Processes with Agile Methodology, and have the ability to manage multiple concurrent projects and stakeholders effectively. Strong verbal and written communication skills are essential, along with the capability to handle conflict constructively. Experience in handling large monitoring tool setups and migrations to new tools will be beneficial, as well as familiarity with the GSI Domain (Global System Integrator). Your willingness to work a flexible schedule will be an asset in this role. Capgemini is a global leader in business and technology transformation, committed to accelerating the transition to a digital and sustainable world. With a diverse team of 340,000 members in over 50 countries, Capgemini leverages its 55-year heritage to deliver end-to-end services and solutions. By unlocking the value of technology through AI, cloud, and data capabilities, Capgemini addresses the comprehensive business needs of its clients. Join us in creating a more sustainable and inclusive world while making a tangible impact on enterprises and society.
Posted 1 day ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
As a Discovery Domain Cloud Solutions Analyst at Lilly, you will play a crucial role in designing, building, and maintaining highly scalable and secure cloud-based backend services and APIs. Your main focus will be enabling data-driven decision-making in the Discovery Informatics domain by partnering with Discovery scientists and informatics experts. You will be responsible for developing and deploying containerized applications using Kubernetes, ensuring scalable data processing pipelines and scientific computing environments. Your key responsibilities will include delivering solutions leveraging AWS cloud services, implementing CI/CD pipelines for continuous deployment of scientific data services, and applying infrastructure-as-code (IaC) principles using tools like Terraform or AWS CloudFormation. You will also be responsible for ensuring secure data handling and compliance with data governance policies, developing and maintaining robust monitoring solutions, and driving automation and continuous improvement in cloud infrastructure. To be successful in this role, you must have strong hands-on experience with AWS services, Kubernetes, and backend development using Python, Node.js, or Java. Proficiency in CI/CD tools, automation frameworks, and monitoring tools is essential. Experience with Terraform or CloudFormation for automating cloud infrastructure is required, along with familiarity with container security best practices and vulnerability scanning tools. Preferred qualifications include experience working in regulated environments such as pharmaceuticals, exposure to scientific informatics platforms, and knowledge of AWS analytics services. Understanding of Agile development methodologies, DevOps practices, and excellent problem-solving skills are highly desirable. Strong verbal and written communication skills are also important for effective collaboration with technical and non-technical stakeholders. 
Join the Discovery Informatics team at Lilly and be part of advancing scientific research by providing robust digital solutions that support data acquisition, management, and analysis for early drug discovery. Collaborate closely with scientists to turn data into insights, accelerating the path from lab to patient. Lilly is dedicated to creating new possibilities through technology to advance our purpose of creating medicines that make life better for people around the world. If you are determined to make life better for people globally and have the required skills and experience, we encourage you to apply for this exciting opportunity at Lilly. #WeAreLilly
Posted 1 day ago
8.0 - 12.0 years
0 Lacs
karnataka
On-site
As a DevOps Engineer at Cisco Cloud Security Engineering, you will be an integral part of the dynamic software development team. Your role will involve automating and optimizing the software delivery process, managing the organization's cloud infrastructure, troubleshooting system issues, collaborating on deployment strategies, and ensuring a seamless transition from development to production. You will also have the opportunity to learn and adapt to new technologies in the DevOps landscape. Key responsibilities include supporting the development and operations teams, monitoring system health and security, troubleshooting across various domains, participating in deployment strategies, and creating reliable deployment pipelines. You will need to have a Bachelor's Degree in Computer Science or a related field, along with 8-11 years of experience in software development or DevOps engineering. Proficiency in programming languages like Go, Java, or Python, expertise in Infrastructure as code technologies such as Terraform, and experience with cloud platforms like AWS are essential. Desired qualifications include familiarity with data pipeline tools, strong problem-solving skills, excellent communication abilities, and a willingness to learn in a fast-paced environment. You will collaborate with a team of developers, systems administrators, and other DevOps engineers to enhance the software development process and work with cross-functional teams to meet their infrastructure and automation needs. The Cloud Security Engineering team at Cisco is dedicated to building and operating core control plane services for the Umbrella and Cisco Secure Access platform. The team emphasizes learning and experimentation while closely collaborating with other engineering groups across Cisco. Cisco values inclusivity, innovation, and teamwork, offering a supportive environment for personal and professional growth. 
If you are passionate about technology and making a positive impact, join us at Cisco to help shape a more inclusive and digital future for everyone. #WeAreCisco
Posted 1 day ago
5.0 - 9.0 years
0 Lacs
hyderabad, telangana
On-site
You will be joining DAZN, a revolutionary sports broadcaster that is changing the way millions of households across the world experience live sports events. As a Senior Streaming Support Engineer in our Technology team based in Hyderabad, India, you will play a crucial role in ensuring the seamless delivery of live sports content to consumers worldwide through various devices. Your responsibilities will include monitoring and supporting the global streaming platform, identifying and resolving technical faults, conducting root cause analysis of complex issues, and providing regular updates on platform stability and performance to key stakeholders and senior management. Additionally, you will collaborate closely with Broadcast Platforms Managers, contribute to the development of core streaming functions, and help maintain a successful streaming operation by demonstrating strong leadership qualities. To excel in this role, you should have prior experience in supporting OTT head-end components, DRM, CDNs, and end-user client performance. Proficiency in troubleshooting OTT workflows, HTTP streaming solutions, A/V encoding, packaging, and CDN technologies is essential. You should possess a high level of expertise in troubleshooting and root cause analysis, and be willing to work flexible hours, including weekends. Experience with cloud computing platforms like AWS, GCP, or Azure, as well as monitoring tools such as New Relic, CloudWatch, and Conviva, will be advantageous. Moreover, having familiarity with tools like ServiceNow, Jira, Confluence, Grafana, or Dataminer, and prior experience in live sport broadcast/stream delivery, will further enhance your suitability for this role. By leveraging your technical skills and industry knowledge, you will contribute to the success of our streaming services and play a pivotal role in shaping the future of sports broadcasting at DAZN.
Posted 1 day ago
4.0 - 8.0 years
0 Lacs
hyderabad, telangana
On-site
You will be joining Innovan Technologies Private Limited, a company founded in 2019 in Hyderabad, Telangana, India. The name "Innovan" signifies Innovation, and at Innovan, we strive for excellence through collaboration, learning, and mentorship. Our team works closely with colleagues from the US, Canada, Mexico, and the Philippines. We value mutual respect and trust, fostering a culture of innovation and creativity. In a short span of time, we have expanded into a larger team, enabling us to establish various "Centers of Excellence (COE)" focusing on domains such as Low-Code/No-Code, AI/ML, Generative AI, iPaaS, PaaS, and SaaS with a multi-cloud architecture and hybrid cloud solutions. As the Performance Test Lead, you will be responsible for overseeing all aspects of performance testing to ensure applications function optimally under expected and peak load conditions. This role involves collaborating closely with development, QA, and operations teams to identify performance bottlenecks and provide recommendations for enhancement. Your key responsibilities will include developing and implementing performance test strategies, executing various tests, monitoring application performance, analyzing test results, providing optimization recommendations, and coordinating with stakeholders to communicate findings effectively. Additionally, you will be managing performance testing tools, maintaining best practices, and mentoring junior team members on testing techniques. To qualify for this role, you should have at least 10 years of experience in software testing, with a minimum of 4 years in a leadership position. Proficiency in performance testing tools like JMeter, LoadRunner, and Locust, strong scripting language skills, knowledge of network protocols, databases, and server-side components, and familiarity with monitoring tools are essential technical skills required. 
Excellent analytical, problem-solving, communication, and interpersonal skills are also necessary to excel in this collaborative and fast-paced environment. Preferred qualifications include experience in cloud-based environments, knowledge of CI/CD pipelines and DevOps practices, and certification in performance testing or quality assurance. When forwarding your resume, please mention the position and your experience in the subject line.
Posted 1 day ago
3.0 - 7.0 years
0 Lacs
karnataka
On-site
You will be responsible for the administration, maintenance, and support of On-prem Windows Server environments, AWS, and Azure cloud infrastructure. This includes managing and optimizing the performance, security, and availability of on-premises and cloud-based systems, as well as providing technical support to end-users and other IT staff. Your main responsibilities will include: On-premises Windows Server Administration: - Installing, configuring, and maintaining Windows Server operating systems such as Server 2016, 2019, 2022. - Managing Active Directory, Group Policy, and user accounts. - Implementing and managing security policies and procedures. - Troubleshooting and resolving server and application issues. - Maintaining backups and disaster recovery plans. - Patching and updating servers. - Monitoring server performance and resource utilization. Azure/AWS Cloud Administration: - Managing virtual machines (VMs), networks, storage, and other Azure or AWS services. - Implementing and managing Azure/AWS security policies and procedures. - Monitoring Azure/AWS resource utilization and performance. - Automating tasks using Azure/AWS automation tools. - Implementing and managing infrastructure as code (IaC). - Deploying and managing applications in Azure/AWS. - Working with Azure/AWS Active Directory. General IT Responsibilities: - Providing technical support to end-users. - Participating in on-call rotation. - Documenting IT processes and procedures. - Collaborating with other IT staff. - Staying current with new technologies and trends. Basic Requirements: - Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent work experience. - Proven experience in a Windows System Administrator role, managing Windows Server 2012/2016/2019 environments. - Solid experience with cloud services administration (e.g., Microsoft Azure, AWS). 
Preferred Skills: - Certifications such as Microsoft Certified Systems Administrator (MCSA), Microsoft Certified Solutions Expert (MCSE), or equivalent cloud certifications (e.g., Azure Administrator Associate, AWS Certified SysOps Administrator). - Knowledge of ITIL and industry best practices. - Familiarity with monitoring tools and services (e.g., Nagios, Prometheus, Zabbix). - Kubernetes and containerization of Windows. - Experience with virtualization technologies (e.g., Hyper-V, VMware). - Strong knowledge of systems and networking software, hardware, and networking protocols. - Experience with scripting and automation. - Knowledge of data protection operations and disaster recovery. - Ability to create a secure and robust network environment. This position does not require travel or relocation. Motorola Solutions is committed to an inclusive and accessible recruiting experience for candidates with disabilities or other physical or mental health conditions. If you believe you would be a great addition to the team, even if you do not meet all the preferred skills, we encourage you to apply.
Posted 1 day ago
2.0 - 6.0 years
3 - 8 Lacs
pune
Work from Office
About project: Axis Solar Inc., Canada (Ontario). Axis Solar Inc. is an established service provider specializing in solar energy systems across Ontario. Its offerings include planning, re-powering, maintenance, monitoring, and optimization of solar installations. Its mission is to help maximize the profitability of its clients' solar investments. A total of 80 projects are currently running, each of a different type (40 sites in different locations), for almost 250 solar clients. Services are provided to commercial, bank, and school customers (panel size: 20x20, rooftop panel projects). Services include: preventative maintenance, solar monitoring, strategic consulting, corrective maintenance, land management. Key Responsibilities Monitoring: 365 days/year monitoring. Daily site monitoring, 9am alert list. Send alerts to CM staff on weekends, triggering truck rolls. PM Reviews: Receive/store PM reports in Asset files. Read reports and create tickets for issue resolution. CM Invoices: Receive/store CM invoices in Asset files. Reporting: Prepare/distribute standard production reports to SIF and third-party clients. Detailed Analysis: In-depth site analysis, quarterly detailed reports. Key Skills & Competencies: Strong knowledge of solar PV systems, SCADA, and remote monitoring tools. Expertise in data analytics, MS Excel, Power BI/Tableau, and reporting automation. Candidate profile: Degree in Engineering/Technical discipline. Experience in Solar Monitoring & Data Analysis. Knowledge of monitoring platforms & ticketing systems, reporting tools. Comfortable with rotational/shift schedules (24/7 coverage). Strong documentation, reporting & analytical skills.
Posted 1 day ago
10.0 - 15.0 years
25 - 35 Lacs
noida, pune, bengaluru
Work from Office
Description: We are looking for a highly skilled Senior DevOps Engineer (8–14 years) with strong expertise in designing and implementing scalable DevOps solutions. The ideal candidate must be an AWS Certified Solutions Architect, with mandatory hands-on experience in Chef and deep expertise in Terraform, CI/CD, cloud infrastructure, and automation. This role requires both hands-on skills and architectural design capabilities, with strong exposure to AWS, Docker, Kubernetes, and monitoring tools. Requirements: 8–14 years of experience in DevOps engineering with architectural exposure. AWS Certified Solutions Architect (mandatory). Strong expertise in Chef (mandatory) and Terraform (IaC). Hands-on experience in CI/CD pipeline design and automation. Strong scripting skills in Python, Shell, Unix, YAML. Experience with Docker and Kubernetes in production. Monitoring tools expertise: ELK, Prometheus, Grafana, New Relic. Familiarity with serverless AWS architectures (Lambda, Step Functions, DynamoDB). Experience with data migration is a plus. Strong problem-solving, collaboration, and communication skills. Job Responsibilities: Architect, design, and implement scalable DevOps solutions and infrastructure. Develop and manage CI/CD pipelines for microservice-based applications. Automate provisioning and configuration management using Chef (mandatory) and Terraform (IaC). Write and optimize automation scripts in Python, Shell, Unix, YAML. Manage and optimize AWS cloud infrastructure, including serverless services (Lambda, Step Functions, DynamoDB). Implement monitoring and observability with ELK, Prometheus, Grafana, New Relic. Drive containerization and orchestration using Docker and Kubernetes. Ensure scalability, high availability, and security across environments. Collaborate with development and architecture teams to establish DevOps best practices. Mentor and guide junior DevOps engineers.
What We Offer: Exciting Projects: We focus on industries like high-tech, communication, media, healthcare, retail, and telecom. Our customer list is full of fantastic global brands and leaders who love what we build for them. Collaborative Environment: You can expand your skills by collaborating with a diverse team of highly talented people in an open, laid-back environment, or even abroad in one of our global centers or client facilities! Work-Life Balance: GlobalLogic prioritizes work-life balance, which is why we offer flexible work schedules, opportunities to work from home, and paid time off and holidays. Professional Development: Our dedicated Learning & Development team regularly organizes communication skills training (GL Vantage, Toast Master), a stress management program, professional certifications, and technical and soft skill trainings. Excellent Benefits: We provide our employees with competitive salaries, family medical insurance, Group Term Life Insurance, Group Personal Accident Insurance, NPS (National Pension Scheme), a periodic health awareness program, extended maternity leave, annual performance bonuses, and referral bonuses. Fun Perks: We want you to love where you work, which is why we host sports events, cultural activities, and corporate parties, and offer food at subsidized rates. Our vibrant offices also include dedicated GL Zones, rooftop decks, and a GL Club where you can drink coffee or tea with your colleagues over a game of table, and we offer discounts for popular stores and restaurants!
Posted 1 day ago
0.0 - 5.0 years
1 - 3 Lacs
pune
Work from Office
Job Title: Data Center Operator Location: Pune Experience Required: Minimum 6 Months CTC Range: Up to 3.5 LPA Joining Requirement: Immediate Joiner Only Employment Type: Full-time (Work from Office) Role Overview: We are looking for a Data Center Operator to support day-to-day operations and ensure smooth functioning of IT infrastructure. The ideal candidate should have at least 6 months of experience in a Data Center or IT operations environment, with the ability to monitor, troubleshoot, and escalate issues as required. Key Responsibilities: Monitor data center infrastructure, servers, and network equipment. Perform routine checks and escalate issues to the relevant teams. Ensure adherence to operational procedures and SLAs. Manage backups, system alerts, and incident reporting. Assist in basic troubleshooting of hardware and connectivity issues. Maintain logs, documentation, and shift handover reports. Coordinate with L2/L3 teams for issue resolution. Required Skills & Competencies: Minimum 6 months of experience in Data Center / IT Operations. Knowledge of server monitoring tools, basic networking, and hardware handling. Good communication and coordination skills. Ability to work in rotational shifts (including night shifts). Strong sense of responsibility and attention to detail. Eligibility Criteria: Experience: 6 months+ in data center operations. Location: Pune (Work from Office). Budget: Maximum 3.5 LPA. Immediate joiners only. Interested candidates can share their updated resume and details on: Anurag.Yadav@softenger.com 7385556898 (WhatsApp) Details to be shared along with resume: Updated Resume, Total Experience, Relevant Experience, Current CTC, Expected CTC, Notice Period, Current Location, Ready to Relocate to Pune Location
Posted 1 day ago
4.0 - 5.0 years
8 - 12 Lacs
gurugram
Work from Office
Experience Level : Mid Level. Position Overview : We are looking for a Mid-Level Kubernetes Administrator to support and maintain our on-premises container orchestration infrastructure built on open-source Rancher Kubernetes. This role will focus on day-to-day cluster operations, deployment support, and working closely with DevOps, Infra, and Application teams. Roles and Responsibilities : - Manage Rancher-based Kubernetes clusters in an on-premise environment. - Deploy and monitor containerized applications using Helm and Rancher UI/CLI. - Support pod scheduling, resource allocation, and namespace management. - Handle basic troubleshooting of workloads, networking, and storage issues. - Monitor and report cluster health using Prometheus, Grafana, or similar tools. - Manage users, roles, and access using Rancher-integrated RBAC. - Participate in system patching, cluster upgrades, and capacity planning. - Document standard operating procedures, deployment guides, and issue resolutions. Must Have Skills : - 4 - 5 years of experience in Kubernetes administration in on-prem environments. - Hands-on experience with Rancher for managing K8s clusters. - Working knowledge of Linux system administration and networking. - Experience in Docker, Helm, and basic YAML scripting. - Exposure to CI/CD pipelines and Git-based deployment workflows. - Experience with monitoring/logging stacks (Prometheus, Grafana). Good to Have Skills : - Certified Kubernetes Administrator (CKA). - Familiarity with RKE (Rancher Kubernetes Engine). - Experience with bare metal provisioning, VM infrastructure, or storage systems. Qualification : BE/BTech/MCA/ME/MTech/MS in Computer Science or a related technical field or equivalent practical experience. Location : Gurgaon / Onsite.
Posted 1 day ago
5.0 - 10.0 years
9 - 18 Lacs
hyderabad, coimbatore, bengaluru
Work from Office
Scope: • As a Monitoring SME & Architect, you will be responsible for designing and implementing comprehensive monitoring solutions and processes to ensure uptime, system health, performance, and reliability. You will be responsible for reducing alert volume, implementing intelligible alerting, alert correlation, and alert compression, measuring the signal-to-noise ratio, and setting up an early warning system across Operations. You will be required to collaborate across teams and create centralized dashboards and visibility to remove silos. You will be responsible for architecting monitoring configurations in a scalable and secure model, leveraging automation, with a future scope of AI-integrated monitoring operations. Our current technical environment: Technical Skills: Monitoring Tool Administration, Logs Indexing & pipeline, Azure, VMWare, Ansible, Python, Selenium, Terraform, Shell, Windows, Linux, GROK parsing • Problem-solving skills – should be able to devise technical and creative solutions. Use analytics to understand patterns and proactively identify gaps • Communication skills – Effective communication is key in this role to gather data about problems, prepare detailed notes and reports, and update users with further steps • Time management – Need to maintain excellent time management skills and should be able to set priorities when handling multiple cases. • Team collaboration – Routinely works with other functions to resolve user issues, so must collaborate successfully with team members and coworkers. • Highly motivated, hands-on personality. • Ability to learn quickly in a challenging environment. Key Accountability: Monitoring Effectiveness – Ensuring the monitoring framework and enhancements are set up to increase proactive identification and resolution prior to customer impact.
Set up and maintain centralized monitoring configuration as code. Consistently drive the alert volume down and eliminate false alerts. Set up advanced monitoring alerts for golden signals, i.e. Latency, Errors, Throughput & Saturation. Transform from traditional CPU and memory symptomatic monitors to more advanced alert correlation pinpointing directly to issues for predictive monitoring. Create and implement Synthetic or End User Monitoring using Python and Selenium for customer experience monitoring. Set up API endpoint monitoring and measure uptime and availability across customers, products, and infrastructure endpoints. Implement SLO, SLI, and Error Budget concepts to measure and set up a maturity model. Maintain and manage a code repository built for scale and security. Leverage automation to push changes to monitoring tools. Set up an orchestration mechanism for on-boarding and decommissioning to ensure operational readiness. Set up dashboards and create visibility across all cross-functional teams. Establish telemetry for automated collection of data across Metrics, Logs & Traces. Continuously analyze data to identify gaps and implement improvements. Minimum Requirements: Associate’s degree (or equivalent) in Computer Science; Information Technology or related field preferred. 8-10 years of IT experience with 7 years of monitoring experience. Experience in administering monitoring tools – AppDynamics, SolarWinds, Grafana, Zabbix, DataDog, ELK Stack etc. Hands-on experience with Logs, Metrics, Traces, Parsing, RegEx, Tagging. Hands-on experience implementing APM, EUM, Synthetics, API endpoint monitoring etc.
Hands-on experience with integrations with ITSM tools such as ServiceNow & Jira. Hands-on experience with Ansible, Python, Selenium, Shell. Hands-on experience with enterprise-scale Azure, VMware & AWS. Hands-on experience creating dashboards and analysis. Excellent interpersonal and influencing skills, interacting appropriately with colleagues of many technical skill levels, and remaining calm and courteous while working in high-stress situations to resolve problems.
Posted 1 day ago
The job market for monitoring tools in India is currently thriving, with a high demand for professionals who can effectively manage and maintain monitoring systems for various organizations. As businesses continue to rely on technology for their operations, the need for skilled individuals who can ensure the performance and security of these systems has never been greater.
These cities are known for their strong IT sectors and have a high concentration of companies looking to hire monitoring tools professionals.
The salary range for monitoring tools professionals in India can vary based on experience and location. On average, entry-level positions can expect to earn between INR 3-6 lakhs per annum, while experienced professionals can earn upwards of INR 10 lakhs per annum.
In the field of monitoring tools, a typical career path may include roles such as Junior Monitoring Engineer, Monitoring Analyst, Monitoring Team Lead, and Monitoring Tools Architect. As professionals gain experience and expertise in the field, they can progress to more senior positions with greater responsibilities.
In addition to proficiency in monitoring tools, professionals in this field are often expected to have knowledge of networking, database management, scripting languages, and cloud computing. Strong analytical and problem-solving skills are also essential for success in monitoring tools roles.
As you explore opportunities in the monitoring tools job market in India, remember to showcase your skills and experience confidently during interviews. By preparing thoroughly and demonstrating your expertise in monitoring systems, you can position yourself as a valuable candidate for exciting career opportunities in this field. Good luck!