1623 Grafana Jobs - Page 3

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

15.0 - 19.0 years

0 Lacs

chennai, tamil nadu

On-site

As the Quality Engineering Transformation Leader/Test Architect at Wipro Limited, you will play a crucial role in driving Quality Engineering and Testing transformation initiatives for client engagements aligned with application engineering and digital roadmap. Your responsibilities will include collaborating with client stakeholders to implement Quality Engineering & Testing Transformation, presenting quality engineering concepts to senior stakeholders, identifying opportunities for QET transformation, and working with various Wipro teams to plan, govern, measure, and deliver transformation outcomes. To excel in this role, you should have over 15 years of extensive experience working with multi-skilled testing teams and other project roles, the ability to create quality engineering and testing transformation roadmaps and strategies, and a consultative approach in helping clients align quality objectives with business goals. You should be well-versed in modern testing practices such as continuous testing, in-sprint automation, service virtualization, and various testing tools aligned with Agile and CI/CD. Certifications like SAFE and Certified Scrum Master (CSM) would be highly desirable. Your role will also involve creating frameworks to measure current baselines, plan future states, and achieve transformation goals with associated business value and cost benefits. Additionally, you will lead multiple test leads and test managers to drive transformation initiatives, build strong relationships across stakeholder groups, and effectively communicate and influence at various organizational levels. This position requires strong problem-solving skills, an entrepreneurial drive, the ability to negotiate and shape deals, and proficiency in test reporting, testing metrics, and benefit realization. You should also possess good communication, interpersonal, organizational, and time-management skills to succeed in this dynamic role. 
Join Wipro Limited in reinventing your world and be part of a business that empowers you to design your own reinvention. Embrace the opportunity to contribute to a modern Wipro and drive digital transformation with purpose. Wipro welcomes applications from individuals with disabilities, embodying a culture of inclusivity and diversity.

Posted 3 days ago


7.0 - 11.0 years

0 Lacs

pune, maharashtra

On-site

Working at Tech Holding provides you with an opportunity to be part of a full-service consulting firm dedicated to delivering high-quality solutions and predictable outcomes to clients. Our team, comprising industry veterans with experience in both emerging startups and Fortune 50 firms, has developed a unique approach based on deep expertise, integrity, transparency, and dependability. We are currently seeking a Cloud Architect with at least 9 years of experience to assist in building functional systems that enhance customer experience. Your responsibilities will include:

Monitoring & Observability:
- Setting up and configuring Datadog and Grafana for comprehensive system metric monitoring and visualization.
- Developing alerting systems to proactively identify and resolve potential issues.
- Integrating monitoring tools with applications and infrastructure to ensure high observability.

CI/CD:
- Implementing and managing CI/CD pipelines using GitHub Actions, EKS, and Helm to automate build, test, and deployment processes.
- Optimizing build times and deployment frequency to expedite development cycles.
- Ensuring adherence to best practices for code quality, security, and compliance.

Cloud Infrastructure:
- Designing and overseeing the migration of Azure infrastructure to AWS with a focus on leveraging best practices and cloud-native technologies.
- Managing and optimizing AWS and Azure environments, including cost management, resource allocation, and security.
- Implementing and maintaining infrastructure as code (IaC) using tools like Terraform or AWS CloudFormation.

Incident Management:
- Implementing and managing incident response processes for efficient detection, response, and resolution of incidents.
- Collaborating with development, operations, and security teams to identify root causes and implement preventative measures.
- Maintaining incident response documentation and conducting regular drills to enhance readiness.

Migration:
- Leading the migration of ECS services to EKS while ensuring minimal downtime and data integrity.
- Optimizing EKS clusters for performance and scalability.
- Implementing best practices for container security and management.

CDN Management:
- Managing and optimizing the Akamai CDN solution to efficiently deliver content.
- Configuring CDN settings for caching, compression, and security.
- Monitoring CDN performance and troubleshooting issues.

Technology Stack:
- Proficiency in Python or Go for scripting and automation.
- Experience with Mux Enterprise for reporting, monitoring, and alerting.
- Familiarity with relevant technologies and tools such as Kubernetes, Docker, Ansible, and Jenkins.

Qualifications:
- Bachelor's degree in computer science, engineering, or a related field.
- Minimum of 7 years of experience in DevOps or a similar role.
- Strong understanding of cloud platforms (AWS and Azure) and their services.
- Expertise in Python or Go Lang and monitoring/observability tools (Datadog, Grafana).
- Proficiency in CI/CD pipelines and tools (GitHub Actions, EKS, Helm).
- Experience with infrastructure as code (IaC) tools (Terraform, AWS CloudFormation).
- Knowledge of containerization technologies (Docker, Kubernetes).
- Excellent problem-solving, troubleshooting, and communication skills.
- Ability to work independently and collaboratively within a team.

Employee Benefits include flexible work timings, work from home options as needed, family insurance policy, various leave benefits, and opportunities for learning and development.

Posted 3 days ago


5.0 - 9.0 years

0 Lacs

karnataka

On-site

As a Lead / Staff Software Engineer in the Black Duck SRE team, you will play a key role in transforming our R&D products through the adoption of advanced cloud, containerization, microservices, modern software delivery, and other cutting-edge technologies. You will be a key member of the team, working independently to develop tools and scripts for automated provisioning, deployment, and monitoring. The position is based in Bangalore (near Dairy Circle Flyover) with a hybrid work mode.

Key Qualifications:
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- Minimum of 5-7 years of experience in Site Reliability Engineering / DevOps Engineering.
- Strong hands-on experience with Containerization & Orchestration using Docker, Kubernetes (K8s), and Helm to secure, optimize, and scale K8s.
- Deep understanding of Cloud Platforms & Services in AWS / GCP / Azure (preferably GCP) to optimize cost, security, and performance.
- Solid experience with Infrastructure as Code (IaC) using Terraform / CloudFormation / Pulumi (preferably Terraform): write modules, manage state.
- Proficient in Scripting & Automation using Bash, Python / Golang: automate tasks, error handling.
- Experienced in CI/CD Pipelines & GitOps using Git / GitHub / GitLab / Bitbucket / ArgoCD, Harness.io: implement GitOps for deployments.
- Strong background in Monitoring & Observability using Prometheus / Grafana / ELK Stack / Datadog / New Relic: configure alerts, analyze trends.
- Good understanding of Networking & Security using Firewalls, VPN, IAM, RBAC, TLS, SSO, Zero Trust: implement IAM, TLS, logging.
- Experience with Backup & Disaster Recovery using Velero, Snapshots, DR Planning: implement backup solutions.
- Basic understanding of messaging concepts using RabbitMQ / Kafka / Pub/Sub / SQS.
- Familiarity with Configuration Management using Ansible / Chef / Puppet / SaltStack: run existing playbooks.

Key Responsibilities:
- Design and develop scalable, modular solutions that promote reuse and are easily integrated into our diverse product suite.
- Collaborate with cross-functional teams to understand their needs and incorporate user feedback into the development.
- Establish best practices for modern software architecture, including Microservices, Serverless computing, and API-first strategies.
- Drive the strategy for Containerization and orchestration using Docker, Kubernetes, or equivalent technologies.
- Ensure the platform's infrastructure is robust, secure, and compliant with industry standards.

What We Offer:
- An opportunity to be a part of a dynamic and innovative team committed to making a difference in the technology landscape.
- Competitive compensation package, including benefits and flexible work arrangements.
- A collaborative, inclusive, and diverse work environment where creativity and innovation are valued.
- Continuous learning and professional development opportunities to grow your expertise within the industry.

Posted 3 days ago


7.0 - 11.0 years

0 Lacs

karnataka

On-site

Are you looking for a chance to contribute significantly to the rapid growth of a highly successful software company? At Poppulo, we are at the forefront of innovation in communications and workplace technology. We understand the challenges of effectively reaching every employee, managing office space in a hybrid environment, and enhancing customer and guest experiences. Our mission is to simplify these complexities and bring harmony to our clients' operations. As a pioneer in the industry, Poppulo's omnichannel employee communications, customer communications, and workplace experience platform is trusted by over 6,000 organizations worldwide, reaching more than 35 million employees and delivering content to over 500,000 digital signs.

We acknowledge that there is no such thing as a perfect candidate, and we are all continuously developing new skills and capabilities. Therefore, we encourage you to apply for a position at Poppulo even if you do not meet all the requirements. We believe in creating an inclusive environment that values diverse perspectives to foster growth and success for all.

We are currently seeking a highly skilled and experienced Senior Software Engineer to join our Data Services team. The ideal candidate will possess a solid background in software development, a deep passion for technology, and the ability to lead complex projects effectively.

Key Responsibilities:
- Design, develop, and maintain high-quality software solutions.
- Provide technical guidance and mentorship to junior team members.
- Collaborate with team members and stakeholders to ensure alignment and transparency.
- Document technical specifications, system architecture, and design decisions.

Qualifications:
- Bachelor's degree in computer science, engineering, or a related field (Master's degree preferred).
- Demonstrated experience in designing and developing complex software applications.
- 7+ years of hands-on development experience in .NET.
- Proficiency in C#, MSSQL, Blazor, Redis, relational or non-relational databases, multithreading, unit testing, and RESTful APIs.
- Strong problem-solving and analytical skills.
- Ability to write clean, efficient, and well-documented code following best practices and coding standards.
- Experience with writing tests using different frameworks like NUnit.
- Familiarity with optimizing performance and scalability of applications.
- Expertise in developing identity and access management solutions such as OAuth 2.0, OpenID Connect, SAML, AWS, or Azure identity management solutions.
- Knowledge of containerization technologies like Docker and Kubernetes.
- Experience with microservices, design patterns, version control systems (e.g., GitLab), cloud platforms (e.g., AWS, Azure, Google Cloud), and CI/CD pipelines.
- Proficiency in quality control tools like Snyk, SonarQube, Vulcan.
- Ability to provide technical guidance and mentorship to junior and mid-level engineers.
- Strong understanding of software development methodologies such as Agile and Scrum.
- Experience with observability tools like Grafana, Prometheus.
- Participation in technical interviews to recruit new team members.
- Exposure to third-party integrations like feeds.

Preferred Qualifications:
- Experience with identity providers like Okta or Curity.
- Knowledge of event-based architecture using Kafka.
- Familiarity with technologies and tools like JavaScript, bug tracking systems like JIRA, and documentation tools like Confluence.

Posted 3 days ago


2.0 - 6.0 years

0 Lacs

karnataka

On-site

We are seeking a NOC Engineer to join our NOC Team within the Technology Operations Organization at Telesign. As a part of the 24/7 team, you will be responsible for monitoring both internal and external systems, troubleshooting issues, and ensuring timely resolutions or escalations. Collaboration with various Telesign teams such as Tech Ops, Software Development, and Tech Support is a key aspect of this role, aiming to enhance the quality of service provided by Telesign. Your primary duties will include offering first-level technical support for Telesign's servers and network, monitoring, troubleshooting, and managing alerts concerning servers and networks across global data centers. You will also be involved in supporting traffic moves according to technical, business, and operational requirements, as well as collecting and analyzing forensic information related to issues, alerts, or customer escalations. Furthermore, you will be responsible for executing monthly Linux patching procedures, diagnosing technical problems utilizing internal tools, database queries, and log file analysis, as well as suggesting and implementing improvements in system monitoring and self-healing processes. Your role will also involve collaborating with technical peers to enhance efficiency and customer satisfaction, ensuring that customers are content with the support they receive. To be successful in this role, you must be able to work in shifts as per a 24/7 schedule, possess advanced knowledge of Linux, SQL, and networking, along with 2-5 years of relevant experience. Strong analytical and problem-solving skills, attention to detail, and the ability to work well under pressure are essential. Additionally, you should be self-organized, proactive, and possess excellent communication skills to handle multiple priorities effectively. 
Preferred qualifications include a working understanding of technical concepts such as REST API, Java, PHP, Ruby, C#, Python, familiarity with Atlassian tools, and experience in Telecom and Messaging (CPaaS, SMS, Voice, telecom data, Carrier ecosystem). IT-related certifications would be an added advantage. Join us in this dynamic environment where your skills and expertise will contribute to the success of Telesign and ensure customer satisfaction through efficient and effective support services.

Posted 3 days ago


7.0 - 11.0 years

0 Lacs

kolkata, west bengal

On-site

As a Test Engineer 3 at Hyland Software, you will play a crucial role in ensuring the delivery of high-quality software and products by writing code/scripts, identifying tools for functional and non-functional tests, and contributing to automated test frameworks. You will be responsible for designing, implementing, and managing cloud-based infrastructure and automation solutions, with a strong focus on AWS. Collaboration with other engineering groups to define, document, analyze, perform, and interpret tests for products, systems, components, and software modifications will be a key aspect of your role. Your expertise in operations, infrastructure as code (IaC), and CI/CD pipeline management will support the scalability, security, and performance of systems. Your responsibilities will include developing and maintaining complex integration, functional, and non-functional tests, leading the implementation of the delivery pipeline, verifying system functionality through automated and manual tests, analyzing results, providing recommendations, and creating and managing performance metrics and test results. You will also design and manage CI/CD pipelines in GitHub, maintain applications in Kubernetes stack, develop and maintain infrastructure as code using Terraform and Helm, manage various AWS services, implement monitoring, logging, and alerting solutions, and stay updated on emerging DevOps tools and best practices. To be successful in this role, you should have a Bachelor's degree in Computer Science or a related field with at least 7 years of relevant work experience. Proficiency in an Object-Oriented Programming Language like C# or Typescript, experience with Playwright (TypeScript), testing web applications, REST/SOAP API testing, CI/CD tools, AWS, Docker, Kubernetes, logging and monitoring tools, and Agile frameworks is required. 
Strong communication, organizational, collaboration, critical thinking, problem-solving, leadership, and time management skills are essential. You should be self-motivated, detail-oriented, able to work independently and in a team environment, thrive in a fast-paced environment, and have a drive to learn and stay current professionally. At Hyland Software, we offer a culture that values employee engagement and provides meaningful benefits and programs, including learning & development opportunities, R&D focus, work-life balance culture, well-being initiatives, community engagement activities, diversity & inclusion programs, and various niceties & events. If you are passionate about technology, dedicated to your work, and value honesty, integrity, and fairness, we encourage you to connect with us and be a part of our team.

Posted 3 days ago


5.0 - 10.0 years

0 Lacs

pune, maharashtra

On-site

You will be responsible for utilizing the best DevOps practices to optimize the software development process. This includes system administration, design, construction, and operation of container platforms such as Kubernetes, as well as expertise in container technologies like Docker and their management systems. Your role will also involve working with cloud-based monitoring, alerting, and observability solutions, and possessing in-depth knowledge of developer workflows with Git. Additionally, you will be expected to document processes, procedures, and best practices, and demonstrate strong troubleshooting and problem-solving skills. Your proficiency in Network Fundamentals, Firewalls, and ingress/egress Patterns, as well as experience in security configuration Management and DevSecOps, will be crucial for this position. You should have hands-on experience with Linux, CI/CD Tools (Pipelines, GitHub, GitHub Actions/Jenkins), and Configuration Management/Infrastructure as Code tools like CloudFormation, Terraform, and Cloud technologies such as VMware, AWS, and Azure. Your responsibilities will also include build automation, deployment configuration, and enabling product automation scripts to run in CI. You will be required to design, develop, integrate, and deploy CI/CD pipelines, collaborate closely with developers, project managers, and other teams to analyze requirements, and resolve software issues. Moreover, your ability to lead the development of infrastructure using open-source technologies like Elasticsearch, Grafana, and homegrown tools such as React and Python will be highly valued. 
Minimum Qualifications:
- Graduate/master's degree in computer science, Engineering, or related discipline
- 5 to 10 years of overall DevOps/Related experience
- Good written and verbal communication skills
- Ability to manage and prioritize multiple tasks while working both independently and within a team
- Knowledge of software test practices, software engineering, and Cloud Technologies discipline
- Knowledge/Working experience with Static Code Analysis, License Check Tools, and other Development Process Improvement Tools

Desired Qualifications:
- Minimum 4 years of working experience in AWS, Kubernetes, Helm, Docker-related technologies
- Providing visibility into cloud spending and usage across the organization
- Generating and interpreting reports on cloud expenditure, resource utilization, and usage optimization
- Network Fundamentals: AWS VPC, AWS VPN, Firewalls, and ingress/egress Patterns
- Knowledge/Experience with embedded Linux and RTOS (e.g. ThreadX, FreeRTOS) development on ARM based projects
- Domain Knowledge on Cellular wireless and WiFi is an asset
- Knowledge of distributed systems, networking, AMQP/MQTT, Linux, cloud security, and Python

Posted 3 days ago


0.0 - 3.0 years

0 Lacs

chandigarh

On-site

As a Python Backend Developer at Lookfinity, you will be a part of the Backend Engineering team that is focused on building scalable, data-driven, and cloud-native applications to solve real-world business problems. We are dedicated to maintaining clean architecture, enhancing performance, and designing elegant APIs. Join our dynamic team that is enthusiastic about backend craftsmanship and modern infrastructure. You will be working with a tech stack that includes languages and frameworks such as Python, FastAPI, and GraphQL (Ariadne), databases like PostgreSQL, MongoDB, and ClickHouse, messaging and task queues such as RabbitMQ and Celery, cloud services like AWS (EC2, S3, Lambda), Docker, Kubernetes, data processing tools like Pandas and SQL, and monitoring and logging tools like Prometheus and Grafana. Additionally, you will be utilizing version control systems like Git, GitHub/GitLab, and CI/CD tools. Your responsibilities will include developing and maintaining scalable RESTful and GraphQL APIs using Python, designing and integrating microservices with databases, writing clean and efficient code following best practices, working with Celery & RabbitMQ for async processing, containerizing services using Docker, collaborating with cross-functional teams, monitoring and optimizing application performance, participating in code reviews, and contributing to team knowledge-sharing. We are looking for candidates with 6 months to 1 year of hands-on experience in backend Python development, a good understanding of FastAPI or willingness to learn, basic knowledge of SQL and familiarity with databases like PostgreSQL and/or MongoDB, exposure to messaging systems like RabbitMQ, familiarity with cloud platforms like AWS, understanding of Docker and containerization, curiosity towards learning new technologies, clear communication skills, team spirit, and appreciation for clean code. 
Additional experience with GraphQL APIs, Kubernetes, data pipelines, CI/CD processes, and observability tools is considered a bonus. In this role, you will have the opportunity to work on modern backend systems, receive mentorship, and have technical growth plans tailored to your career goals. This is a full-time position with a day shift schedule located in Panchkula. Join us at Lookfinity and be a part of our innovative team dedicated to backend development.

Posted 3 days ago


5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

As a Network Operations Center (NOC) Analyst at Inspire Brands, you will be responsible for overseeing all technology aspects of the organization. Your primary role will involve acting as the main technology expert for the NOC team, ensuring the detection and resolution of issues in production before they impact the large scale operations. It will be your duty to guarantee that the services provided by the Inspire Digital Platform (IDP) meet user needs in terms of reliability, uptime, and continuous improvement. Additionally, you will play a crucial role in ensuring an outstanding customer experience by establishing service level agreements that align with the business model. In the technical aspect of this role, you will be required to develop and monitor various monitoring dashboards to identify problems related to applications, infrastructure, and potential security incidents. Providing operational support for multiple large, distributed software applications will be a key responsibility. Your deep troubleshooting skills will be essential in enhancing availability, performance, and security to ensure 24/7 operational readiness. Conducting thorough postmortems on production incidents to evaluate business impact and facilitate learning for the Engineering team will also be part of your responsibilities. Moreover, you will create dashboards and alerts for monitoring the platform, define key metrics and service level indicators, and ensure the collection of relevant metric data to create actionable alerts for the responsible teams. Participation in the 24/7 on-call rotation and automation of tasks to streamline application deployment and third-party tool integration will be crucial. Analyzing major incidents, collaborating with other teams to find permanent solutions, and establishing and publishing regular KPIs and metrics for measuring performance, stability, and customer satisfaction will also be expected from you. 
In terms of qualifications, you should hold a 4-year degree in computer science, Information Technology, or a related field. You should have a minimum of 5 years of experience in a production support role, specifically supporting large-scale SaaS production B2C or B2B cloud platforms, with a strong background in problem-solving and troubleshooting. Additionally, you should possess knowledge and skills in various technologies such as Java, TypeScript, Python, Azure Cloud services, monitoring tools like Splunk and Prometheus, containers, Kubernetes, Helm, Cloud networking, Firewalls, and more. Overall, this role requires strong technical expertise, effective communication skills, and a proactive approach to ensuring the smooth operation of Inspire Brands' technology infrastructure.

Posted 3 days ago


6.0 - 10.0 years

0 Lacs

hyderabad, telangana

On-site

The role of HV Product at Hitachi Vantara is pivotal in the development of the VSP 360 platform's on-premises solution, ensuring strict adherence to delivery objectives. The VSP 360 platform is the cornerstone of the organization's management solution strategy. As a member of our global team, you will play a key role in empowering businesses to automate, optimize, innovate, and wow their customers with high-performance data infrastructure. To excel in this role, you should possess a Bachelor's degree in computer science or a related field, along with 6+ years of experience in DevOps or a related field. Your strong experience with cloud-based services, running Kubernetes as a service, managing Kubernetes clusters, and infrastructure automation and deployment tools such as Terraform, Ansible, Docker, Jenkins, GitHub, and GitHub Actions will be vital in driving the success of the VSP 360 platform. Additionally, your familiarity with monitoring tools like Grafana, Nagios, ELK, OpenTelemetry, Prometheus, Anthos/Istio Service Mesh, Cloud Native Computing Foundation (CNCF) projects, Kubernetes Operators, KeyCloak, and Linux systems administration will be highly beneficial. It would be advantageous to have proficiency in Python, Django, AWS solution design, cloud-based storage (S3, Blob, Google Storage), and storage area networks (SANs). At Hitachi Vantara, we value diversity, equity, and inclusion, as they are integral to our culture and identity. We encourage individuals from all backgrounds to apply, as we believe that diverse thinking and a commitment to allyship lead to powerful results. As part of our team, you will be supported with industry-leading benefits, services, and flexible arrangements to ensure your holistic health and wellbeing. We champion life balance and offer autonomy, freedom, and ownership in your work. Join us in co-creating meaningful solutions to complex challenges and becoming a data-driven leader that positively impacts industries and society. 
If you are passionate about innovation and believe in inspiring the future, Hitachi Vantara is the place for you to fulfill your purpose and reach your full potential.

Posted 3 days ago


3.0 - 7.0 years

5 - 9 Lacs

Chennai

Work from Office

A globally focused shipping and terminal organisation, Hapag-Lloyd is continuing to drive an ambitious and complex change and transformational program to modernize the applications enabling the digital journey of its customers. Hapag-Lloyd's strategy depends on a successful digital transformation. As such, the business is building competitive advantage through technology and digitising interaction with customers and its core operational processes. Against this backdrop, the Global One IT has a mandate from the executive board to lead the technology transformation of Hapag-Lloyd. This role is based in Chennai, which has just recently been established as one of 3 global IT hubs for Hapag-Lloyd and will be the global IT development hub. The GPL is required to play a crucial role in helping Hapag-Lloyd to achieve its goals by successfully innovating and creating world-class solutions while optimizing the costs as efficiently as possible.

Our international team takes care of the architecture, design and implementation of central and business-critical platforms for API handling, system integration and automation of rule-based business decisions. Advising our specialist departments and IT product teams is an essential part in that context. We advise and support the implementation of new use cases and the optimization of existing solutions. In addition, we work together with service providers for operation and support to guarantee the highest grade of reliability. We see your role as IT operations engineer as a flexible all-rounder and dynamic team player who enjoys, and is keen on, operating and optimizing our cloud-based platforms. We offer a variety of tasks in expanding and improving our solution, from design to implementation and management of operations-related challenges. As part of a powerful team, you can contribute to parts of Hapag-Lloyd's critical core systems to make a difference.

Monitoring & Observability:
- Continuous observation of our systems regarding availability, performance, system usage and costs
- Definition, design and implementation of observability / monitoring regarding Service Levels (SLIs / SLOs / SLAs)
- Integration in central observability solutions, e.g. Datadog, Elastic, …
- Reporting of availability, performance, system usage and costs on a regular basis

Maintenance:
- Planning, coordination and implementation of system updates in collaboration with our vendors and suppliers
- Take care of keeping our system secure by fixing vulnerabilities in collaboration with our CISO department
- Take care of housekeeping tasks

Automation:
- Drive automation regarding paradigms like CaC/IaC (Configuration as Code / Infrastructure as Code) to ensure the lowest possible degree of error-prone manual work
- Optimize our CI/CD pipeline

Incident & Problem Management:
- Take over responsibility of coordinating and solving incidents to keep "Mean-Time-To-Repair" and user impact as low as possible
- Drive and support problem management to ensure system reliability and prevent reoccurring incidents

Service Management:
- Take over responsibility of service request handling

Continuous Improvement:
- Driving continuous improvement of our platform with regard to scalability, reliability and cost-efficiency

Behaviours & Approach:
- Strong analytical and problem-solving skills
- Team-oriented with excellent communication and collaboration skills
- Ability to build pro-active, co-operative working relationships with customers, peers and key stakeholders based on respect and teamwork
- Ability to act under pressure and to manage crisis situations efficiently
- Able to evaluate information, identify key issues and formulate conclusions based on sound, practical judgment, experience, and common sense

Work Experience:
- Extensive experience in operations of business-critical and cloud-based platforms (monitoring, maintenance, improvement, troubleshooting, …) on an enterprise scale
- Extensive experience with AWS cloud and container runtimes like ROSA (Red Hat OpenShift on AWS)
- Good knowledge in end-to-end monitoring of applications and systems with enterprise observability tools (e.g. Datadog, Elastic, Prometheus, Grafana)
- Experience with automation tools such as Terraform or Ansible is an advantage
- Experience in software development and the tools used, such as version management, CI/CD, planning and collaboration tools (e.g. Git, Jenkins, Jira, Confluence)
- Excellent communication, problem-solving, and stakeholder management skills

Education & Qualifications:
- Bachelor's or Master's degree in computer science, Engineering, or related discipline
- English language expert proficiency (additional languages are beneficial)

What We Offer:
- Competitive salary
- Self & Family Health Insurance
- Term & Life Insurance
- OPD Benefits
- Employees' Deposit Linked Insurance Scheme (EDLI)
- Learning & Development through HL Academy
- Flexible Work from Home
- Leave Travel Allowance
- Variable performance bonus
- Recreation facilities
- Privilege, Casual and Sick leaves

Posted 4 days ago

Apply

4.0 - 8.0 years

9 - 14 Lacs

Chennai

Work from Office

Job Description. Immediate: A deep understanding of observability, Dynatrace preferably (or other tools if they are well versed). Provisioning and setup of metrics in any observability tool (Dynatrace, Prometheus, Thanos, or Grafana), alerts and silences. Development work (not just support and running scripts but actual development) done on: Chef (basic syntax, recipes, cookbooks) or Ansible (basic syntax, tasks, playbooks) or Terraform (basic syntax), and GitLab CI/CD (configuration, pipelines, jobs). Proficiency in scripting (Python, PowerShell, Bash, etc.); this becomes the enabler for automation. Proposes ideas and solutions within the Infrastructure Department to reduce the workload through automation. Cloud resources provisioning and configuration through CLI/API, especially Azure and GCP; AWS experience is also ok. Troubleshooting: SRE approach, SRE mindset. Provides emergency response either by being on-call or by reacting to symptoms according to monitoring, and escalation when needed. Improves documentation all around, either in application documentation or in runbooks, explaining the why, not stopping with the what. Root cause analysis and corrective actions. Strong concepts around scale and redundancy for design, troubleshooting, implementation. Mid Term: Kubernetes (basic understanding, CLI, service re-provisioning). Operating system (Linux): configuration, package management, startup and troubleshooting. System Architecture & Design: plan, design and execute solutions to reach specific goals agreed within the team. Long Term: Block and object storage configuration. Networking: VPCs, proxies and CDNs. At DXC Technology, we believe strong connections and community are key to our success. Our work model prioritizes in-person collaboration while offering flexibility to support wellbeing, productivity, individual work styles, and life circumstances. We're committed to fostering an inclusive environment where everyone can thrive. Recruitment fraud is a scheme in which fictitious job opportunities are offered to job seekers, typically through online services, such as false websites, or through unsolicited emails claiming to be from the company. These emails may request recipients to provide personal information or to make payments as part of their illegitimate recruiting process. DXC does not make offers of employment via social media networks, and DXC never asks for any money or payments from applicants at any point in the recruitment process, nor asks a job seeker to purchase IT or other equipment on our behalf. More information on employment scams is available here.

Posted 4 days ago

Apply

7.0 - 12.0 years

40 - 70 Lacs

Bengaluru

Work from Office

Hiring Now: Technical Program Manager & Agile Coach / Scrum Master. Location: Bangalore, India. Company: HiLabs, innovating healthcare data using cutting-edge AI & SaaS. Role: Lead global scrum teams focused on software & data engineering. Responsibilities: Drive cross-functional program execution & Agile coaching. Agile Expertise: Scrum, SAFe, sprint planning, backlog refinement. Tools: Proficient in JIRA for workflow, backlog, and metrics tracking. Program Management: Define scope, timelines, risk & dependency management. Agile Improvement: Conduct training, workshops, & retrospectives to boost team maturity. Collaboration: Work with Product Managers, Engineers & stakeholders for smooth delivery. Experience Required: 5+ years Scrum/Agile, JIRA, customer-facing role. Certifications: Certified Scrum Master (CSM) mandatory; SAFe preferred. Educational Background: Engineering degree required; Masters/MBA a plus. Domain Knowledge: Experience or interest in US healthcare domain is advantageous. Perks: Competitive salary, stock options, professional development, and great work culture. Interested? Apply now to join HiLabs and be part of transforming healthcare data! #Hiring #AgileCoach #ScrumMaster #TechnicalProgramManager #BangaloreJobs #HealthcareTech #HiLabs Join HiLabs as a Scrum Master shaping healthcare AI solutions. Lead agile teams to improve healthcare data quality and reduce costs. Work with cutting-edge explainable AI and healthcare ontologies. Collaborate with healthcare experts, data scientists, and engineers. Help develop and implement HiLabs' core platform, MCheck. Be a part of a team that harnesses advanced AI, ML, and big data technologies to develop a cutting-edge healthcare technology platform, delivering innovative business solutions.
Job Title: Technical Project Manager (TPM) - Scrum Master & Agile Coach. Job Location: Bangalore, India. Job summary: We are seeking a highly skilled Technical Project Manager (TPM) with strong hands-on experience in full-stack development and cloud infrastructure to lead the successful planning, execution, and delivery of technical projects. The ideal candidate will have a strong background in React, Java, Spring Boot, Python, and AWS, and will work closely with cross-functional teams including developers, QA, DevOps, and product stakeholders. As a TPM, you will play a critical role in bridging technical and business objectives, ensuring timelines, quality, and scalability across complex software projects. Responsibilities: Own and drive the end-to-end lifecycle of technical projects, from initiation to deployment and post-launch support. Collaborate with development teams and stakeholders to define project scope, goals, deliverables, and timelines. Act as a hands-on contributor when needed, with the ability to guide and review code and architecture decisions. Coordinate cross-functional teams across front-end (React), back-end (Java/Spring Boot, Python), and AWS cloud infrastructure. Manage risk, change, and issue resolution in a fast-paced agile environment. Ensure projects follow best practices around version control, CI/CD, testing, deployment, and monitoring. Deliver detailed status updates, sprint reports, and retrospectives to leadership and stakeholders. Desired Profile: Strong hands-on expertise in React, Java & Spring Boot, and Python. Extensive experience with AWS services such as EC2, S3, Lambda, CloudWatch, and others. Proven ability to lead agile/Scrum teams with a solid understanding of the software development lifecycle (SDLC). Excellent communication, organizational, and interpersonal skills to collaborate effectively with diverse teams and stakeholders. Preferred Qualifications: Experience designing and managing microservices architectures.
Familiarity with messaging systems like Kafka or equivalent platforms. Knowledge of CI/CD pipelines, deployment strategies, and application monitoring tools such as Prometheus, Grafana, and CloudWatch. Practical experience with containerization tools like Docker and orchestration platforms such as Kubernetes.

Posted 4 days ago

Apply

6.0 - 10.0 years

9 - 13 Lacs

Bengaluru

Work from Office

As an employee at Thomson Reuters, you will play a role in shaping and leading the global knowledge economy. Our technology drives global markets and helps professionals around the world make decisions that matter. As the world's leading provider of intelligent information, we want your unique perspective to create the solutions that advance our business and your career. Our Service Management function is transforming into a truly global, data and standards-driven organization, employing best-in-class tools and practices across all disciplines of Technology Operations. About the Role: In this opportunity as Senior Service Reliability Engineer - Global Command Center, you will: Run the production environment by monitoring availability and taking a holistic view of system health. Build software and systems to manage platform infrastructure and applications. Improve reliability, quality, and time-to-market of our suite of software solutions. Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve. Provide primary operational support and engineering for multiple large distributed software applications. About You: You're fit for the role of Senior Service Reliability Engineer - Global Command Center if your background includes: 6 - 10 years of experience. Good understanding of Unix/Linux and Windows administration. Experience in working on any public cloud like AWS or Azure. Proficiency in the following general areas: Java (Java 1.7/Java 1.8), JavaScript, Python, Jenkins, MSTFS/ADO and/or GitHub. Experience providing technical support to enterprise networks. Good understanding of database technologies. Programming/scripting languages such as Python, Perl, PowerShell, Java / J2EE. Experience working with logging tools (e.g. Logstash and/or Kibana) and monitoring tools like Datadog Security.
Hands-on experience in implementing a DevOps pipeline using Jenkins and the AWS CI/CD tool sets. A proactive approach to spotting problems, areas for improvement, and performance bottlenecks. What's in it For You? Hybrid Work Model: We've adopted a flexible hybrid working environment (2-3 days a week in the office depending on the role) for our office-based roles while delivering a seamless experience that is digitally and physically connected. Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, whether caring for family, giving back to the community, or finding time to refresh and reset. This builds upon our flexible work arrangements, including work from anywhere for up to 8 weeks per year, empowering employees to achieve a better work-life balance. Career Development and Growth: By fostering a culture of continuous learning and skill development, we prepare our talent to tackle tomorrow's challenges and deliver real-world solutions. Our Grow My Way programming and skills-first approach ensures you have the tools and knowledge to grow, lead, and thrive in an AI-enabled future. Industry Competitive Benefits: We offer comprehensive benefit plans to include flexible vacation, two company-wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs, and resources for mental, physical, and financial wellbeing. Culture: Globally recognized, award-winning reputation for inclusion and belonging, flexibility, work-life balance, and more. We live by our values: Obsess over our Customers, Compete to Win, Challenge (Y)our Thinking, Act Fast / Learn Fast, and Stronger Together. Social Impact: Make an impact in your community with our Social Impact Institute.
We offer employees two paid volunteer days off annually and opportunities to get involved with pro-bono consulting projects and Environmental, Social, and Governance (ESG) initiatives. Making a Real-World Impact: We are one of the few companies globally that helps its customers pursue justice, truth, and transparency. Together, with the professionals and institutions we serve, we help uphold the rule of law, turn the wheels of commerce, catch bad actors, report the facts, and provide trusted, unbiased information to people all over the world. Thomson Reuters informs the way forward by bringing together the trusted content and technology that people and organizations need to make the right decisions. We serve professionals across legal, tax, accounting, compliance, government, and media. Our products combine highly specialized software and insights to empower professionals with the data, intelligence, and solutions needed to make informed decisions, and to help institutions in their pursuit of justice, truth, and transparency. Reuters, part of Thomson Reuters, is a world-leading provider of trusted journalism and news. We are powered by the talents of 26,000 employees across more than 70 countries, where everyone has a chance to contribute and grow professionally in flexible work environments. At a time when objectivity, accuracy, fairness, and transparency are under attack, we consider it our duty to pursue them. Sound exciting? Join us and help shape the industries that move society forward. As a global business, we rely on the unique backgrounds, perspectives, and experiences of all employees to deliver on our business goals.
To ensure we can do that, we seek talented, qualified employees in all our operations around the world regardless of race, color, sex/gender, including pregnancy, gender identity and expression, national origin, religion, sexual orientation, disability, age, marital status, citizen status, veteran status, or any other protected classification under applicable law. Thomson Reuters is proud to be an Equal Employment Opportunity Employer providing a drug-free workplace. We also make reasonable accommodations for qualified individuals with disabilities and for sincerely held religious beliefs in accordance with applicable law. More information on requesting an accommodation here. Learn more on how to protect yourself from fraudulent job postings here. More information about Thomson Reuters can be found on thomsonreuters.com.

Posted 4 days ago

Apply

9.0 - 11.0 years

10 - 20 Lacs

Pune

Work from Office

We are seeking a highly skilled and experienced Senior/Lead SDET to join our team. The ideal candidate will have a strong background in test automation using Python, API testing with Karate, and UI automation with Selenium. You will play a key role in ensuring the quality and reliability of our software products by designing and implementing robust automated test frameworks and strategies. Key Responsibilities: Design, develop, and maintain scalable and reusable test automation frameworks using Python. Build and execute API test suites using Karate DSL for RESTful services and microservices. Develop and maintain end-to-end UI automation using Selenium with JavaScript/TypeScript. Define and implement test strategies, test plans, and test cases for new and existing features. Perform root cause analysis of test failures and drive issues to resolution. Required Skills & Experience: 8+ years of experience in software testing and automation. Strong programming skills in Python and experience with Pytest or similar frameworks. Hands-on experience with Karate DSL for API testing. Proficiency in Selenium for UI automation (JavaScript/TypeScript). Familiarity with Docker, Kubernetes, and cloud platforms (AWS) is a plus. Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. Preferred Qualifications: Experience with performance testing tools (e.g., JMeter, Gatling). Knowledge of BDD frameworks like Cucumber. Exposure to monitoring tools like Grafana, Prometheus.

Posted 4 days ago

Apply

6.0 - 11.0 years

10 - 20 Lacs

Pune

Hybrid

We are hiring "GCP DevOps" for one of our "IT Services & Consulting" MNC clients. Exp: 6+ Years. Mode: Permanent. Location: Pune. Skills: Cloud Infrastructure & Automation: Design and implement scalable, cloud-native solutions on Google Cloud Platform (GCP). Develop and manage Infrastructure as Code (IaC) using tools like Terraform, Ansible, or CloudFormation. Automate CI/CD pipelines using Jenkins, GitLab CI, or Travis CI. Containerization & Orchestration: Implement Docker containers and manage orchestration with Kubernetes. Monitoring & Logging: Use tools like Prometheus, Grafana, and ELK stack for system monitoring and logging. Security & Optimization: Strengthen cloud security practices and optimize services across GCP, AWS, and Azure.

Posted 4 days ago

Apply

10.0 - 15.0 years

25 - 30 Lacs

Bengaluru

Work from Office

We are seeking a highly skilled Information Security Specialist to join our team. The ideal candidate will have extensive experience in addressing client queries related to product security, AI security, and cloud security (AWS and Azure). This role requires a proactive approach to identifying and mitigating security risks, as well as excellent communication skills to effectively interact with clients. Key Responsibilities: Good and detailed understanding of Azure and AWS services provisioning, architecture and security recommendations. Respond to client queries regarding product security, AI security, and cloud security (AWS and Azure). Develop and implement security policies, protocols, and procedures. Conduct regular security audits and assessments to identify vulnerabilities. Collaborate with the product development team to ensure security best practices are integrated into the product lifecycle. Monitor and analyze security incidents to determine root causes and implement corrective actions. Stay updated with the latest security trends, threats, and technologies. Provide training and guidance to internal teams on security best practices. Coordinate with the internal InfoSec team for timely deliverables, as required. Hands-on experience with Azure and AWS cloud services and end-to-end application provisioning on cloud. Key Performance Indicators (KPIs): Client Query Response Time: Ensure all client queries related to security are addressed within 24 hours. Incident Resolution Time: Resolve security incidents within the defined SLA (Service Level Agreement). Security Audit Compliance: Achieve a compliance rate of 95% or higher in all security audits. Client Satisfaction: Maintain a client satisfaction score of 90% or higher for security-related queries and support. Training Effectiveness: Conduct quarterly security training sessions with an average feedback score of 4.5/5. Cloud Architecture: Ensure secure hosting of the product in the cloud environment.
Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. Minimum of 10-15 years of experience in information security, with a focus on AI security and cloud security (AWS and Azure). Relevant certifications such as CISSP, CISM, or AWS Certified Security Specialty. Strong understanding of security frameworks and standards (e.g., ISO 27001, NIST). Excellent problem-solving skills and attention to detail. Strong communication and interpersonal skills. Nice to have: Exposure to financial research domain. Industry-recognized certification programs on Data Management/Cloud etc. Experience with JIRA, Confluence. Understanding of Scrum and Agile methodologies. Experience with data visualization tools, such as Grafana, GGplot, etc. Soft skills: Oral and written communication skills. Good problem-solving and negotiation skills. Intellectual curiosity to find new and unusual ways to solve data management issues. Passionate about the work and attention to detail.

Posted 4 days ago

Apply

6.0 - 10.0 years

25 - 30 Lacs

Bengaluru

Work from Office

6 to 10 years of experience as Machine Learning Researcher or Data Scientist. Graduate in Engineering or Technology along with good business skills. Good applied statistics skills, such as distributions, statistical testing, regression, etc. Excellent understanding of machine learning techniques and algorithms, including knowledge about LLMs. Experience with NLP. Good scripting and programming skills in Python. Basic understanding of NoSQL databases, such as MongoDB, Cassandra. Nice to have: Exposure to financial research domain. Experience with JIRA, Confluence. Understanding of Scrum and Agile methodologies. Experience with data visualization tools, such as Grafana, GGplot, etc. Soft skills: Oral and written communication skills. Good problem-solving and negotiation skills. Passion, curiosity and attention to detail.

Posted 4 days ago

Apply

2.0 - 7.0 years

3 - 8 Lacs

Hyderabad

Work from Office

Role & responsibilities • Application Support and Production Support skills. • Proven experience in production support roles, focusing on batch processing and overnight support. • Strong proficiency in Linux and Windows operating systems. • Experience in SQL querying and database management. • Experience with batch scheduling tools (e.g., Cron, Control-M, Autosys) is preferred. • Experience with monitoring tools (e.g., Nagios, Grafana) is preferred. • Working knowledge of scripting languages such as Shell scripting, Perl, or Python. • Strong communication skills. Preferred candidate profile • Monitor and manage overnight batch processes to ensure timely completion and accuracy. • Investigate and resolve batch failures, escalating issues as necessary to ensure prompt resolution. • Develop and implement scripts (Shell scripting, Python) to automate monitoring tasks and data collection. • Perform routine system checks and maintenance tasks during non-business hours. • Proactively monitor applications and batch jobs using dedicated monitoring tools and dashboards. • Provide support for Linux and Windows environments, including troubleshooting and system administration tasks. • Collaborate with development and infrastructure teams to implement solutions and enhancements to batch processes. • Document and maintain standard operating procedures (SOPs) for batch support activities. • Participate in on-call rotation schedules and respond to production incidents as needed. • Work within a rotational night shift. • Document all monitoring activities, including identified issues, resolutions, and root cause analysis findings.

Posted 4 days ago

Apply

4.0 - 9.0 years

7 - 17 Lacs

Hyderabad

Work from Office

Experience in infrastructure/platform engineering. Strong hands-on skills in Windows, Linux, Bash scripting and PowerShell. Experience with CI/CD pipelines and deployment automation. Proficiency in tools such as Ansible, Jenkins, Azure DevOps, Git. Experience with log aggregation and monitoring (e.g., ELK, Grafana, Prometheus). Comfortable supporting enterprise-grade applications in a financial services environment. Preferred Skills: Exposure to cloud platforms like AWS (especially EC2, S3, IAM, CloudWatch). Familiarity with application support tools and release pipelines. SQL knowledge and ability to work with DB teams for performance tuning. Prior experience working with geographically distributed teams. Role & responsibilities / Preferred candidate profile: • Maintain and enhance platform infrastructure across Linux and Windows environments. • Develop scripts to automate system monitoring, deployment, and recovery processes. • Troubleshoot and resolve environment-level issues impacting application performance. • Build and manage CI/CD pipelines using tools like Jenkins, Azure DevOps, or GitHub Actions. • Collaborate with development, support, and cloud teams to ensure high platform availability. • Support and automate tasks like patching, environment readiness, and DR test setups. • Work with DBAs, application teams, and product vendors to resolve infra-related performance bottlenecks. • Document processes, create knowledge articles, and ensure knowledge continuity.

Posted 4 days ago

Apply

5.0 - 7.0 years

13 - 17 Lacs

Bengaluru

Work from Office

As part of the Data and Technology Services practice, you will be responsible for configuring and customizing enterprise Master Data Management (MDM) platforms such as Semarchy and EnterWorks for major global financial services clients, developing integration components using Java, and supporting data quality and governance initiatives through SQL-based validation and transformation. Global technology megatrends, regulatory developments, the competitive landscape and the rise of alternative data are changing the way capital markets operate. At the Data and Technology Services operations, we partner with some of the largest financial services firms and corporations in this transformative journey. We are looking for professionals who are hands-on, passionate about the work and have the ambition to drive disruptive changes to global business models. Desired skills and experience: B.Tech / MCA in Computer Science from a reputed college/university. Hands-on experience configuring and customizing Semarchy xDM and/or EnterWorks platforms. Understanding of MDM concepts such as data modeling, match/merge logic, data stewardship, and golden record creation. Proficiency in Java (mid-level) for developing custom extensions, APIs, and integration components. Working knowledge of SQL for querying, validating, and transforming data. Knowledge of EKS/ECS/Kubernetes and DevSecOps with observability. Key Responsibilities: Configure and customize Semarchy xDM and/or EnterWorks platforms to support master data domains such as Customer, Product, Loans and Borrowers. Develop and maintain data integration workflows, validation rules, match/merge logic, and user interfaces within the MDM platform. Collaborate with data architects and business analysts to translate requirements into technical configurations. Translate client challenges into actionable solutions by partnering with the subject matter experts.
Write and maintain Java-based extensions, APIs, and services to support MDM workflows and integrations. Work with integration tools and APIs to connect MDM platforms with source and target systems. Assist in building and maintaining data pipelines to ingest, transform, and publish master data across systems. Use SQL to query, validate, and troubleshoot data issues across source and target systems. Assist in writing stored procedures or scripts for data transformation and reporting. Exposure to data governance and data quality initiatives. Support deployment, testing, and documentation of MDM solutions in collaboration with DevOps and QA teams. Leverage container orchestration platforms such as Amazon EKS, ECS, or Kubernetes, and apply DevSecOps practices with observability tools (e.g., Prometheus, Grafana, CloudWatch) to ensure secure, scalable, and well-monitored deployments. Knowledge of container orchestration platforms such as Amazon EKS, ECS, or Kubernetes, along with familiarity in DevSecOps practices and observability tools (e.g., Prometheus, Grafana, CloudWatch) is highly desirable. Expertise in understanding the technical and functional designs of the databases. Ensure meticulous documentation of processes, metadata, query logic, and results. Required Qualifications: 4+ years of experience in software development or data engineering roles. Hands-on experience with Semarchy xDM and/or EnterWorks MDM platforms. Proficiency in Java (mid-level) for backend development and integration tasks. Working knowledge of SQL for data analysis, validation, and transformation. Understanding of MDM concepts, including data stewardship, golden record creation, and hierarchy management. Familiarity with REST APIs, JSON, XML, and integration patterns.
Behavioral Competencies: Efficiently lead client calls on a daily basis. Be proactive and independent, able to work on projects with minimum inputs from senior stakeholders. Evaluate and ensure quality of deliverables within defined timelines. Prior experience in the financial services domain is highly desirable. Excellent verbal and written communication skills with a collaborative mindset.

Posted 4 days ago

Apply

6.0 - 11.0 years

9 - 13 Lacs

Bengaluru

Work from Office

We are looking for a Senior Cloud Operations DBA to manage, optimize, and ensure the reliability of cloud-based databases in a 24/7 production environment. The ideal candidate will have strong experience in AWS RDS, PostgreSQL, MySQL, and NoSQL databases, with a focus on performance tuning, high availability, backup strategies, and disaster recovery. Key Responsibilities Manage, monitor, and maintain cloud-based databases for high availability, security, and performance. Analyze and optimize database queries, indexing, and configuration for better efficiency. Implement and maintain robust backup and disaster recovery strategies. Automate repetitive database operations using scripting languages (Python, Shell, SQL). Ensure database compliance with ISO 27001, SOC 2, and other security/audit requirements. Troubleshoot and resolve issues, collaborating with CloudOps, DevOps, and Engineering teams. Plan and execute database upgrades, schema migrations, and replication strategies. Set up and manage proactive monitoring using Grafana, Prometheus, CloudWatch, and Splunk. Requirements 6+ years of experience in database administration, with a strong cloud-based background. Hands-on experience with AWS RDS (PostgreSQL, MySQL, DynamoDB). Proficient in SQL performance tuning, indexing, and debugging. Experience with Infrastructure as Code (IaC) tools like Terraform for database management. Strong scripting skills in Python, Bash, or PowerShell. Willingness to work APAC and EMEA shifts with on-call rotation. Expertise in high availability, clustering, and replication technologies. Understanding of cloud networking, IAM roles, and security best practices. Excellent troubleshooting and problem-solving abilities in cloud environments. Preferred Skills Experience with NoSQL databases (MongoDB, DynamoDB, Cassandra). Exposure to Kubernetes, containerization, and serverless architecture. Experience integrating databases with CI/CD pipelines and DevOps workflows. 
Knowledge of observability tools for database monitoring and proactive alerting. Cloud cost optimization and advanced performance tuning skills. Familiarity with incident response and resolution best practices.

Posted 4 days ago

Apply

12.0 - 15.0 years

7 - 11 Lacs

Bengaluru

Work from Office

This role combines leadership in managing cloud infrastructure with customer-focused incident response in a SaaS environment. The ideal candidate has a strong background in AWS cloud platforms, containerized workloads, and leading customer support teams. You'll also act as the primary escalation point for infrastructure and application performance issues. Cloud Operations: Ensure 99.9%+ uptime for AWS-hosted SaaS platforms. Manage and maintain cloud infrastructure, including incident response and disaster recovery planning. Collaborate with DevOps, Engineering, IT, and Security teams to deploy, monitor, and optimize services. Proactively resolve issues related to infrastructure and application scalability and reliability. Establish strong operational practices: incident management, root cause analysis, and preventive action planning. Technical Support: Lead a support operations team focused on infrastructure and application-related technical issues. Act as the point of escalation for complex, high-priority customer incidents. Ensure SLAs and KPIs are met or exceeded. Continuously improve support processes: ticket handling, escalation paths, and customer responsiveness. Work closely with Customer Success and Professional Services for a unified customer experience. Leadership and Strategy: Manage, mentor, and grow a team of support engineers and cloud operations specialists. Continuously assess and improve tooling, operational processes, and technologies. Provide regular operations updates to senior leadership, highlighting KPIs and key trends. Translate business and customer needs into operational improvements. Qualifications Required: Bachelor's degree in Computer Science, IT, or a related field, or equivalent experience. 12+ years of relevant experience, including 3+ years in a managerial role. Expertise in AWS and SaaS architecture. Hands-on experience with monitoring tools (Datadog, Prometheus, Grafana, etc.)
and incident management systems (ServiceNow, Zendesk, PagerDuty, Opsgenie). Proficient in SQL and experience with databases. Strong understanding of DevOps, CI/CD, and infrastructure-as-code (Terraform, Ansible). Proven track record of achieving high uptime, SLA adherence, and customer satisfaction. Experience managing 24x7 cloud operations in remote or hybrid environments. Strong problem-solving skills and ability to thrive in high-pressure situations. Excellent communication skills across technical and non-technical stakeholders. Willingness to work in APAC and EMEA time zones. Preferred Certifications AWS Professional Certifications Linux System Administration Certifications ITIL Certifications Kubernetes Administrator Certifications What We Offer Comprehensive health and wellness plans Paid time off and company holidays Shift allowances Flexible and remote-friendly work options

Posted 4 days ago

Apply

1.0 - 6.0 years

8 - 13 Lacs

Pune

Work from Office

Cloud Observability Administrator
Pune, India. Enterprise IT - 22685
Learn more about our diversity, equity, and inclusion efforts and the networks ZS supports to assist our ZSers in cultivating community spaces, obtaining the resources they need to thrive, and sharing the messages they are passionate about.
ZS is looking for a Cloud Observability Administrator to join our team in Pune. As a Cloud Observability Administrator, you will configure various observability tools and create solutions to address business problems across multiple client engagements. You will leverage information from the requirements-gathering phase and use past experience to design a flexible and scalable solution, and collaborate with other team members (involved in the requirements-gathering, testing, roll-out, and operations phases) to ensure seamless transitions.
What You'll Do: Deploying, managing, and operating scalable, highly available, and fault-tolerant Splunk architecture. Onboarding various kinds of log sources such as Windows, Linux, firewalls, and network devices into Splunk. Developing alerts, dashboards, and reports in Splunk. Writing complex SPL queries. Managing and administering a distributed Splunk architecture. Very good knowledge of the configuration files used in Splunk for data ingestion and field extraction. Performing regular upgrades of Splunk and relevant apps/add-ons. A comprehensive understanding of AWS infrastructure, including EC2, EKS, VPC, CloudTrail, Lambda, etc. Automating manual tasks using Shell/PowerShell scripting. Knowledge of Python scripting is a plus. Good knowledge of Linux commands for server administration.
What You'll Bring: 1+ years of experience in Splunk development and administration. Bachelor's degree in CS, EE, or a related discipline. Strong analytic, problem-solving, and programming ability. 1-1.5 years of relevant consulting-industry experience working on medium-to-large-scale technology solution delivery engagements. Strong verbal, written, and team presentation communication skills. Strong verbal and written communication skills, with the ability to articulate results and issues to internal and client teams. Proven ability to work creatively and analytically in a problem-solving environment. Ability to work within a virtual global team environment and contribute to the overall timely delivery of multiple projects. Knowledge of observability tools such as Cribl, Datadog, and PagerDuty is a plus. Knowledge of AWS Prometheus and Grafana is a plus. Knowledge of APM concepts is a plus. Knowledge of Linux/Python scripting is a plus. Splunk certification is a plus.
Perks & Benefits: ZS offers a comprehensive total rewards package including health and well-being, financial planning, annual leave, personal growth, and professional development. Our robust skills development programs, multiple career progression options, internal mobility paths, and collaborative culture empower you to thrive as an individual and global team member. We are committed to giving our employees a flexible and connected way of working. A flexible and connected ZS allows us to combine work from home and on-site presence at clients/ZS offices for the majority of our week. The magic of ZS culture and innovation thrives in both planned and spontaneous face-to-face connections.
Travel: Travel is a requirement at ZS for client-facing ZSers; business needs of your project and client are the priority. While some projects may be local, all client-facing ZSers should be prepared to travel as needed.
Travel provides opportunities to strengthen client relationships, gain diverse experiences, and enhance professional growth by working in different environments and cultures.
Considering applying? At ZS, we're building a diverse and inclusive company where people bring their passions to inspire life-changing impact and deliver better outcomes for all. We are most interested in finding the best candidate for the job and recognize the value that candidates with all backgrounds, including non-traditional ones, bring. If you are interested in joining us, we encourage you to apply even if you don't meet 100% of the requirements listed above. ZS is an equal opportunity employer and is committed to providing equal employment and advancement opportunities without regard to any class protected by applicable law.
To Complete Your Application: Candidates must possess or be able to obtain work authorization for their intended country of employment. An online application, including a full set of transcripts (official or unofficial), is required to be considered. NO AGENCY CALLS, PLEASE. Find Out More At

Posted 4 days ago

Apply

3.0 - 6.0 years

2 - 6 Lacs

Mumbai

Work from Office

Deploy, configure, and manage AWS cloud infrastructure (EC2, VPC, S3, RDS, IAM, CloudWatch, ELB, etc.). Set up and maintain CI/CD pipelines using Jenkins, GitLab CI, or GitHub Actions. Implement Infrastructure as Code (IaC) using Terraform or AWS

Posted 4 days ago

Apply

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies