Jobs
Interviews

2943 Datadog Jobs - Page 24

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

7.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Job Title : Senior DevOps Engineer (GCP | DevSecOps | Monitoring) Employment Type : Full-time Experience : 7+ Years Job Summary We are seeking a highly experienced and results-driven Senior DevOps Engineer to join our dynamic team. The ideal candidate will bring 7+ years of hands-on experience in cloud infrastructure, monitoring, security, and DevSecOps practices especially within the Google Cloud Platform (GCP) ecosystem. This role demands strong expertise in designing, implementing, and leading complex DevSecOps and monitoring initiatives across cloud-native environments. Key Responsibilities Lead the end-to-end design, implementation, and delivery of scalable and secure DevSecOps solutions. Implement and maintain monitoring and observability tools such as New Relic, Datadog, Grafana, and Prometheus. Manage and optimize GCP infrastructure for performance, security, and cost efficiency. Define and enforce DevSecOps best practices, integrating security at every stage of the development lifecycle. Work closely with Data Engineering teams to support data pipelines and infrastructure automation. Manage CI/CD pipelines using GitLab and ensure smooth deployment workflows. Maintain containerized environments using Docker and Kubernetes. Collaborate with cross-functional teams to ensure system reliability, scalability, and security. Required Skills & Experience 7+ years of experience in a DevOps/DevSecOps role with a strong background in GCP. Proven experience with monitoring/observability tools: New Relic, Datadog, Grafana, Prometheus. Deep understanding of DevSecOps principles, cloud security, and compliance practices. Strong hands-on experience with Docker and Kubernetes. Proficiency with GitLab for CI/CD automation. Familiarity with infrastructure-as-code and configuration management tools. Solid scripting and automation skills (e.g., Bash, Python, Terraform, etc.). Experience collaborating with Data Engineers and supporting data-driven applications. Preferred Qualifications GCP certifications (e.g., Professional Cloud DevOps Engineer, Cloud Architect). Experience with other cloud platforms (e.g., AWS, Azure) is a plus. Exposure to data pipeline tools and big data platforms is advantageous. (ref:hirist.tech)

Posted 2 weeks ago

Apply

0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

As a DevOps Engineer, youll play a key role in building and maintaining a robust, scalable, and reliable 0-downtime platform. Youll work hands-on with a recently kick-started greenfield initiative with modern infrastructure and automation tools to support our engineering teams.. This is a great opportunity to work with a forward-thinking team, and the freedom to approach problems with fresh thinking, embedding AI and automation and helping shape our cloud-native journey.. If youre passionate about automation, cloud infrastructure, and delivering high-quality production-grade platforms, this role offers the chance to make a real impact.. Key Responsibilities Hands-On Development : Design, implement, and optimise AWS infrastructure through hands-on development using Infrastructure as Code tools.. Automation & CI/CD Develop and maintain CI/CD pipelines to automate fast, secure and seamless deployments.. Platform Reliability Ensure high availability, scalability, and resilience of our platform, leveraging managed services. Monitoring & Observability Implement and manage proactive observability using DataDog and other tools to monitor system health, performance, and security, making sure we can see and fix issues before they impact users. Cloud Security & Best Practices Apply cloud and security best practices, including patching and secure configuration of networking, encryption (at rest and in transit), secrets and identity/access management. Continuous Improvement Contribute ideas and solutions to improve our DevOps processes. AI & Future Tech We want to push the boundaries of AI-driven development if you have ideas on how to embed AI into our DevOps processes, youll have the space to explore them.. Your Experience Tech stack : We use Terraform, Terragrunt, Helm, Python, Bash, AWS (EKS, Lambda, EC2, RDS/Aurora), Linux OS & Github Actions. Youre comfortable with all of these, and have strong hands-on experience with Terraform and IaC principles, CI/CD and the AWS ecosystem.. Proven experience with Networking (VPC, Subnets, Security Groups, API Gateway, Load Balancing, WAF) and Cloud configuration (Secrets Manager, IAM, KMS). Comfortable with Kubernetes, ArgoCD, Isitio & Deployment strategies (blue/green & canary). Familiarity with Cloud Security services such as Security Hub, Guard Duty, Inspector and vulnerability management/patching. Observability Mindset You believe in measuring everything. Youve worked with DataDog (or similar) to ensure teams have visibility into platform health and security.. Experience with embedding AI into DevOps processes is advantageous (ref:hirist.tech)

Posted 2 weeks ago

Apply

12.0 - 22.0 years

45 - 65 Lacs

Hyderabad

Work from Office

Role & responsibilities As a Senior Manager, you will work with and manage the engineering team in the Hyderabad Development Centre to deliver the goals and objectives of the business. As a leader, you must be capable of working in a matrixed organization and coordinating the delivery of multiple outcomes. You will be hands-on in terms of design, architecture, and development and should be able to lead the team from front in any critical situation. As a people leader first and a delivery manager second, you must build, inspire, and lead the technical teams. In this role, you are expected to work with stakeholders and internal customers across the different GAP tech locations. You will be managing the Engineering Platform Observe team that set modern architecture principles to promote innovation, flexibility, and reuse. Our team support the engineering teams in building automation to help enable developer success across all our brands and markets. You'll play a key role in building, maintaining, and supporting GAPs next-generation Observability platform enabling innovation, solutioning and exceptional developer experience. We have a sharp technical team, and you will be working with many high-performing software development professionals in a friendly, open-minded, and diverse environment. What Youll Do: Lead DevOps best practices and mentor a team of Observability engineers working towards optimizing our monitoring solutions. Develop the roadmap and strategy of seamlessly onboarding the Product teams on our Observability solutions. Architecture and enhance implementation of Observability platforms across the organization. Present possible updates, recommendations, strategic opportunities to local & US leadership. Develop relationships with local business leaders. Strong desire to simplify the developers debug experience by adopting and on boarding the right tools across the enterprise. Develop an understanding of GAP's Observability Pipelines to automate and enhance user experience. Participate in the design of new or changing monitoring needs. Build, operationalize, and maintain Observability solutions for our technology customers Participate in problem solving and troubleshooting for the assigned applications, functional areas or projects Stay current with changes in the technical area of expertise Build, maintain, and support enterprise production systems with a business mindset, keeping an eye towards simplicity, reliability, maintainability, scalability, extensibility and performance Drive resolution of operational and production issues in a timely manner. Support internal customers in adopting our Next Generation Observability pipelines. Work with the team to develop features and improvements. Identifies opportunities to eliminate or automate remediation through RCA for recurring issues to improve overall operational stability of software applications and systems Preferred candidate profile Minimum 5 years experience in Engineering Leadership position, overall 12+ years of work experience. Hands on experience and managing operations of large-scale internet-centric production environments for application or infrastructure services serving tens to millions of end users. Excellent decision-making, problem-solving and time management skills. Demonstrated ability to innovate and operate outside the comfort zone of established methods and procedures Demonstrated ability to gain immediate credibility at all levels both inside and outside the organization and develop lasting, productive and collaborative relationships Excellent communication and influencing skills including the ability to simplify key messages, present compelling stories and promote technical and personal credibility with internal and external executives, and both technical and non-technical audiences Willingly shares relevant technical and/or industry knowledge and expertise in order to mentor team members. Strong hands on experience with latest Observability trends. Asses new Observability technologies and their potential fit within our current ecosystem. Support the team's technical growth through code reviews, architecture discussions, and knowledge sharing. Drive the development of tools to streamline developer workflows, in collaboration with other teams. Efficiently collaborate with other cross-functional teams in driving initiatives. Participate in an on-call rotation as needed by the business. Retail/Ecommerce industry experience preferred Strong considerable hands-on experience with monitoring tools like Grafana, Prometheus, OpenTelemetry (OTEL), NewRelic, Nagios & Splunk or similar tools. Proficiency with Infrastructure as Code patterns & tools (e.g. ARM, Terraform, GitOps) Proficiency with Multi cloud platforms Observability solutions like Azure Monitor, Google Cloud Observability or AWS Cloudwatch, Working on at least one Kubernetes cloud offering (AKS/GKE) or on-prem Kubernetes (native Kubernetes) Experience with Unix platforms, system administration skills in UNIX Appreciation and preference for open-source solutions like OTEL or eBPF. Ability to maintain and manage observability tools to look at logs, metrics & traces to diagnose issues within that system. Experience in scaling infrastructure to support high-throughput data-intensive applications Experience working on projects following Agile methodologies You're proficient in at least one programming language (e.g., Python, Java, Go) and comfortable working across different types of languages as needed. Working knowledge of Collaboration tools like Slack, JIRA & Confluence & Service Management tools like ServiceNow & PagerDuty About Us: Hyderabad Development Center (HDC): Launched in March 2017 with a small pilot team, Gap Inc.’s Hyderabad Development Center has grown into the India’s largest fashion retail technology hub with 800+ employees today. HDC plays a pivotal role in driving innovation across digital technology, engineering, employee enablement, cybersecurity, data science, product management and customer experience. Home to 40% of Gap Inc.’s global tech workforce, this young and diverse team is powering cutting-edge e-commerce and enterprise solutions for our people and iconic brands. Our growth is powered by a strong focus on nurturing talent and shaping the next generation of innovators in fashion retail technology. About Gap Inc.: Gap Inc., a house of iconic brands, is the largest specialty apparel company in America. Its Old Navy, Gap, Banana Republic, and Athleta brands offer clothing, accessories, and lifestyle products for men, women and children. Since 1969, Gap Inc. has created products and experiences that shape culture, while doing right by employees, communities and the planet. Gap Inc. products are available worldwide through company-operated stores, franchise stores, and e-commerce sites. Fiscal year 2024 net sales were $15.1 billion. For more information, please visit www.gapinc.com.

Posted 2 weeks ago

Apply

0 years

0 Lacs

India

On-site

Job Summary: We are seeking a highly skilled and proactive DevOps Engineer to join our technology team. The DevOps Engineer will be responsible for building, managing, and optimizing CI/CD pipelines, cloud infrastructure, and deployment processes to support high-availability, secure, and scalable applications. The ideal candidate will collaborate closely with development, QA, and IT teams to drive automation, improve system performance, and ensure the reliability of our platforms. Key Responsibilities: Design, implement, and manage CI/CD pipelines to enable continuous integration and continuous delivery. Manage and optimize cloud infrastructure (AWS, Azure, GCP) for scalability, security, and performance. Automate infrastructure provisioning using Infrastructure as Code (IaC) tools like Terraform, CloudFormation, or Ansible. Configure and manage containerization platforms (Docker, Kubernetes, ECS, etc.). Monitor system performance and reliability using tools like Prometheus, Grafana, Datadog, or ELK Stack. Ensure security best practices across infrastructure and deployment pipelines. Troubleshoot and resolve infrastructure and application issues in production and staging environments. Collaborate with cross-functional teams to improve software delivery processes. Document infrastructure, processes, and best practices. Requirements: Bachelor’s degree in Computer Science, Engineering, or a related field. Strong hands-on experience with cloud platforms (AWS, Azure, or GCP). Proficiency with CI/CD tools (Jenkins, GitHub Actions, GitLab CI, Azure DevOps). Experience with containerization technologies (Docker, Kubernetes, Helm). Proficiency in scripting languages (Bash, Python, PowerShell). Strong knowledge of Linux/Unix system administration. Familiarity with monitoring tools and incident response strategies. Understanding of networking fundamentals and security best practices. Preferred Qualifications: Certifications (AWS Certified DevOps Engineer, Certified Kubernetes Administrator, Azure DevOps Engineer Expert). Experience with serverless computing and microservices. Familiarity with agile methodologies and DevSecOps practices.

Posted 2 weeks ago

Apply

5.0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

Hey There 👋 At Saleshandy, we're building the Cold Email Outreach platform of the future. We're building a product toward eliminating manual processes and helping companies generate more replies/book more meetings / generate leads (faster). Since our founding in 2016, we've grown to become a profitable, 100% geographically dispersed team of 65+ high-performing happy people who are dedicated to building a product that our customers love. What’s the Role About? Ever wondered how Saleshandy schedules millions of emails and still feels lightning-fast? Behind that magic is performance engineering. We’re hiring a Performance Engineer who thrives on making systems faster, leaner, and more reliable across backend, frontend, and infrastructure. Your mission: eliminate latency, fix CPU/memory bottlenecks, optimize queries, tame queues, and guide teams to build with performance in mind. This isn’t just about fire-fighting, it’s about owning speed as a product feature. You’ll work across the stack and use deep diagnostics, smart tooling, and system intuition to make things fly. Why Join Us? Purpose: Your work will directly impact page speeds, email throughput, scale. At Saleshandy, performance isn’t a luxury, it’s part of our premium promise. Growth: You’ll operate across multiple teams and tech layers, Node.js, MySQL, Redis, React, Kafka, ClickHouse, AWS, with the freedom to shape how we build fast systems. Motivation: If you’ve ever celebrated shaving 500ms off a page load, or chased a memory leak across 3 services just for fun, this is your home. We celebrate engineers who care about P99s, flamegraphs, and cache hits. Your Main Goals Identify and Eliminate Backend Bottlenecks (within 90 days) Run deep diagnostics using Clinic.js, heap snapshots, GC logs, and flamegraphs. Tackle high CPU/memory usage, event loop stalls, and async call inefficiencies in Node.js. Goal: Cut backend P95 response times by 30–40% for key APIs. Optimize MySQL Query Performance & Configuration (within 60 days) Use slow query logs, EXPLAIN, Percona Toolkit, and indexing strategies to tune queries and schema. Tune server-level configs like innodb_buffer_pool_size. Target: Eliminate top 10 slow queries and reduce DB CPU usage by 25%. Improve Frontend Performance & Load Time (within 90 days) Audit key frontend flows using Lighthouse, Core Web Vitals, asset audits. Drive improvements via lazy loading, tree-shaking, and code splitting. Goal: Get homepage and dashboard load times under 1.5s for 95% users. Make Infra & Monitoring Observability-First (within 120 days) Set up meaningful alerts and dashboards using Grafana, Loki, Tempo, Prometheus. Lead infra-level debugging — thread stalls, IO throttling, network latency. Goal: Reduce time-to-detect and time-to-resolve for perf issues by 50%. Important Tasks First 30 Days – System Performance Audit Do a full audit of backend, DB, infra, and frontend performance. Identify critical pain points and quick wins. Debug a Live Performance Incident Catch and resolve a real-world performance regression. Could be Node.js memory leak, a slow MySQL join, or Redis job congestion. Share a full RCA and fix. Create and Share Performance Playbooks (by Day 45) Build SOPs for slow query debugging, frontend perf checks, Redis TTL fixes, or Node.js memory leaks. Turn performance tuning into team sport. Guide Teams on Performance-Aware Development (within 90 days) Create internal micro-trainings or async reviews to help devs write faster APIs, reduce DB load, and spot regressions earlier. Use AI or Smart Tooling in Diagnostics Try out tools like Copilot for test coverage, or use AI-powered observability tools (e.g. Datadog AI, Loki queries, etc.) to accelerate diagnostics. Build Flamegraph/Profiling Baselines Set up and maintain performance profiling baselines (using Clinic.js, 0x, etc.) so regressions can be caught before they ship. Review Queues and Caching Layer Identify performance issues in Redis queues — retries, TTL delays, locking — and tune caching strategies across app and DB. Contribute to Performance Culture Encourage tracking of real metrics: TTI, DB query time, API P95s. Collaborate with product and engineering to define what “fast enough” means. Experience Level: 3–5 years Tech Stack: Node.js, MySQL, Redis, Grafana, Prometheus, Clinic.js, Percona Toolkit Culture Fit – Are You One of Us? We're a fast-moving, globally distributed SaaS team where speed matters not just in product, but in how we work. We believe in ownership, system thinking, and real accountability. If you like solving hard problems, value simplicity, and hate regressions, you’ll thrive here.

Posted 2 weeks ago

Apply

3.0 - 5.0 years

0 Lacs

Chennai, Tamil Nadu, India

Remote

Position Description Company Profile: Founded in 1976, CGI is among the largest independent IT and business consulting services firms in the world. With 94,000 consultants and professionals across the globe, CGI delivers an end-to-end portfolio of capabilities, from strategic IT and business consulting to systems integration, managed IT and business process services and intellectual property solutions. CGI works with clients through a local relationship model complemented by a global delivery network that helps clients digitally transform their organizations and accelerate results. CGI Fiscal 2024 reported revenue is CA$14.68 billion and CGI shares are listed on the TSX (GIB.A) and the NYSE (GIB). Learn more at cgi.com. Job Title: Performance Tester Position: Systems Engineer Experience: 3-5 Years Category: Software Development/ Engineering Shift: 24/7 Main location: Chennai Position ID: J0725-1216 Employment Type: Full Time Education Qualification: Bachelor's degree in Computer Science or related field or higher with minimum 3 years of relevant experience. Position Description: We are looking for an Performance Tester experienced to join our team. The ideal candidate should be passionate about coding and developing scalable and high-performance applications. You will work closely with our front-end developers, designers, and other members of the team to deliver quality solutions that meet the needs of our clients. Work with Product team on performance test strategy and test plan, interfacing with all level of the application protocol stack Ability to recommend scope for performance tests and to interpret results Create automated and manual test script based on uses cases and test scenarios Build and executes test to provide high value, high accuracy results Monitor and analyze performance metrics and application logs Triage defects with development partners and project management teams Work with application architects to identify performance bottlenecks and make tuning recommendations System Administration in Linux and AWS environments. Develop monitoring and management tools and processes. Communicate effectively in team environment. Participate in an on-call rotation and provide 24x7 support. The Skills that are Key to this role Performance Testing: LoadRunner, JMeter, Selenium, Grafana, Datadog, AWS Experience/Knowledge of cloud computing environments and applications (AWS/Azure) is highly desired. Application performance testing experience required. Languages: Java, SQL, scripting (bash/ksh), python preferred. Linux systems administration experience required. (Linux commands) Application server: Tomcat, Apache, IBM WebSphere. Experience in Continuous Integration tools is highly desired Solid experience in Agile methodologies. Excellent communication skills required. Strong problem resolution skills required. Ability to work in a team-oriented environment required. Any workflow tool experience is a big plus. Owns the outcome by taking personal accountability for delivering strong results Self-directed, willing to take initiative, pragmatic and results-oriented Able to work with remote and international team members The Expertise looking for 4-5 years of Performance engineering experience against large-scale end-to-end systems Experience in Systems engineering on Unix, web and application servers Experience in Performance testing of Cloud/AWS hosted applications A Bachelor’s or Master’s degree in Computer Science, Software engineering or related field. Life at CGI: It is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because… You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons Come join our team, one of the largest IT and business consulting services firms in the world Your future duties and responsibilities Required Qualifications To Be Successful In This Role Together, as owners, let’s turn meaningful insights into action. Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because… You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction. Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise. You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons. Come join our team—one of the largest IT and business consulting services firms in the world.

Posted 2 weeks ago

Apply

2.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together. Primary Responsibilities Defining and setting up best industry alert and monitoring practices across line of business and design/architect efficient monitoring dashboards on Splunk/DTSaas/DataDog/Grafana common for all applications/products across line of business Participating in program and other peak season readiness initiatives and collaboration with application teams evaluating applications from resiliency, availability, and reliability perspective Act as a gatekeeper for changes rolling into production Embrace continuous learning of engineering practices to ensure industry best practices and technology adoption, including DevOps, Cloud and Agile thinking Tech debt reduction/Tech transformation including opensource/inner source adoption, Cloud adoption, HCP assessment and adoption Improve processes/runbooks and lead automation efforts of any manual items around support cutting down manual toil Improve operational tooling, frameworks, perform chaos engineering activities Respond to platform emergencies, alerts, and escalations Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so Required Qualifications B. Tech and/or MS in computer science or equivalent 2+ years of experience as a DevOps Engineer, with a solid focus on cloud-based infrastructure and container orchestration platforms 2+ years of experience in managing cloud-based infrastructure and container orchestration platforms, specifically AWS or GCP, including services like EC2, S3, RDS, EKS/GKE 2+ years of experience with Kubernetes for container orchestration and deployment 2+ years of experience developing and maintaining robust CI/CD pipelines using Jenkins and GitHub Actions Hands-on experience with automation tools like Ansible and Terraform to automate tasks related to provisioning, configuration, and maintenance In-depth knowledge of GitHub features, including Actions, workflows, repositories, branches, pull requests, and permissions management Knowledge or architectural understanding of applications developed with Java/J2EE technologies, including Spring Boot and Microservices architecture Knowledge on DevOps tools and practices including Docker, Redis, SonarQube, and Fortify Knowledge on Development Methodology or Engineering Practices - Agile (SCRUM / KANBAN / SAFe) Solid understanding of software development princIples, version control systems (particularly Git and GitHub), continuous integration/continuous deployment (CI/CD) pipelines, and infrastructure as code (IaC) concepts Understanding of front-end web technologies and frameworks such as Angular and React, and their integration within deployment pipelines Understanding of testing methodologies and their integration within automated CI/CD processes Familiarity with Java build tools, specifically Maven At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.

Posted 2 weeks ago

Apply

3.0 - 8.0 years

12 - 16 Lacs

Hyderabad

Work from Office

Job Area: Information Technology Group, Information Technology Group > Systems Analysis General Summary: The QA COE is looking for an individual contributor with good experience in Automation using Selenium and Performance testing using JmeterThe Individual needs to collobrate closely with SA/Dev/QA to execute STLC and create all test artifacts and reportsShould have stong automation skills and zeal to Develop, Maintain, and Execute automation test suite using opensource/inhouse frameworks.Should have expertize in Selenium scriptingShould have expertize in Performance testing using Jmeter and working experience in windows and linux environments.Should have ability and flexiblity to work on multiple projects.Having experience in Python and Devops tools like KubernetiesHaving knowledge on Jenkins, Python and exposure to DevOps tools (Kubernetes, docker), Monitoring tools (Splunk, Datadog), The candidate should have good communication, analytical, and problem-solving skills.Should ensure all project deliverables are met as per requirment and process. Minimum Qualifications: 3+ years of IT-relevant work experience with a Bachelor's degree. OR 5+ years of IT-relevant work experience without a Bachelors degree. Skills/ExperienceCandidates with 6-10 years of experience and having hands on testing API, web, backend applications in linux and window environments.- 3-5 years of automation experience with good hands-on in Jmeter and Selenium- Should have Good knowledge in using Maven, Testng, jenkins, Splunk and GIT-- Working experience in Python is added advantage - Expousure to Devops, cloud, monitoring tools like technologies like Kubernetes, docker is added advantage Candidates with 4-8 years of experience in automation testing in Linux and Windows environment, Should have experience and expertize on Selenium (Web Driver, grid) ,Test Ng/Junit, Maven, Jenkins.Having knowledge on Python and exposure to DevOps tools would be a plus. Bachelors / Masters degree in any stream B.E/B.Tech/M.Tech/MCA, Major in Information technology, Computer Science, or Equivalent. Applicants Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries). Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law. To all Staffing and Recruiting Agencies Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications. If you would like more information about this role, please contact Qualcomm Careers.

Posted 2 weeks ago

Apply

6.0 - 10.0 years

14 - 19 Lacs

Noida

Work from Office

With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And were only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieveRead on. Here, we know that youre more than your work. Thats why our benefits help you thrive personally and professionally, from wellness programs and tuition reimbursement to U Choose "” a customizable expense reimbursement program that can be used for more than 200+ needs that best suit you and your family, from student loan repayment, to childcare, to pet insurance. Our inclusive culture, active and engaged employee resource groups, and caring leaders value every voice and support you in doing the best work of your career. If youre passionate about our purpose "” people "”then we cant wait to support whatever gives you purpose. Were united by purpose, inspired by you. Key Responsibilities: Monitor and support Kronos Private Cloud and hosted environments remotely. Perform remote monitoring of Microsoft Windows (2003/2008/2012/2016) and Linux servers for:o System performance and uptimeo SQL database healtho Application service and web application statuso Server resource utilization Respond to alerts from monitoring tools and take corrective actions. Troubleshoot and identify root causes of server and application performance issues. Handle Level 1 escalations and follow the defined escalation matrix. Administer and maintain Windows and Linux operating systems. Support web applications and hosting services including IIS, JBoss, and Apache Tomcat. Understand and troubleshoot server-client architecture issues. Collaborate with internal teams to ensure high availability and performance of hosted services. Document incidents, resolutions, and standard operating procedures. Participate in 24/7 rotational shifts, including nights and weekends.Preferred Requirements and Skills: Experience with UKG Workforce Central (WFC) application. Familiarity with ServiceNow for incident, problem, and change management. Strong understanding of cloud infrastructure, virtualisation (VMware), and hybrid environments. Knowledge of web server configurations, deployments, and troubleshooting. Excellent communication, analytical, and problem-solving skills. Familiarity with monitoring tools (DataDog, Grafana, Splunk) and alert management. Willingness to work in rotational shifts, including nights and weekend Where were going UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet its our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow! UKGCareers@ukg.com

Posted 2 weeks ago

Apply

0 years

0 Lacs

Greater Hyderabad Area

On-site

Java AWS Developer Bangalore (Virtual) and Hyderabad (F2F) Job Description: • Experience in Java, J2ee, Spring boot. • Experience in Design, Kubernetes, AWS (Lambda, EKS, EC2) is needed. • Experience in AWS cloud monitoring tools like Datadog, Cloud watch, Lambda is needed. • Experience with XACML Authorization policies. • Experience in NoSQL , SQL database such as Cassandra, Aurora, Oracle. • Experience with Web Services SOA experience (SOAP as well as Restful with JSON formats), with Messaging (Kafka). • Hands on with development and test automation tools/frameworks (e.g. BDD and Cucumber)

Posted 2 weeks ago

Apply

7.0 years

0 Lacs

Gurugram, Haryana, India

On-site

Cvent is a leading meetings, events, and hospitality technology provider with more than 4,800 employees and ~22,000 customers worldwide, including 53% of the Fortune 500. Founded in 1999, Cvent delivers a comprehensive event marketing and management platform for marketers and event professionals and offers software solutions to hotels, special event venues and destinations to help them grow their group/MICE and corporate travel business. Our technology brings millions of people together at events around the world. In short, we’re transforming the meetings and events industry through innovative technology that powers the human connection. The DNA of Cvent is our people, and our culture has an emphasis on fostering intrapreneurship – a system that encourages Cventers to think and act like individual entrepreneurs and empowers them to take action, embrace risk, and make decisions as if they had founded the company themselves. At Cvent, we value the diverse perspectives that each individual brings. Whether working with a team of colleagues or with clients, we ensure that we foster a culture that celebrates differences and builds on shared connections. Position Description: As Product Lead, you will be responsible for executing the product roadmap for one or more of our key technology products with the help of cross-functional teams through the entire product lifecycle. You will be providing direction and clarification to the development teams throughout the project and create, prioritize, groom, and manage requirements. Responsibilities Develop an in-depth understanding of the business. products, including its goals and challenges, as well as a comprehensive knowledge of the customers and users of the platform. This involves analyzing their needs, behaviors, and pain points to ensure the product aligns with their expectations and delivers meaningful value. Collaborate regularly with product managers on the future roadmap, brainstorm new ideas, and prioritize features, ensuring the backlog remains healthy, well-organized, and aligned with business goals. Become a subject matter expert in your product domain, possessing in-depth knowledge and insights that make you the primary point of contact for internal teams seeking guidance and collaboration. Should possess a solid grasp of technology platforms, integrations, and system design to collaborate effectively with engineering partners and architects to deliver scalable, performant, and reliable infrastructure. Develop detailed product specifications that begin by clearly articulating the “why” - the core purpose and value behind the product, feature, or enhancements. These specifications should be thorough in addressing all necessary aspects yet written in a clear and concise manner to ensure they are easily understood by all stakeholders. Write comprehensive user stories with detailed acceptance criteria that address all functional and non-functional aspects of the product, including security, data compliance, and other essential requirements. Work closely with engineering, design (UX/UI), and other stakeholders to define product requirements and user stories. Lead end-to-end product development process, from concept and design to development, testing, launch and ensuring timely and high-quality delivery. Lead PI (Program Increment) planning and actively participate in key SCRUM ceremonies, including daily standups, backlog refinement, sprint planning, and product demos. Leverage the power of analytics and data-driven intelligence to assess product performance, user behavior analysis, gather customer feedback, and insights from multiple channels, and make data-informed decisions which enables you to drive excellence through iteration, improvement, and refinement. Monitor and analyze key performance indicators (KPIs) and metrics to gauge the effectiveness, adoption, and product outcome. Share and present regular updates on the product roadmap and progress to leaders and stakeholders. Clearly highlight any potential risks and outline a plan to address them, ensuring everyone is informed and aligned on the path forward. Contribute to a culture of creativity and innovation within the product management team, think outside the box, explore new ideas, and identify opportunities to make our products stand out in the market. What we are looking for: Minimum 7-9 years of career experience with at least 4+ years of product management (PO/PM) experience in technology products. Strong product sense with an ability to articulate problems and envision solutions. Strong understanding of REST APIs , event-driven systems , and data processing pipelines. Ability to understand and contribute to technical architecture discussions around microservices, scalability, observability, and fault tolerance. Experience working on communication systems or messaging platforms (email, SMS, push) is highly desirable. Familiarity with monitoring/logging tools like Datadog, Splunk, or ELK for understanding system health. Understanding of compliance and privacy regulations related to communication systems. Inquisitive mindset with excellent analytical, problem-solving, and decision-making skills. Ability to handle complex situations, conversations and navigate ambiguity. Exceptional interpersonal skills, with the ability to influence, partner and build relationships across all levels of the organization. Strong written and verbal communication skills with an ability to communicate directly and clearly.

Posted 2 weeks ago

Apply

5.0 years

0 Lacs

India

On-site

Minimum of 5 years of hands-on experience as a Database Administrator managing both Oracle and Microsoft SQL Server environments. Job Description: Database Administration: Install, configure, administer, and maintain Oracle and Microsoft SQL Server databases across development, testing, and production environments. Performance Tuning and Optimization: Proactively monitor database performance, identify bottlenecks, and implement effective tuning strategies to ensure optimal responsiveness and efficiency. Backup and Recovery: Develop, implement, and test comprehensive backup and recovery strategies to ensure data integrity and business continuity. Security Management: Implement and enforce database security policies, including user access control, auditing, and data encryption, in compliance with industry best practices and regulatory requirements. High Availability and Disaster Recovery: Design, implement, and maintain high availability (HA) and disaster recovery (DR) solutions for both Oracle and SQL Server environments in VMware, AWS, and Azure. Cloud Database Management: Provision, configure, and manage cloud-based database services (e.g., Amazon RDS, Azure SQL Database, Oracle Cloud Infrastructure) and understand their specific features and limitations. Database Migrations and Upgrades: Plan and execute database migrations, upgrades, and patching activities with minimal disruption to business operations. Capacity Planning: Monitor database growth trends and proactively plan for future capacity needs to ensure adequate resources are available. Troubleshooting and Problem Solving: Diagnose and resolve complex database-related issues in a timely and efficient manner. Automation and Scripting: Develop and maintain scripts (e.g., SQL, Shell, PowerShell, Python) to automate routine database tasks and improve operational efficiency. Collaboration and Communication: Work closely with development teams, system administrators, and other stakeholders to provide database expertise and support application deployments. Documentation: Create and maintain comprehensive documentation for database configurations, procedures, and best practices. Staying Current: Continuously learn and adopt new database technologies and best practices. Qualifications Bachelor's degree in Computer Science, Information Technology, or a related field. Minimum of 5 years of hands-on experience as a Database Administrator managing both Oracle and Microsoft SQL Server environments. Proven experience working with databases in VMware virtualized environments. Significant experience with cloud-based database services on AWS (e.g., RDS, Aurora) and/or Azure (e.g., Azure SQL Database, Managed Instances). Strong understanding of database principles, architecture, and best practices. Expertise in performance tuning and optimization techniques for both Oracle and SQL Server. Solid experience with backup and recovery strategies and tools. Proficiency in implementing and maintaining database security measures. Experience in designing and implementing high availability and disaster recovery solutions. Excellent scripting skills (e.g., SQL, Shell scripting, PowerShell, Python). Strong analytical and problem-solving skills with the ability to troubleshoot complex database issues. Excellent communication, collaboration, and interpersonal skills. Ability to work independently and as part of a team. Preferred Qualifications Relevant certifications (e.g., Oracle Certified Professional, Microsoft Certified: Database Administrator Associate/Expert, AWS Certified Database – Specialty, Microsoft Certified: Azure Database Administrator Associate). Experience with database monitoring tools (e.g., SolarWinds, Datadog, CloudWatch, Azure Monitor). Experience with data warehousing concepts and technologies. Familiarity with DevOps practices and CI/CD pipelines. Kaleris is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Posted 2 weeks ago

Apply

5.0 years

10 - 12 Lacs

Thiruvananthapuram Taluk, India

Remote

Devops Engineer Work mode : Remote Salary- 12 LPA Experience- 5 To 8 Years We are seeking a highly experienced and passionate DevOps Engineer with 5 years of hands-on experience to join our dynamic team. The ideal candidate will be instrumental in designing, implementing, and maintaining our scalable and highly available infrastructure, focusing on automation, CI/CD pipelines, and cloud-native solutions. You will play a crucial role in bridging the gap between development and operations, ensuring smooth and efficient software delivery from code commit to production. Responsibilities Infrastructure as Code (IaC): Design, implement, and manage cloud infrastructure using IaC tools (e.g., Terraform, CloudFormation, Pulumi) for various cloud providers (AWS, Azure, GCP). CI/CD Pipeline Management: Develop, maintain, and optimize robust CI/CD pipelines using tools like Jenkins, GitLab CI/CD, Azure DevOps, GitHub Actions, or CircleCI to automate software build, test, and deployment processes. Containerization & Orchestration: Expertise in containerization technologies (Docker) and orchestration platforms (Kubernetes, Amazon EKS, Azure AKS, Google GKE). Cloud Platform Expertise: Proficient in administering, optimizing, and troubleshooting services on at least one major public cloud platform (AWS, Azure, or GCP). Monitoring & Logging: Implement and manage comprehensive monitoring (e.g., Prometheus, Grafana, Datadog) and centralized logging (e.g., ELK Stack, Splunk, Loki) solutions to ensure system health and performance. Automation: Automate repetitive tasks and workflows across the software development lifecycle using scripting languages (e.g., Python, Bash, Go). Configuration Management: Utilize configuration management tools (e.g., Ansible, Chef, Puppet, SaltStack) for consistent environment provisioning and management. Security Best Practices: Implement and enforce security best practices within the infrastructure and CI/CD pipelines, including vulnerability scanning, access control, and compliance. Troubleshooting & Support: Provide expert-level support for infrastructure and application issues, proactively identify bottlenecks, and implement solutions. Collaboration: Work closely with development, QA, and operations teams to understand requirements, propose solutions, and foster a culture of shared responsibility and continuous improvement. Documentation: Create and maintain clear, concise documentation for infrastructure, processes, and tools. On-Call Rotation: Participate in an on-call rotation to ensure system availability and responsiveness (if applicable). Required Skills & Qualifications Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience. 5+ years of hands-on experience as a DevOps Engineer or a similar role. Strong proficiency in at least one major cloud platform (AWS, Azure, or GCP), with relevant certifications preferred. Extensive experience with Infrastructure as Code (IaC) tools (e.g., Terraform). In-depth knowledge and practical experience with CI/CD tools (e.g., Jenkins, GitLab CI/CD). Solid understanding of containerization (Docker) and container orchestration (Kubernetes). Proficiency in scripting languages (Python, Bash). Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack). Familiarity with configuration management tools (e.g., Ansible). Strong understanding of networking concepts (TCP/IP, DNS, VPN, Load Balancing). Experience with version control systems (Git, GitHub, GitLab, Bitbucket). Excellent problem-solving, analytical, and communication skills. Ability to work independently and as part of a collaborative team. Preferred Qualifications (Nice To Have) Experience with serverless computing (AWS Lambda, Azure Functions, Google Cloud Functions). Knowledge of database administration (SQL, NoSQL). Experience with microservices architecture. Familiarity with site reliability engineering (SRE) principles. Certifications in relevant technologies (e.g., Kubernetes Skills: cloudformation,go,cloud,bitbucket,loki,cd,circleci,github,ci,devops,ansible,azure devops,datadog,elk stack,gcp,scripting languages,pulumi,gitlab,kubernetes,azure,splunk,github actions,prometheus,aws,git,gitlab ci/cd,terraform,azure aks,grafana,bash,docker,chef,google gke,amazon eks,python,shell scripting,jenkins,puppet,saltstack

Posted 2 weeks ago

Apply

14.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Senior Site Reliability Engineer (SRE) – Azure Focused Location :Pune Experience : 7–14 Years Notice Period : Immediate to 30 Days Key Responsibilities Ensure availability, latency, performance, and efficiency of global eCommerce sites Design and develop E2E observability dashboards and tooling Maintain error budgets, meet SLOs, and drive incident response automation Collaborate with engineering teams to build highly reliable systems Drive proactive monitoring, root cause analysis (RCA), and system optimization Build tools to improve incident management and software delivery processes Optimize cloud infrastructure for performance and cost, primarily in Azure Promote observability best practices and help define instrumentation standards Required Skills 7–14 years in Site Reliability Engineering or DevOps Experience supporting cloud production environments (Azure preferred) Expertise with monitoring tools: Splunk, Dynatrace, Datadog, Grafana, New Relic Strong scripting skills – Python preferred (Shell acceptable) Hands-on with CI/CD tools – GitLab, Jenkins, Azure DevOps, etc. Proficient in Kubernetes, Docker, Terraform, and Ansible Knowledge of configuration management – Ansible, Chef, or AWS CodeDeploy Proven troubleshooting skills with strong ownership mindset Passionate about automation, observability, and platform reliability

Posted 2 weeks ago

Apply

0 years

0 Lacs

Gurugram, Haryana, India

On-site

Overview Cvent is a global meeting, event, travel, and hospitality technology leader, with more than 4000+ employees worldwide. As a leading cloud-based technology company, we have over 28,000+ customers, including 80% of the Fortune 100 companies, in more than 100 countries. Cvent’s software solutions optimize the entire event management value chain and have enabled clients around the world to manage hundreds of thousands of meetings and events. In addition to helping event planners navigate every aspect of the event process, we also provide an integrated platform to hoteliers to help create qualified demand for their hotels, manage that demand more efficiently, and measure their business performance in real-time. In This Role, You Will As a Site Reliability Engineer, you'll use your advanced development and operations knowledge to identify and prioritize issues. Find universal solutions to common problems and mentor and support junior staff. Additionally, You Will Enlighten, Enable and Empower a fast-growing set of multi-disciplinary teams, across multiple applications and locations. Tackle complex development, automation and business process problems. Champion Cvent standards and best practices. Ensure the scalability, performance, and resilience of our suite of products. Work with the development and product team of a new application to establish the right monitoring and alerting strategy. Develop build, test and deployment automation that seamlessly targets multiple on-premises and AWS regions. Help a dev team working on a legacy code base to realize zero-down-time deployments. Give back by working on and contributing to Open Source projects Automate all the things! Here's What You Need Experience with SDLC methodologies (preferably Agile software development methodology). Scripting languages like Ruby, Groovy, Bash, PowerShell, or Python. Exposure to managing AWS services / operational knowledge of managing applications in AWS Experience with configuration management tools such as Chef, Puppet, Ansible or equivalent Hands-on experience with Windows and Linux/Unix Administration Working with APM, monitoring, and logging tools (New Relic, DataDog, Splunk) Good understanding of containerization concepts - docker, ECS, EKS, Kubernetes Experience managing 3 tier application stacks Experience with build tools such as Jenkins Working experience with NoSQL databases such as MongoDB, couchbase, postgres etc F5 load balancing concepts Understanding of basic networking concepts Experience with package managers such as nexus, artifactory or equivalent Good communication skills

Posted 2 weeks ago

Apply

7.0 years

0 Lacs

Vadodara, Gujarat, India

Remote

Company Description Webbrains Technologies is an Australian-based IT firm with development locations in India, offering a wide range of IT services to clients globally. Our services include Web Designing & Development, Mobile Application Development, Custom Software Solutions, E-Commerce Platforms, and dedicated remote IT resources. We cater to various industries such as Healthcare, Energy, Construction, Finance, Media, and more. We are proud to have a successful client base across 42+ countries. Job Summary: We are seeking a highly skilled DevOps Architect to design, implement, and manage scalable, secure, and reliable DevOps practices. You will play a critical role in enhancing our CI/CD pipelines, infrastructure automation, cloud architecture, and system monitoring to support smooth and efficient software delivery. Key Responsibilities: Design and implement DevOps solutions, infrastructure, and CI/CD pipelines across various environments (development, testing, staging, and production). Architect highly available, scalable, and secure systems using cloud platforms (AWS, Azure, or GCP). Collaborate with development, QA, and IT teams to ensure reliable software deployments and operational excellence. Automate provisioning, deployment, scaling, and monitoring using tools like Terraform, Ansible, or Cloud Formation. Implement and manage containerization (Docker) and orchestration (Kubernetes) solutions. Define and enforce DevOps best practices and standard operating procedures. Design monitoring, logging, and alerting strategies using tools like Prometheus, Grafana, ELK Stack, Datadog, etc. Ensure security, compliance, and cost-efficiency of DevOps pipelines and cloud infrastructure. Mentor DevOps engineers and support continuous learning across the team. Required Skills & Qualifications: Proven experience as a DevOps Engineer/Architect (minimum 7 years). Expertise in CI/CD tools such as Jenkins, GitLab CI, Azure DevOps, or CircleCI. Strong experience with cloud platforms (AWS preferred, but Azure or GCP acceptable). Proficiency in Infrastructure as Code (Terraform, Ansible, or similar). Hands-on knowledge of containerization (Docker) and orchestration (Kubernetes). Experience with monitoring/logging tools (Grafana, ELK, Prometheus, CloudWatch, etc.). Strong scripting skills (Bash, Python, or similar). Good understanding of networking, system security, and DevSecOps principles. Excellent communication, problem-solving, and leadership skills.

Posted 2 weeks ago

Apply

0 years

0 Lacs

India

On-site

AI Engineer (Mid / Senior level) • Design, build, and optimize AI/ML models and algorithms tailored to business needs. • Develop and implement LLM-based solutions, including custom training and fine-tuning. • Integrate AI models into existing or new applications. • Work with Azure OpenAI Service to deploy and manage AI workloads. • Collaborate with cross-functional teams (Data Engineers, DevOps, Product) to scale AI initiatives. • Maintain documentation, evaluate model performance, and iterate on improvements. • Experience with Azure Open AI. Cloud Ops Engineer (Part-time / Project basic ) • Design and implement end-to-end integration between Azure and Datadog, including: o Metrics and log collection (Azure Monitor, Activity Logs, Diagnostic Settings). o Distributed tracing and custom dashboards. o Alerting rules and incident response workflows. • Build and maintain an integrated cloud support system: o Automated monitoring and alerting pipelines. o Incident triage and escalation playbooks. o Ticketing system and notification integration (e.g., PagerDuty, ServiceNow, Slack). • Ensure operational readiness through runbooks, dashboards, and SLA reporting. • Collaborate with development and infrastructure teams to identify observability gaps and improve monitoring coverage. • Maintain Infrastructure-as-Code (IaC) templates (ARM, Bicep, or Terraform) for repeatable deployment of monitoring and support tooling. Requirements • Strong experience with Microsoft Azure (App Services, AKS, Functions, Monitor, Log Analytics). • Hands-on experience integrating Datadog with Azure using APIs, agents, and Azure-native connectors. • Experience in building support or operations tools/systems (e.g., alert routing, self-healing scripts). • Solid scripting skills (e.g., PowerShell, Python, or Bash). • Familiarity with CI/CD, containerization (Docker/Kubernetes), and DevOps practices. • Experience with infrastructure automation using Terraform, Bicep, or ARM templates. • Strong troubleshooting, problem-solving, and incident management skills

Posted 2 weeks ago

Apply

8.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

As a Lead Software Engineer – Performance Engineering , you will drive the strategy, design, and execution of performance engineering initiatives across highly distributed systems. You will lead technical efforts to ensure reliability, scalability, and responsiveness of business-critical applications. This role requires deep technical expertise, hands-on performance testing experience, and the ability to mentor engineers while collaborating cross-functionally with architecture, SRE, and development teams. Responsibilities: Define, implement, and enforce SLAs, SLOs, and performance benchmarks for large-scale systems. Lead performance testing initiatives including load, stress, soak, chaos, and scalability testing. Design and build performance testing frameworks integrated into CI/CD pipelines. Analyze application, infrastructure, and database metrics to identify bottlenecks and recommend optimizations. Collaborate with cross-functional teams to influence system architecture and improve end-to-end performance. Guide the implementation of observability strategies using monitoring and APM tools. Optimize cloud infrastructure (e.g., autoscaling, caching, network tuning) for cost-efficiency and speed. Tune databases and messaging systems (e.g., PostgreSQL, Kafka, Redis) for high throughput and low latency. Mentor engineers and foster a performance-first culture across teams. Lead incident response and postmortem processes related to performance issues. Drive continuous improvement initiatives using data-driven insights and operational feedback. Required Qualifications: Bachelor’s or Master’s degree in Computer Science, Engineering, or related field. 8+ years of experience in software/performance engineering, with 2+ years in a technical leadership role. Expertise in performance testing tools such as JMeter, k6, Gatling, or Locust. Strong knowledge of distributed systems, cloud-native architecture, and microservices. Proficiency in scripting and automation using Python, Go, or Shell. Experience with observability and APM tools (e.g., Datadog, Prometheus, New Relic, AppDynamics). Deep understanding of SQL performance, caching strategies, and tuning for systems like PostgreSQL and Redis. Familiarity with CI/CD pipelines, container orchestration, and IaC tools (e.g., Kubernetes, Terraform). Strong communication skills and experience mentoring and leading technical teams. Ability to work cross-functionally and make informed decisions in high-scale, production environments.

Posted 2 weeks ago

Apply

4.0 - 6.0 years

0 Lacs

Noida

Remote

Role Summary While many vendors treat monitoring as a reactive afterthought, we embed Datadog-trained Observability Engineers directly into our engineering and operations teams to deliver real-time visibility, proactive tuning, and smarter incident management. We are looking for a highly capable Observability & Monitoring Engineer with 46 years of experience in Datadog and related observability practices. The engineer will be at the forefront of transforming how systems are monitored—reducing noise, accelerating root-cause discovery, and enabling smarter, correlated event flows across cloud-native environments. Core Responsibilities: Datadog Ownership: Build and maintain Datadog dashboards, monitors, and SLOs with a focus on business and operational relevance. Configure and tune alerts to eliminate noise and reduce false positives, enabling focused responses and intelligent routing. Proactive Monitoring & Alert Tuning: Implement proactive alert strategies based on usage patterns and event behavior. Continuously optimize thresholds, baselines, and anomaly detection logic to ensure actionable monitoring signals. Observability & Root-Cause Analysis (RCA): Correlate metrics, logs, and traces across distributed systems to facilitate rapid root-cause triangulation. Drive investigations from high CPU alerts to middleware issues such as queue overloads, using Datadog APM and tracing. Integrated Support & Event Correlation: Work closely with L2/Smart L3 and platform teams to support event correlation, AWS incident flows, and CI/CD telemetry. Participate in day-to-day IT operations, functional system support, and incident escalation workflows. SAP CPI API Monitoring: Build and maintain targeted dashboards for SAP CPI APIs to ensure availability, throughput, and performance visibility. What Makes This Role Unique: You are embedded in the core delivery team, not isolated in a separate monitoring silo. You work on proactive monitoring, not just reacting to alerts. You support a platform aligned with Smart’s tooling and architecture, including high-frequency CI tracing and real-time AWS integration. You help evolve how we define “observability maturity” by integrating it deeply into development and ops workflows. Required Skills & Experience: 4–6 years of experience in observability, SRE, or DevOps roles with strong exposure to Datadog. Experience with configuring and managing Datadog’s dashboards, monitors, APM, and logs. Deep understanding of observability principles: metrics, logs, distributed traces, RUM, and synthetic monitoring. Experience tracing infrastructure or application alerts (e.g., CPU, latency) to actual service or middleware-level bottlenecks. Familiarity with cloud platforms like AWS (preferred), Azure, or GCP. Hands-on experience in event management, incident support, and RCA documentation. Exposure to SAP CPI monitoring or other enterprise integration middleware is a plus. What You’ll Get: The opportunity to redefine observability in a modern, fast-paced environment. Ownership of critical monitoring pipelines and real-time troubleshooting tools. Work with global engineering and platform teams to drive performance and reliability. Flexible work environment and access to upskilling resources.

Posted 2 weeks ago

Apply

0.0 years

0 Lacs

Noida, Uttar Pradesh

Remote

Role Summary: The AIML Platform Engineering Lead is a pivotal leadership role responsible for managing the day-to-day operations and development of the AI/ML platform team. In this role, you will guide the team in designing, building, and maintaining scalable platforms, while collaborating with other engineering and data science teams to ensure successful model deployment and lifecycle management. Key Responsibilities: Lead and manage a team of platform engineers in developing and maintaining robust AI/ML platforms. Define and implement best practices for machine learning infrastructure, ensuring scalability, performance, and security. Collaborate closely with data scientists and DevOps teams to optimize the ML lifecycle from model training to deployment. Establish and enforce standards for platform automation, monitoring, and operational efficiency. Serve as the primary liaison between engineering teams, product teams, and leadership. Mentor and develop junior engineers, providing technical guidance and performance feedback. Stay abreast of the latest advancements in AI/ML infrastructure and integrate new technologies where applicable. Qualifications: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field. 8+ years of experience in Python & Node.js development and infrastructure. Proven experience in leading engineering teams and driving large-scale projects. Extensive expertise in cloud infrastructure (AWS, GCP, Azure), MLOps tools (e.g., Kubeflow, MLflow), and infrastructure as code (Terraform) Strong programming skills in Python and Node.js, with a proven track record of building scalable and maintainable systems that support AI/ML workflows. Hands-on experience with monitoring and observability tools, such as Datadog, to ensure platform reliability and performance. Strong leadership and communication skills with the ability to influence cross-functional teams. Excellent problem-solving skills and the ability to work in a fast-paced, collaborative environment. Job Type: Full-time Benefits: Commuter assistance Flexible schedule Health insurance Life insurance Paid sick time Paid time off Provident Fund Work from home Ability to commute/relocate: Noida, Uttar Pradesh: Reliably commute or planning to relocate before starting work (Preferred) Application Question(s): What are your salary expectations? What is your notice period? Location: Noida, Uttar Pradesh (Preferred) Work Location: In person

Posted 2 weeks ago

Apply

5.0 years

1 - 10 Lacs

Hyderābād

On-site

As a Lead Software Engineer at JPMorgan Chase within the Consumer & Community Banking Technical Team, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable way. Drive significant business impact through your capabilities and contributions, and apply deep technical expertise and problem-solving methodologies to tackle a diverse array of challenges that span multiple technologies and applications. Job responsibilities Executes creative software solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems Develops secure high-quality production code, and reviews and debugs code written by others Identifies opportunities to eliminate or automate remediation of recurring issues to improve overall operational stability of software applications and systems Leads evaluation sessions with external vendors, startups, and internal teams to drive outcomes-oriented probing of architectural designs, technical credentials, and applicability for use within existing systems and information architecture Leads communities of practice across Software Engineering to drive awareness and use of new and leading-edge technologies Adds to team culture of diversity, equity, inclusion, and respect Required qualifications, capabilities, and skills Formal training or certification on software engineering concepts and 5+ years of applied experience Demonstrated and strong hands on Python/Java Enterprise Web Development; developing in all tiers (middleware, integration and database) of the application and proven experience with design patterns Experience in design and Architecture Experience in AWS (EKS, EC2, S3,EventBridge,StepFunction, SNS/SQS,Lambda) is must Experience in Design and develop scalable, high-performance applications using AWS-native event-driven services, including API Gateway Experience in AWS cloud monitoring tools like Datadog, Cloud watch, Lambda is needed Deep hands-on experience in Django, Flask & Object Oriented methodology of design and development Experience with databases like Amazon RDS, caching and performance tuning, REST APIs, with Messaging (Kafka) Hands on with development and test automation tools/frameworks (e.g. BDD and Cucumber) Experience in best practices for Data Pipeline design, Data architecture and processing of structured and unstructured data. Ability to plan, prioritize and follow through on their work and meet deadlines in a fast-paced environment, while also clearly articulating both technical and non-technical issues with stake holders & partners like Dev Ops, Architects, QA testers & Product Owners Preferred qualifications, capabilities, and skills Experience in Micro services Experience in financial domain is preferred Exposure to artificial intelligence, machine learning, mobile Exposure to agile methodologies such as CI/CD, Applicant Resiliency, and Security Hands-on practical experience in system design, application development, testing, and operational stability

Posted 2 weeks ago

Apply

6.0 years

0 Lacs

Noida, Uttar Pradesh, India

Remote

Company Overview With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And we’re only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieve? Read on. At UKG, you get more than just a job. You get to work with purpose. Our team of U Krewers are on a mission to inspire every organization to become a great place to work through our award-winning HR technology built for all. Here, we know that you’re more than your work. That’s why our benefits help you thrive personally and professionally, from wellness programs and tuition reimbursement to U Choose — a customizable expense reimbursement program that can be used for more than 200+ needs that best suit you and your family, from student loan repayment, to childcare, to pet insurance. Our inclusive culture, active and engaged employee resource groups, and caring leaders value every voice and support you in doing the best work of your career. If you’re passionate about our purpose — people —then we can’t wait to support whatever gives you purpose. We’re united by purpose, inspired by you. Key Responsibilities Monitor and support Kronos Private Cloud and hosted environments remotely. Perform remote monitoring of Microsoft Windows (2016 and 2019) and Linux servers for: System performance and uptime SQL database health Application service and web application status Server resource utilization Citrix – End user account setup SFTP – End user account setup vCenter – Operational tasks GCP Console – Operational tasks Respond to alerts from monitoring tools and take corrective actions. Troubleshoot and identify root causes of server and application performance issues. Handle Level 1 escalations and follow the defined escalation matrix. Setting up a meeting with the customers for escalated cases to troubleshoot the issue. Administer and maintain Windows and Linux operating systems. Understand and troubleshoot server-client architecture issues. Participate in applying the monthly operating system security patching activity to keep the cloud environment security compliant. Collaborate with internal teams to ensure high availability and performance of hosted services. Identifying the challenging tasks and automating the manual processes. Document incidents, resolutions, and standard operating procedures. Actively participate in incident response, including on-call responsibilities Participate in 24/7 rotational shifts, including nights and weekends. Required Overall, must have at least 6-7 years of hands-on experience working in the IT Industry. Engineering degree, or a related technical discipline, or equivalent work experience Strong understanding of cloud infrastructure, virtualization (VMware), and hybrid environments. Excellent communication, analytical, and problem-solving skills. Familiarity with monitoring tools (DataDog, Grafana, Splunk) and alert management. Familiarity with Privilege Access Management (PAM) tools (CyberArk or Saviynt). This position typically requires being on-site in an office at least 3 days per week. Preferred Minimum 2 years of hands-on experience working in a private cloud (VMware) / public cloud platform, AWS, GCP or Azure Familiarity with JIRA, ServiceNow for incident, problem, and change management. Willingness to work in rotational shifts, including nights and weekends. Where we’re going UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet it’s our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow! UKG is proud to be an equal opportunity employer and is committed to promoting diversity and inclusion in the workplace, including the recruitment process. Disability Accommodation in the Application and Interview Process For individuals with disabilities that need additional assistance at any point in the application and interview process, please email UKGCareers@ukg.com

Posted 2 weeks ago

Apply

3.0 years

4 - 7 Lacs

Hyderābād

On-site

Company Description Organizations everywhere struggle under the crushing costs and complexities of “solutions” that promise to simplify their lives. To create a better experience for their customers and employees. To help them grow. Software is a choice that can make or break a business. Create better or worse experiences. Propel or throttle growth. Business software has become a blocker instead of ways to get work done. There’s another option. Freshworks. With a fresh vision for how the world works. At Freshworks, we build uncomplicated service software that delivers exceptional customer and employee experiences. Our enterprise-grade solutions are powerful, yet easy to use, and quick to deliver results. Our people-first approach to AI eliminates friction, making employees more effective and organizations more productive. Over 72,000 companies, including Bridgestone, New Balance, Nucor, S&P Global, and Sony Music, trust Freshworks’ customer experience (CX) and employee experience (EX) software to fuel customer loyalty and service efficiency. And, over 4,500 Freshworks employees make this possible, all around the world. Fresh vision. Real impact. Come build it with us. Job Description We are looking for a skilled and detail-oriented NOC (Network Operations Center) Engineer to join our team. You will be responsible for managing and monitoring production systems, ensuring the health, performance, and reliability of our internet-facing infrastructure. The ideal candidate should have a solid background in system administration, networking, and incident response, with a proactive mindset and a willingness to work in a 24/7 environment. Key Responsibilities Monitor and manage production systems to ensure high availability and performance. Act as the primary point of contact for all production-related incidents and alerts. Perform root cause analysis (RCA) for service-impacting events and implement preventive measures. Troubleshoot and resolve infrastructure issues escalated by internal systems or customers. Participate in a 24/7 shift rotation and ensure timely incident response and communication. Collaborate with platform and product teams to review and implement application and infrastructure monitoring changes. Develop and maintain SOPs and knowledge base articles for recurring operations tasks. Support and enforce compliance with internal security and compliance policies. Qualifications 3+ years of experience in Linux/Unix systems administration. Strong understanding of internet protocols and networking concepts: DNS, DHCP, NTP, SMTP, TCP/IP, SSH, HTTPS, TLS, IPSec, VPN, etc. Experience in application and database-level monitoring and troubleshooting (e.g., Apache, Tomcat, MySQL). Proficiency in scripting languages: Shell, Python or Ruby. Hands-on experience with monitoring and logging tools such as Nagios, New Relic, Datadog, Splunk, Sumo Logic, ELK stack, etc. Experience with incident management tools like ServiceNow, JIRA, and PagerDuty. Basic knowledge of web fundamentals: HTML, JavaScript, CSS, server-side programming, and databases. Working knowledge of AWS or other cloud platforms. Experience with containers and orchestration platforms: Docker and Kubernetes. Familiarity with CI/CD ( Jenkins ) pipelines and automation tools. Knowledge of infrastructure-as-code using Terraform. Strong collaboration skills with cross-functional teams, including SRE, Security, and DevOps. Additional Information At Freshworks, we are creating a global workplace that enables everyone to find their true potential, purpose, and passion irrespective of their background, gender, race, sexual orientation, religion and ethnicity. We are committed to providing equal opportunity for all and believe that diversity in the workplace creates a more vibrant, richer work environment that advances the goals of our employees, communities and the business.

Posted 2 weeks ago

Apply

4.0 years

0 Lacs

Hyderābād

On-site

Requisition Number: 101579 Cloud Engineer III - Azure Infra/Migration/IaC/DevOps Shift: 2 PM- 11 PM IST Location: Delhi NCR, Hyderabad, Bangalore, Pune, Mumbai, Chennai, this is a hybrid work opportunity. Insight at a Glance 14,000+ engaged teammates globally with operations in 25 countries across the globe. Received 35+ industry and partner awards in the past year $9.2 billion in revenue #20 on Fortune’s World's Best Workplaces™ list #14 on Forbes World's Best Employers in IT – 2023 #23 on Forbes Best Employers for Women in IT- 2023 Now is the time to bring your expertise to Insight. We are not just a tech company; we are a people-first company. We believe that by unlocking the power of people and technology, we can accelerate transformation and achieve extraordinary results. As a Fortune 500 Solutions Integrator with deep expertise in cloud, data, AI, cybersecurity, and intelligent edge, we guide organizations through complex digital decisions. About the role As a Cloud Engineer III, you will be part of the consulting practice, utilizing cutting-edge automation tools and provisioning in public cloud providers—preferably Azure, AWS, or GCP. You will be responsible for designing and deploying well-architected cloud solutions. The ideal candidate will have experience in customer-facing roles and a proven track record of delivering cloud solutions with Infrastructure as Code (IaC) automation on various projects. Along the way, you will: Design scalable, secure, and resilient cloud infrastructure (primarily on Azure, AWS, or GCP). Create architecture diagrams, deployment strategies, and cloud roadmaps. Deploy and configure cloud resources such as VMs, storage, networking, containers, and databases. Automate infrastructure provisioning using tools like Terraform, ARM templates, or Bicep. Set up CI/CD pipelines using tools like Azure DevOps, GitHub Actions, or Jenkins. Implement Infrastructure as Code (IaC) and configuration management. Support microservices-based architecture designs. Set up application and infrastructure monitoring with tools like Prometheus, Grafana, Datadog, New Relic, or Azure Monitor. Perform cost optimization and performance tuning. Implement cloud security best practices, including identity and access management (IAM), encryption, firewall rules, and network security groups. Collaborate with Insight and client teams, following Agile/Scrum methodologies and ceremonies. Communicate effectively and professionally with teammates, client personnel, and stakeholders. What we’re looking for Bachelor’s degree in information technology, Computer Science, or related field preferred, or equivalent practical experience. 4-6 years of relevant experience in a similar or related role is required. Any relevant cloud certification is a plus. Hands-on experience with one or more cloud providers (AWS, Azure, GCP) is a must. Azure being the primary cloud. Familiarity with writing infrastructure as code (e.g., Terraform, Azure Bicep, ARM templates, CloudFormation) is a must. Working experience with at least one of the CI/CD tools and version control systems (e.g. Azure DevOps, GitHub Actions, Jenkins, Git, GitHub, Azure Repos) is required. Familiarity with Windows and Linux/Unix-based systems is a must. Proficiency in Azure infrastructure cloud services like Azure VM, VNET, Storage, Monitoring, Azure Functions, Load Balancers, Azure AD, Azure DNS, Traffic managers and Application Gateways for network optimization. Knowledge of Azure Kubernetes Service (AKS), Docker containers, and application monitoring services such as Prometheus, Grafana, Datadog, and New Relic is highly desirable. Experience in application deployment and management within cloud environments. Hands-on knowledge of Docker and container lifecycle management. Experience in deploying and managing distributed applications in production-grade environments What you can expect- We’re legendary for taking care of you, your family, and helping you engage with your local community. We want you to enjoy a full, meaningful life and own your career at Insight. Some of our benefits include: Freedom to work from another location—even an international destination—for up to 30 consecutive calendar days per year. Medical Insurance Health Benefits Professional Development: Learning Platform and Certificate Reimbursement Shift Allowance But what really sets us apart are our core values of Hunger, Heart, and Harmony, which guide everything we do, from building relationships with teammates, partners, and clients to making a positive impact in our communities. Join us today, your ambitious journey starts here. Insight is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, sexual orientation or any other characteristic protected by law. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process. At Insight, we celebrate diversity of skills and experience so even if you don’t feel like your skills are a perfect match - we still want to hear from you! Today's talent leads tomorrow's success. Learn more about Insight: https://www.linkedin.com/company/insight/ Insight is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, sexual orientation or any other characteristic protected by law. Insight India Location:Level 16, Tower B, Building No 14, Dlf Cyber City In It/Ites Sez, Sector 24 &25 A Gurugram Gurgaon Hr 122002 India

Posted 2 weeks ago

Apply

3.0 - 7.0 years

7 - 8 Lacs

Hyderābād

On-site

CORE BUSINESS OPERATIONS The Core Business Operations (CBO) portfolio is an integrated set of offerings that addresses our clients’ heart-of-the-business issues. This portfolio combines our functional and technical capabilities to help clients transform, modernize, and run their existing technology platforms across industries. As our clients navigate dynamic and disruptive markets, these solutions are designed to help them drive product and service innovation, improve financial performance, accelerate speed to market, and operate their platforms to innovate continuously. ROLE Level: Consultant As an Consultant at Deloitte Consulting, you will be responsible for individually delivering high quality work products within due timelines in an agile framework. Need-basis consultants will be mentoring and/or directing junior team members/liaising with onsite/offshore teams to understand the functional requirements. Collaborate with development and operations teams to enhance system reliability, scalability, and efficiency through effective infrastructure management. Implement and oversee deployment processes to ensure optimal system performance. Drive the implementation of continuous integration/continuous deployment (CI/CD) processes for streamlined and efficient operations. You will also be responsible for the ownership of tasks assigned through SNOW, Dashboard, Order forms etc. The work you will do includes: Lead DevOps initiatives overseeing AWS cloud infrastructure. Employ CloudFormation, Terraform, Ansible for automated provisioning. Administer Windows/Linux on EC2 instances, ensuring security patches and updates. Collaborate with cross-functional teams to deploy secure, scalable, cost-effective solutions. Implement monitoring/logging for infrastructure and applications. Automate/Execute tasks related to IAM, monitoring, backup, and vulnerability remediation. Contribute to performance testing, capacity planning, and documentation efforts. Develop and manage CI/CD pipelines using AWS CodePipeline, Jenkins, Concourse etc. Facilitate knowledge transfer to junior resources and provide weekend on-call support on a rotational basis. QUALIFICATIONS Skills / Project Experience: Must Have: Proficient in Windows/Linux administration, 2-tier, 3-tier, and multi-tier architecture, IaaS/PaaS/SaaS, disaster recovery, and networking/security. Strong scripting skills in PowerShell, Shell, and Python. Hold AWS Solution Architecture certification, Architecture, and DevOps certifications. Familiarity with Agile methodology, Git branching strategy, and ITIL foundations. Expertise in CloudWatch, DataDog, Dynatrace, and infrastructure monitoring tools. Good interpersonal and communication skills, adapting innovation to varied business domains. Good to Have: Education: B.E./B. Tech/M.C.A./M.Sc (CS) degree or equivalent from accredited university Prior Experience: 3 – 7 years of experience working with AWS Infra and DEVOPS Location: Hyderabad, Pune The team Deloitte Consulting LLP’s Technology Consulting practice is dedicated to helping our clients build tomorrow by solving today’s complex business problems involving strategy, procurement, design, delivery, and assurance of technology solutions. Our service areas include analytics and information management, delivery, cyber risk services, and technical strategy and architecture, as well as the spectrum of digital strategy, design, and development services Core Business Operations Practice optimizes clients’ business operations and helps them take advantage of new technologies. Drives product and service innovation, improves financial performance, accelerates speed to market, and operates client platforms to innovate continuously. Learn more about our Technology Consulting practice on www.deloitte.com For information on CBO visit - https://www.youtube.com/watch?v=L1cGlScLuX0 For information on life of an Analyst at CBO visit- https://www.youtube.com/watch?v=CMe0DkmMQHI Our purpose Deloitte’s purpose is to make an impact that matters for our people, clients, and communities. At Deloitte, purpose is synonymous with how we work every day. It defines who we are. Our purpose comes through in our work with clients that enables impact and value in their organizations, as well as through our own investments, commitments, and actions across areas that help drive positive outcomes for our communities. Our people and culture Our inclusive culture empowers our people to be who they are, contribute their unique perspectives, and make a difference individually and collectively. It enables us to leverage different ideas and perspectives, and bring more creativity and innovation to help solve our clients' most complex challenges. This makes Deloitte one of the most rewarding places to work. Professional development At Deloitte, professionals have the opportunity to work with some of the best and discover what works best for them. Here, we prioritize professional growth, offering diverse learning and networking opportunities to help accelerate careers and enhance leadership skills. Our state-of-the-art DU: The Leadership Center in India, located in Hyderabad, represents a tangible symbol of our commitment to the holistic growth and development of our people. Explore DU: The Leadership Center in India. Benefits to help you thrive At Deloitte, we know that great people make a great organization. Our comprehensive rewards program helps us deliver a distinctly Deloitte experience that helps that empowers our professionals to thrive mentally, physically, and financially—and live their purpose. To support our professionals and their loved ones, we offer a broad range of benefits. Eligibility requirements may be based on role, tenure, type of employment and/ or other criteria. Learn more about what working at Deloitte can mean for you. Recruiting tips From developing a stand out resume to putting your best foot forward in the interview, we want you to feel prepared and confident as you explore opportunities at Deloitte. Check out recruiting tips from Deloitte recruiters. Requisition code: 306524

Posted 2 weeks ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies