
3006 Datadog Jobs - Page 36

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

0 years

0 Lacs

Pune, Maharashtra, India

On-site

Some careers shine brighter than others. If you’re looking for a career that will help you stand out, join HSBC and fulfil your potential. Whether you want a career that could take you to the top, or simply take you in an exciting new direction, HSBC offers opportunities, support and rewards that will take you further. HSBC is one of the largest banking and financial services organisations in the world, with operations in 64 countries and territories. We aim to be where the growth is, enabling businesses to thrive and economies to prosper, and, ultimately, helping people to fulfil their hopes and realise their ambitions. We are currently seeking an experienced professional to join our team in the role of Consultant Specialist.

In this role, you will:
The DevOps Engineering job is responsible for developing automations across the Technology delivery lifecycle including construction, testing, release and ongoing service management, and monitoring of a product or service within a Technology team. They will be required to continually enhance their skills within a number of specialisms which include CI/CD, automation, pipeline development, security, testing, and operational support. This role will carry out some or all of the following activities:
The role of the DevOps engineer is to help application teams across the Bank deploy their applications across GCP services like GKE Container, BigQuery, Dataflow, PubSub, Kafka.
The DevOps Engineer should be the go-to person when an application team faces any issue during platform adoption, onboarding, deployment and environment troubleshooting.
Ensure service resilience, service sustainability and recovery time objectives are met for all the software solutions delivered.
Responsible for automating the continuous integration / continuous delivery pipeline within a DevOps Product/Service team, driving a culture of continuous improvement.
Keep up to date and have expertise on current tools, technologies and areas like cyber security and regulations pertaining to aspects like data privacy, consent, data residency etc. that are applicable.
End-to-end accountability for a product or service, identifying and developing the most appropriate Technology solutions to meet customer needs as part of the Customer Journey.
Liaise with other engineers, architects, and business stakeholders to understand and drive the product or service’s direction.
Analyze production errors to define and create tools that help mitigate problems in the system design stage, and apply user-defined integrations, improving the user experience.

Requirements
To be successful in this role, you should meet the following requirements:
Bachelor’s degree in Computer Science or related disciplines
6 or more years of hands-on development experience building fully self-serve, observable solutions using Infrastructure and Policy as Code
Proficiency developing with modern programming languages and ability to rapidly develop proof-of-concepts
Ability to work with geographically distributed and cross-functional teams
Expert in code deployment tools (Jenkins, Puppet, Ansible, Git, Selenium, and Chef)
Expert in automation tools (CloudFormation, Terraform, shell script, Helm, Ansible)
Familiar with containers (Docker, Docker Compose, Kubernetes, GKE)
Familiar with monitoring (Datadog, Grafana, Prometheus, AppDynamics, New Relic, Splunk)

The successful candidate will also meet the following requirements:
Good understanding of GCP Cloud or Hybrid Cloud approach implementations
Good understanding of and experience with MuleSoft, PCF, or any gateway server implementations
Hands-on experience in Kong API Gateway platform
Good understanding of and experience with Middleware and MQ areas
Familiar with infrastructure support: Apache Gateway, runtime server configurations, SSL cert setup, etc.

You’ll achieve more when you join HSBC.

Posted 4 weeks ago


3.0 - 5.0 years

12 - 24 Lacs

Bengaluru

Work from Office

Experience writing test cases and building test frameworks, and with AWS services (Lambda, S3, EC2, CloudWatch) and Datadog. Experience with software/systems testing concepts. Experience with automotive security, IT security, and Linux security concepts. Testing embedded systems and IoT devices.

Posted 4 weeks ago


6.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Sprinklr is a leading enterprise software company for all customer-facing functions. With advanced AI, Sprinklr's unified customer experience management (Unified-CXM) platform helps companies deliver human experiences to every customer, every time, across any modern channel. Headquartered in New York City with employees around the world, Sprinklr works with more than 1,000 of the world’s most valuable enterprises — global brands like Microsoft, P&G, Samsung and more than 50% of the Fortune 100. Learn more about our culture and how we make our employees happier through The Sprinklr Way. Job Description Lead end-to-end management of Critical Service Outages (P0/P1 incidents), driving timely resolution through coordinated incident response, effective communication with stakeholders, and robust post incident reviews with actionable remediation. Oversee a 24x7 Network Operations Center (NOC), implementing scalable observability, alerting, and monitoring strategies to ensure infrastructure, application, and network reliability. Continuously optimize alert triage, diagnostics, and noise reduction to boost efficiency. Build and develop a high-performing team of incident managers, NOC engineers, and shift leads. Foster operational maturity through training, performance management, and close collaboration with Engineering, SRE, DevOps, and Product teams. Define and uphold standards for incident SLAs, escalation processes, runbooks, and playbooks, while ensuring continuous shift coverage, smooth handoffs, and comprehensive KPI reporting on system health and incident trends. Required Skills: 6+ years of experience in Technical Operations, Site Reliability, NOC, or Incident Management roles. 2+ years in a people management or team leadership role. Deep knowledge of major incident management, escalation practices, and real-time service recovery strategies. 
Strong technical understanding of cloud-native architectures (AWS, Azure, GCP), infrastructure monitoring, and DevOps practices. Proven experience working with observability tools (e.g., Datadog, Splunk, Grafana, Prometheus), incident tools (PagerDuty), and ITSM platforms (e.g., ServiceNow, Jira). Prior experience supporting high-availability SaaS or telecommunications systems is a strong plus. Experience with customer-facing incident communication practices. Why You'll Love Sprinklr: We're committed to creating a culture where you feel like you belong, are happier today than you were yesterday, and your contributions matter. At Sprinklr, we passionately, genuinely care. For full-time employees, we provide a range of comprehensive health plans, leading well-being programs, and financial protection for you and your family through a range of global and localized plans throughout the world. For more information on Sprinklr Benefits around the world, head to https://sprinklrbenefits.com/ to browse our country-specific benefits guides. We focus on our mission: We founded Sprinklr with one mission: to enable every organization on the planet to make their customers happier. Our vision is to be the world’s most loved enterprise software company, ever. We believe in our product: Sprinklr was built from the ground up to enable a brand’s digital transformation. Its platform provides every customer-facing team with the ability to reach, engage, and listen to customers around the world. At Sprinklr, we have many of the world's largest brands as our clients, and our employees have the opportunity to work closely alongside them. We invest in our people: At Sprinklr, we believe every human has the potential to be amazing. We empower each Sprinklrite in the journey toward achieving their personal and professional best. For wellbeing, this includes daily meditation breaks, virtual fitness, and access to Headspace. 
We have continuous learning opportunities available with LinkedIn Learning and more. EEO - Our philosophy: Our goal is to ensure every employee feels like they belong and are operating in a judgment-free zone regardless of gender, race, ethnicity, age, and lifestyle preference, among others. We value and celebrate diversity and fervently believe every employee matters and should be respected and heard. We believe we are stronger when we belong because collectively, we’re more innovative, creative, and successful. Sprinklr is proud to be an equal-opportunity workplace and is an affirmative-action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. See also Sprinklr’s EEO Policy and EEO is the Law.

Posted 4 weeks ago


3.0 - 5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Product Manager

Our Client: The company (founded in 2020) is an industry-leading, first-of-its-kind in India digital healthcare data platform and exchange, infused with AI/ML capabilities, delivering solutions to stakeholders in all segments of the healthcare sector.

Job Title: Product Manager
Education: Graduate (Technical background) or MBA preferred
Experience: 3 - 5 years (preferably in B2B SaaS, HealthTech, or FinTech platforms)
Location: Bangalore (Hybrid - 3-4 days from office)

About the Role: As a Product Manager focused on Integrations, you will lead critical initiatives that power the seamless exchange of data between the company and its payer partners. You will own product areas spanning payer integration frameworks, transaction health & monitoring, core transaction lifecycle management, and platform roadmap execution. This is a high-impact role requiring a strong blend of systems thinking, stakeholder collaboration, API and workflow design, and platform-scale product delivery.

Roles & Responsibilities:

1. Integration Ownership: Lead the product strategy and roadmap for payer-side integrations (RESTful APIs, RPA bots, email ingestion, etc.). Define reusable integration patterns across payers with varying levels of tech maturity. Work closely with engineering and implementation teams to deliver scalable and secure integration mechanisms.

2. Transaction Platform Management: Own and enhance the transaction pipeline for core health insurance operations: preauthorization, enhancements, discharge, and settlement. Build capabilities for idempotent and reliable transaction orchestration. Ensure the platform is performant, auditable, and supports both API and semi-automated workflows.

3. Data-driven Transaction Health: Define and monitor metrics like transaction latency, success/failure rates, retries, and drops. Partner with data engineering and analytics to expose dashboards and alerts for internal and external consumption. Translate platform telemetry into proactive product improvements.

4. Execution and Delivery: Drive cross-functional sprints with engineering, QA, and customer success for release execution. Ensure documentation, GTM enablement, and internal stakeholder training. Manage platform backlog, maintain sprint discipline, and communicate roadmap progress transparently.

5. Stakeholder Engagement: Collaborate with customer success, operations, and client onboarding teams to refine payer onboarding journeys. Act as the product POC for payer partnerships from integration through to steady-state.

Job Qualifications and Requirements:

Must-Haves: 3 - 5 years of experience in product management or platform/technical program management. Attitude to get things done. Problem solver at core. Demonstrated success in managing API-based B2B integrations or transaction platforms. Strong understanding of RESTful APIs, JSON, webhook design, and workflow engines. Experience building back-end/platform features with cross-functional engineering teams. Systems thinker, capable of designing reusable frameworks and scalable abstractions.

Good-to-Have: Prior experience in HealthTech, InsurTech, and/or enterprise SaaS. Familiarity with EHR systems, payer-provider transaction types, or healthcare data standards (X12, HL7, FHIR). Exposure to observability tools like Prometheus, Grafana, ELK, or Datadog. Experience with enterprise integrations, RPA, email-based automation, or hybrid integration patterns.

About Hireginie: Hireginie is a prominent talent search company specializing in connecting top talent with leading organizations. We are committed to excellence and offer customized recruitment solutions across industries, ensuring a seamless and transparent hiring process. Our mission is to empower both clients and candidates by matching the right talent with the right opportunities, fostering growth and success for all.

Posted 4 weeks ago


8.0 years

0 Lacs

Jaipur, Rajasthan, India

On-site

Role Overview
As a core member of the backend engineering team, you will design, develop, and maintain essential services powering Dreamcast’s event-tech SaaS products. These services include registration, messaging, virtual event tools, CRM, and more. You will create scalable, secure, and high-performance APIs and background jobs that integrate across web platforms and third-party services.

Key Responsibilities
Design and develop backend services using Nest.js (Node.js + TypeScript) and Laravel
Build and maintain RESTful APIs, webhooks, message queues, and job processors
Integrate with third-party services such as Twilio, SendGrid, AWS SES, Facebook, Gupshup, and Interakt
Develop Redis-based queues (BullMQ) for asynchronous task processing
Write optimized SQL queries and manage relational database schemas (MySQL/PostgreSQL)
Synchronize data flows between Laravel and Nest.js services
Collaborate with frontend teams (Vue 3 / React) to support full-stack delivery
Conduct code reviews, mentor junior developers, and maintain clean, scalable architecture

Required Qualifications
5–8 years of backend development experience with production-level applications
Expertise in Laravel and Nest.js (or Node.js with TypeScript)
Proficiency in building and consuming APIs, background job handling, and queue management
Solid experience with Redis, BullMQ, and asynchronous workflows
Strong database knowledge in MySQL/PostgreSQL, including schema design and query optimization
Comfortable working with Git, CI/CD workflows, and versioned codebases
Bachelor’s degree in Computer Science, Engineering, or equivalent technical field

Preferred Skills
Experience with AWS services (EC2, Lambda, RDS, S3, CloudWatch, IAM)
Familiarity with Docker, containerization, and CI/CD pipelines
Understanding of microservices architecture and serverless design
Exposure to observability tools like Grafana, Datadog, or New Relic
Leadership qualities, with past experience mentoring or leading technical teams

Posted 4 weeks ago


15.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Responsibilities Leadership & Strategy Define and execute the application support strategy aligned with business goals. Lead and mentor a global team of support engineers and managers. Establish KPIs and SLAs to measure and improve support performance. Define strategies and establish support process with Icertis solution partners. Customer Focus Customer focused leader with proven ability to build relations based on trust & professionalism. Must possess excellent management skills with a successful track record of driving support, adoption and value realization for global customers of enterprise products. Operational Excellence Ensure 24/7 support coverage for critical applications. Implement ITIL best practices for incident, problem, and change management. Drive root cause analysis and continuous improvement initiatives. Collaboration & Communication Partner with Product, Engineering, QA, and Customer Success teams to ensure seamless issue resolution. Act as an escalation point for critical incidents and customer concerns. Communicate effectively with stakeholders on support metrics, trends, and improvement plans. Technology & Tools Evaluate and implement support tools and platforms (e.g., ticketing systems, monitoring tools). Leverage automation and AI to improve support efficiency and reduce manual effort. Compliance & Risk Management Ensure compliance with data protection, security, and regulatory requirements. Manage risk through proactive monitoring and mitigation strategies. Qualifications Bachelor’s or Master’s degree in Computer Science, Information Technology, or related field. 15+ years of experience in application support, with at least 3+ years in a leadership role. Proven experience in managing global support teams for SaaS or enterprise software products. Entrepreneurial hands on working style to develop and deliver business outcomes — effectively doing so even when resource and time frame constraints exist. 
Strong understanding of ITIL, DevOps, and Agile methodologies. Excellent communication, leadership, and stakeholder management skills. Preferred Skills Experience with cloud platforms (AWS, Azure, GCP). Familiarity with observability tools (Datadog, Splunk, New Relic). Knowledge of database and application performance tuning. Certifications in ITIL, PMP, or similar frameworks. About Us Icertis is the global leader in AI-powered contract intelligence. The Icertis platform revolutionizes contract management, equipping customers with powerful insights and automation to grow revenue, control costs, mitigate risk, and ensure compliance - the pillars of business success. Today, more than one third of the Fortune 100 trust Icertis to realize the full intent of millions of commercial agreements in 90+ countries. About The Team Who we are: Icertis is the only contract intelligence platform companies trust to keep them out in front, now and in the future. Our unwavering commitment to contract intelligence is grounded in our FORTE values (Fairness, Openness, Respect, Teamwork and Execution), which guide all our interactions with employees, customers, partners, and stakeholders. Because in our mission to be the contract intelligence platform of the world, we believe how we get there is as important as the destination. Icertis, Inc. provides Equal Employment Opportunity to all employees and applicants for employment without regard to race, color, religion, gender identity or expression, sex, sexual orientation, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state and local laws. Icertis, Inc. complies with applicable state and local laws governing non-discrimination in employment in every location in which the company has facilities. 
If you are in need of accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to careers@icertis.com or get in touch with your recruiter.

Posted 4 weeks ago


5.0 - 9.0 years

0 Lacs

Mumbai, Maharashtra, India

On-site

Sr Infrastructure Support, 5 to 9 years. Location: Mumbai, 5 days WFO (24/7 rotational shifts). Job Description: Technical Skills: You have hands-on experience in using CI/CD tools such as Jenkins, CircleCI or GitLab for executing deployments. You have knowledge of Infrastructure as Code (IaC) tech stacks such as Terraform, Ansible, ARM or CloudFormation to provision and manage infrastructure. You have working experience in using observability tools for logging, monitoring, tracing and alerting, e.g.: Datadog/Prometheus/Grafana, ELK/EFK/Splunk. You have experience in supporting at least one public cloud, e.g.: AWS, Azure or GCP. You have hands-on experience executing most common operations in managing workloads on any container ecosystem tech stack, e.g.: Docker, Kubernetes, OpenShift, etc. You understand system performance tuning and scaling to handle common heavy load scenarios along with concepts of highly available systems and basics of disaster recovery solutions, and are familiar with failover, backup and recovery concepts. You have experience operating a Linux OS such as RHEL or a Debian-based OS and are familiar with most common Linux OS operations and commands, reading and tweaking Bash scripts, and managing runtime environment configurations such as env vars, logs, etc. You have experience supporting backend storage solutions such as SQL and NoSQL databases, e.g.: Postgres and MongoDB, and caching solutions such as Redis and Memcached. You have experience in networking configuration and security, and are familiar with common networking setup and security practices, e.g.: load balancing, proxies, transport layer security (TLS) and certificate management, and an understanding of standard network protocols and configurations. You have a good understanding of fundamental concepts of APIs such as request, response, headers, authentication, JSON payloads, etc. 
Other things to know Learning & Development There is no one-size-fits-all career path at Thoughtworks: however you want to develop your career is entirely up to you. But we also balance autonomy with the strength of our cultivation culture. This means your career is supported by interactive tools, numerous development programs and teammates who want to help you grow. We see value in helping each other be our best and that extends to empowering our employees in their career journeys.

Posted 4 weeks ago


12.0 - 16.0 years

0 Lacs

Pune, Maharashtra

On-site

Success in the role requires agility and results orientation, strategic and innovative thinking, a proven track record of delivering new customer-facing software products at scale, rigorous analytical skills, and a passion for automation and data-driven approaches to solving problems. As a Director of eCommerce Engineering, your responsibilities include overseeing and leading the engineering project delivery for the ECommerce Global Multi-Tenant Platform. You will ensure high availability, scalability, and performance to support global business operations. Defining and executing the engineering strategy that aligns with the company's business goals and long-term vision for omnichannel retail is crucial. Establishing robust processes for code reviews, testing, and deployment to ensure high-quality deliverables is also part of your role. You will actively collaborate with Product Management, Business Stakeholders, and other Engineering Teams to define project requirements and deliver customer-centric solutions. Serving as a key point of contact for resolving technical challenges and ensuring alignment between business needs and technical capabilities. Promoting seamless communication between teams to deliver cross-functional initiatives on time and within budget is essential. Building a strong and diverse engineering team by attracting, recruiting, and retaining top talent is a key responsibility. Designing and implementing a robust onboarding program to ensure new hires are set up for success. Coaching team members to enhance technical expertise, problem-solving skills, and leadership abilities, fostering a culture of continuous learning and improvement. Maintaining a strong pipeline of talent by building relationships with local universities, engineering communities, and industry professionals is also part of your role. You will define clear, measurable goals for individual contributors and teams to ensure alignment with broader organizational objectives. 
Conducting regular one-on-one meetings to provide personalized feedback, career guidance, and development opportunities. Managing performance reviews and recognizing high-performing individuals, while providing coaching and support to those needing improvement. Fostering a culture of accountability, where team members take ownership of their work and deliver results. Championing the adoption of best practices in software engineering, including agile methodologies, DevOps, and automation is crucial. Facilitating and encouraging knowledge sharing and expertise in critical technologies, such as cloud computing, microservices, and AI/ML. Evaluating and introducing emerging technologies that align with business goals, driving innovation and competitive advantage is part of your responsibility. Developing and executing a continuous education program to upskill team members on key technologies and the Williams-Sonoma business domain is essential. Organizing training sessions, workshops, and certifications to keep the team updated on the latest industry trends. Encouraging team members to actively participate in tech conferences, hackathons, and seminars to broaden their knowledge and network is also important. Accurately estimating development efforts for projects, considering complexity, risks, and resource availability. Developing and implementing project plans, timelines, and budgets to deliver initiatives on schedule. Overseeing system rollouts and implementation efforts to ensure smooth transitions and minimal disruptions to business operations. Optimizing resource allocation to maximize team productivity and ensure proper workload distribution is a key responsibility. Championing initiatives to improve the engineering organization's culture, focusing on collaboration, transparency, and inclusivity. Continuously evaluating and refining engineering processes to increase efficiency and reduce bottlenecks. 
Promoting team well-being by fostering a positive and supportive work environment where engineers feel valued and motivated. Leading efforts to make the organization a "Great Place to Work," including regular engagement activities, mentorship programs, and open communication. Developing a deep understanding of critical systems and processes, including platform architecture, APIs, data pipelines, and DevOps practices. Providing technical guidance to the team, addressing complex challenges, and ensuring alignment with architectural best practices. Partnering with senior leaders to align technology decisions with business priorities and future-proof the company's systems. Playing a pivotal role in transforming Williams-Sonoma into a leading technology organization by implementing cutting-edge solutions in eCommerce, Platform Engineering, AI, ML, and Data Science. Driving the future of omnichannel retail by conceptualizing and delivering innovative products and features that enhance customer experiences. Actively representing the organization in the technology community, building a strong presence through speaking engagements, partnerships, and contributions to open-source projects. Identifying opportunities for process automation and optimization to improve operational efficiency. Being adaptable to perform other duties as required, addressing unforeseen challenges, and contributing to organizational goals. Staying updated on industry trends and competitive landscapes to ensure the company remains ahead of the curve. Williams-Sonoma Inc. is the premier specialty retailer of high-quality products for the kitchen and home in the United States. Founded in 1956, it is now one of the United States' largest e-commerce retailers with well-known brands in home furnishings. The India Technology Center serves as a critical hub for innovation, focusing on developing cutting-edge solutions in areas such as e-commerce, supply chain optimization, and customer experience management. 
Through advanced technologies like artificial intelligence, data analytics, and machine learning, the India Technology Center plays a crucial role in accelerating Williams-Sonoma's growth and maintaining its competitive edge in the global market.

Posted 4 weeks ago


0 years

0 Lacs

Gurugram, Haryana, India

On-site

The Role
As a DevOps Engineer, you’ll play a key role in building and maintaining a robust, scalable, and reliable zero-downtime platform. You’ll work hands-on with a recently kick-started greenfield initiative with modern infrastructure and automation tools to support our engineering teams. This is a great opportunity to work with a forward-thinking team, with the freedom to approach problems with fresh thinking, embedding AI and automation and helping shape our cloud-native journey. If you’re passionate about automation, cloud infrastructure, and delivering high-quality production-grade platforms, this role offers the chance to make a real impact.

Key Responsibilities
Hands-On Development: Design, implement, and optimise AWS infrastructure through hands-on development using Infrastructure as Code tools.
Automation & CI/CD: Develop and maintain CI/CD pipelines to automate fast, secure and seamless deployments.
Platform Reliability: Ensure high availability, scalability, and resilience of our platform, leveraging managed services.
Monitoring & Observability: Implement and manage proactive observability using Datadog and other tools to monitor system health, performance, and security, making sure we can see and fix issues before they impact users.
Cloud Security & Best Practices: Apply cloud and security best practices, including patching and secure configuration of networking, encryption (at rest and in transit), secrets and identity/access management.
Continuous Improvement: Contribute ideas and solutions to improve our DevOps processes.
AI & Future Tech: We want to push the boundaries of AI-driven development; if you have ideas on how to embed AI into our DevOps processes, you’ll have the space to explore them.

Your Experience
Tech stack we use: Terraform, Terragrunt, Helm, Python, Bash, AWS (EKS, Lambda, EC2, RDS/Aurora), Linux OS & GitHub Actions. You’re comfortable with all of these, and have strong hands-on experience with Terraform and IaC principles, CI/CD and the AWS ecosystem.
Proven experience with Networking (VPC, Subnets, Security Groups, API Gateway, Load Balancing, WAF) and Cloud configuration (Secrets Manager, IAM, KMS).
Comfortable with Kubernetes, ArgoCD, Istio & deployment strategies (blue/green & canary).
Familiarity with Cloud Security services such as Security Hub, GuardDuty, Inspector and vulnerability
Observability Mindset: You believe in measuring everything. You’ve worked with Datadog (or similar) to ensure teams have visibility into platform health and security.
Experience with embedding AI into DevOps processes is advantageous.
(ref:hirist.tech)

Posted 4 weeks ago


4.0 - 6.0 years

0 Lacs

Gurgaon, Haryana, India

Remote

Experience Required: 4-6 years
Location: Gurgaon
Department: Product and Engineering
Working Days: Alternate Saturdays Working (1st and 3rd)

🔧 Key Responsibilities
Design, implement, and maintain highly available and scalable infrastructure using AWS Cloud Services.
Build and manage Kubernetes clusters (EKS, self-managed) to ensure reliable deployment and scaling of microservices.
Develop Infrastructure-as-Code using Terraform, ensuring modular, reusable, and secure provisioning.
Containerize applications and optimize Docker images for performance and security.
Ensure CI/CD pipelines (Jenkins, GitHub Actions, etc.) are optimized for fast and secure deployments.
Drive SRE principles including monitoring, alerting, SLIs/SLOs, and incident response.
Set up and manage observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
Automate routine tasks with scripting languages (Python, Bash, etc.).
Lead capacity planning, auto-scaling, and cost optimization efforts across cloud infrastructure.
Collaborate closely with development teams to enable DevSecOps best practices.
Participate in on-call rotations, handle outages with calm, and conduct postmortems.

🧰 Must-Have Technical Skills
Kubernetes (EKS, Helm, Operators)
Docker & Docker Compose
Terraform (modular, state management, remote backends)
AWS (EC2, VPC, S3, RDS, IAM, CloudWatch, ECS/EKS)
Linux system administration
CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions)
Logging & monitoring tools: ELK, Prometheus, Grafana, CloudWatch
Site Reliability Engineering practices
Load balancing, autoscaling, and HA architectures

💡 Good-To-Have
GCP or Azure exposure
Service Mesh (Istio, Linkerd)
Secrets management (Vault, AWS Secrets Manager)
Security hardening of containers and infrastructure
Chaos engineering exposure
Knowledge of networking (DNS, firewalls, VPNs)

👤 Soft Skills
Strong problem-solving attitude; calm under pressure
Good documentation and communication skills
Ownership mindset with a drive to automate everything
Collaborative and proactive with cross-functional teams

Posted 4 weeks ago

5.0 years

0 Lacs

India

Remote

Senior Reporting Analyst - Infrastructure Location: 100 % Remote in India (EST) Contract About the Role: We are looking for an experienced Senior Reporting Analyst with a passion for transforming technical infrastructure data into actionable business intelligence. This role is perfect for a highly analytical professional with a Big 5 consulting background who excels at crafting compelling executive presentations and partnering with IT leaders to drive informed decisions. As a strategic member of the Infrastructure & Reporting team, you will play a key role in turning raw data from network operations, cybersecurity, and IT systems into clear, impactful insights for C-level stakeholders. You’ll help shape technology strategy by enhancing infrastructure performance visibility and reporting maturity across the organization. Key Responsibilities: Develop and deliver sophisticated dashboards and reports covering critical infrastructure areas such as networks, firewalls, VPNs, and system performance. Visualize complex data into compelling narratives for executive audiences including CIOs, CTOs, and business leaders. Gather, analyze, and interpret data from tools such as Jira, Confluence, ITSM platforms, and infrastructure monitoring systems to support incident, change, and asset management reporting. Partner closely with network engineering and cybersecurity teams to ensure data accuracy, reliability, and consistency. Define and track Key Performance Indicators (KPIs) including system uptime, security event trends, network throughput, and infrastructure health metrics. Lead the continuous improvement of infrastructure reporting processes, with an emphasis on automation and proactive performance monitoring. Provide regular updates and data-driven recommendations to senior leadership, influencing operational strategy and investment decisions. 
Identify trends in infrastructure operations, propose enhancements, and develop reports that help mitigate risks and drive resilience. Support incident post-mortem reporting, helping teams learn from outages and strengthen operational processes. Required Skills & Experience: 5+ years of experience in data analytics, business intelligence, or IT reporting roles with a focus on infrastructure or IT services. Proven track record with a Big 5 consulting firm (Accenture, Deloitte, PwC, EY, KPMG) in delivering high-impact reporting or advisory services. Strong understanding of IT infrastructure components: routers, switches, firewalls, VPNs, and network performance indicators. Proficiency in Power BI, Tableau, and other data visualization tools, with the ability to create impactful dashboards and executive-level presentations. Skilled in Excel and PowerPoint for rapid analysis and visual storytelling. Familiarity with Jira, Confluence, ITSM tools, and infrastructure monitoring solutions. Excellent stakeholder management, communication, and storytelling skills, with the ability to translate technical information into actionable insights. Hands-on experience in dashboard automation, KPI development, and IT operations reporting. Preferred Qualifications: Previous exposure to the pharmaceutical industry is highly desirable. Bachelor’s or Master’s degree in Computer Science, Data Analytics, Information Systems, or a related technical discipline. Knowledge of ITIL practices, change management, and infrastructure lifecycle processes. Familiarity with infrastructure monitoring platforms such as Splunk, SolarWinds, Datadog, or similar solutions. Experience with Smartsheet, Canva, or other collaborative reporting tools is an added advantage. What You'll Achieve: Your dashboards and insights will become the backbone of executive decision-making. You’ll help optimize infrastructure stability, security, and performance through meaningful data analysis. 
You will be a trusted advisor to senior leadership, driving infrastructure strategy with evidence-based recommendations.

Posted 4 weeks ago

3.0 - 6.0 years

12 - 22 Lacs

Gurugram, Bengaluru, Mumbai (All Areas)

Work from Office

In the role of a DevOps Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and CI/CD pipelines necessary to support our Generative AI projects. Furthermore, you will have the opportunity to critically assess and influence the engineering design, architecture, and technology stack across multiple products, extending beyond your immediate focus.
- Design, deploy, and manage scalable, reliable, and secure Azure cloud infrastructure to support Generative AI workloads.
- Implement monitoring, logging, and alerting solutions to ensure the health and performance of AI applications.
- Optimize cloud resource usage and costs while ensuring high performance and availability.
- Work closely with Data Scientists and Machine Learning Engineers to understand their requirements and provide the necessary infrastructure and tools.
- Automate repetitive tasks, configuration management, and infrastructure provisioning using tools like Terraform, Ansible, and Azure Resource Manager (ARM).
- Utilize APM (Application Performance Monitoring) to identify and resolve performance bottlenecks.
- Maintain comprehensive documentation for infrastructure, processes, and workflows.
Must Have Skills:
- Extensive knowledge of Azure services: Kubernetes, Azure App Service, Azure API Management (APIM), Application Gateway, AAD, GitHub Actions, Istio, Datadog.
- Proficiency in containerization and orchestration tools such as (Jenkins, GitLab CI/CD, Azure DevOps).
- Knowledge of API management platforms like APIM for API governance, security, and lifecycle management.
- Expertise in monitoring and observability tools like Datadog, Loki, Grafana, and Prometheus for comprehensive monitoring, logging, and alerting solutions.
- Good scripting skills (Python, Bash, PowerShell).
- Experience with infrastructure as code (Terraform, ARM Templates).
- Experience in optimizing cloud resource usage and costs utilizing insights from Azure cost and monitor metrics.

Posted 4 weeks ago

0 years

0 Lacs

Bengaluru, Karnataka, India

Remote

Get to know Okta Okta is The World’s Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth. At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences. Join our team! We’re building a world where Identity belongs to you. Who Is Okta? | About Okta Okta is an enterprise-grade identity management service, built from the ground up in the cloud and delivered with an unwavering focus on customer success. With Okta you can manage access across any application, person, or device. Whether the people are employees, partners, or customers, or the applications are in the cloud, on-premises, or on a mobile device, Okta helps you become more secure, make people more productive, and maintain compliance. The Okta service provides directory services, single sign-on, strong authentication, provisioning, workflow, and built-in reporting. It runs in the cloud on a secure, reliable, extensively audited platform and integrates deeply with on-premises applications, directories, and identity management systems. Watch this 1-min video to meet the people behind the world’s leading identity company:Here Location: India Innovation Centre, Bangalore Position Description: We are looking for engineering graduates who are ready to kickstart their careers! You will join one of our Site Reliability Engineering Domains, where you will address real-world challenges and create innovative value for our customers. We are looking for problem solvers who care about making a meaningful impact for our customers and enjoy working as part of a collaborative, distributed team. 
As a new grad, you will be supported along the way and will have opportunities for continuous learning and mentorship. Job Duties and Responsibilities: Assist in monitoring the health and performance of production systems using internal tools and dashboards. Help automate routine tasks through scripting and simple infrastructure-as-code solutions. Collaborate with SRE and developers to improve system reliability and deployment workflows. Contribute to the creation and maintenance of internal documentation for operational processes and runbooks. Participate in testing and validation of system updates, deployments, and configuration changes. Learn and apply best practices in observability, alerting, and service-level objectives (SLO). Perform basic troubleshooting of services, network issues, and infrastructure components. Attend team standups and planning meetings to align with ongoing priorities and projects. Take ownership of a small project or deliverable under the guidance of a mentor. Writing and maintaining documentation. Minimum Required Knowledge, Skills, and Abilities: Graduated with a Bachelor’s degree in Computer Science Basic understanding of Linux/Unix operating systems and command-line tools. Familiarity with networking fundamentals (e.g., DNS, HTTP, TCP/IP). Awareness of software development and deployment lifecycles. Introductory knowledge of monitoring tools (e.g., Prometheus, Grafana, Nagios, Datadog) and logging frameworks (e.g., ELK stack, Splunk) Willingness to learn new technologies and tools quickly. Ability to work both independently and as part of a collaborative team. Strong attention to detail and a proactive approach to problem-solving. Understanding basic programming concepts, data structures, and algorithms will help you understand and contribute to SRE tools and automation efforts. Basic understanding of database concepts and SQL Skills: Proficiency in at least one scripting language (e.g., Python, GO, Bash, or Shell). 
Understanding of cloud computing principles (AWS or GCP preferred). Basic troubleshooting and problem-solving capabilities. Comfort with Git and version control workflows. Good communication skills, especially in explaining technical concepts. Enjoy being part of a highly collaborative, remote-friendly environment. Basic knowledge of container technologies like Docker and orchestration tools like Kubernetes Familiarity with Infrastructure-as-Code (IaC) tools like Terraform or CloudFormation, and configuration management tools like Ansible, Puppet, or Chef, can be a plus. What You Can Look Forward to as a Full-Time/Intern at Okta Projects with real-world impact Mentorship and career growth Inclusive culture and fun communities Social impact opportunities What you can look forward to as a Full-Time Okta employee! Amazing Benefits Making Social Impact Developing Talent and Fostering Connection + Community at Okta Okta cultivates a dynamic work environment, providing the best tools, technology and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility in which they work so that all employees are enabled to be their most creative and successful versions of themselves, regardless of where they live. Find your place at Okta today! https://www.okta.com/company/careers/. Some roles may require travel to one of our office locations for in-person onboarding. Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws. 
If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation. Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Privacy Policy at https://www.okta.com/privacy-policy/.

Posted 4 weeks ago

5.0 years

0 Lacs

Trivandrum, Kerala, India

Remote

Equifax is where you can power your possible. We seek individuals to achieve their potential, develop new skills, and collaborate with bright minds. The Technology Operations Resilience Center – Incident Management Supervisor will lead a team providing 24x7 support for Event and Incident Management of all Equifax applications and infrastructure. The primary goal is to identify and mitigate incidents proactively. This role requires partnership with business sponsors, project managers, application support, networking, system administrators, and business owners. What You’ll Do Lead and manage a team of Incident Coordinators / Managers in India. Monitor Equifax Applications and Infrastructure. Perform initial analysis of alert events and guide the team in determining next steps. Ensure the team performs basic System Administration tasks to provide Level 1 NixSA and WinSA services. Lead and participate actively in Incident Management bridge lines and chats. Oversee the team's coordination of low-priority issues, ensuring proper team engagement, incident investigation progress, timely resolution, and accurate documentation. Oversee fault handling and escalation (identifying and responding to faults, liaising with 3rd party suppliers, and handling escalation). Ensure 24x7 support coverage with team shift management across time zones. Provide an “eyes on glass” presence, ensuring immediate identification of system degradation or failure, and guide the team in the same. Provide the team with tools, training, and guidance to react to alerting, provide first-level analysis, and perform mitigation actions. Manage communication with external customers during overflow situations. Mentor, train, and develop Incident Coordinators, fostering a high-performing team environment. Conduct performance reviews and provide feedback to team members. Ensure adherence to ITIL Incident Management processes and Equifax policies. 
Drive continuous improvement initiatives within the team and the incident management process. What Experience You Need A Bachelor’s Degree in a Technology field OR 5+ years of equivalent work experience. English (B2+). High School Diploma. 3+ years of experience in Incident Management or a related field. 2+ years of experience in a supervisory or team lead role. Experience managing a team in India or remotely. What Could Set You Apart Experience in any of the following technologies: ServiceNow PagerDuty Datadog SolarWinds GCP Statuspage Experience with performance management and team development. We offer comprehensive compensation and healthcare packages, 401k matching, paid time off, and organizational growth potential. Equifax is an Equal Opportunity Employer.

Posted 4 weeks ago

7.0 - 12.0 years

19 - 25 Lacs

Hyderabad, Pune, Bengaluru

Work from Office

Project description We're seeking a solid and creative AWS Cloud DevOps engineer eager to solve scale problems and work on cutting-edge and open-source technologies. In this project, you will have the opportunity to write code that will impact thousands of users. You'll implement your critical thinking and technical skills to develop cutting-edge software, and you'll have the opportunity to interact with teams across disciplines. At Luxoft, our culture strives to solve challenging problems focusing on product engineering based on hypothesis testing to empower people to come up with ideas. We do it with a truly flexible environment, high-impact projects in Agile environments, a culture focused on results, training, and strong support to grow your career. In this project, you will be a member of the Information Technology Team, within the Information Technology Division. Responsibilities 7+ years of experience as an AWS DevOps Engineer with technical expertise in Build and Release Management. ECS Fargate, CloudFormation, ElastiCache Redis, OpenSearch, Solace/ActiveMQ, GitHub and GitHub Actions, Route 53, DR setup (Active/Active or Active/Passive). Monitoring tools: Kibana/Datadog/Dynatrace or any similar tools. Skills Must have Strong communication skills. Hands-on AWS experience. 5+ years as Sr. DevOps and 10+ years of overall IT experience. 7+ years of CI/CD experience. Lead experience of 2-3 years minimum. Experience doing production support and able to guide the team. Able to drive innovation and leadership. Nice to have EKS, DocumentDB, DynamoDB, Neptune, Harness, Quantum Metrics

Posted 4 weeks ago

10.0 - 15.0 years

13 - 18 Lacs

Hyderabad, Pune, Bengaluru

Work from Office

Project description We're seeking a solid and creative AWS Cloud DevOps engineer eager to solve scale problems and work on cutting-edge and open-source technologies. In this project, you will have the opportunity to write code that will impact thousands of users. You'll implement your critical thinking and technical skills to develop cutting-edge software, and you'll have the opportunity to interact with teams across disciplines. At Luxoft, our culture strives to solve challenging problems focusing on product engineering based on hypothesis testing to empower people to come up with ideas. We do it with a truly flexible environment, high-impact projects in Agile environments, a culture focused on results, training, and strong support to grow your career. In this project, you will be a member of the Information Technology Team, within the Information Technology Division. Responsibilities This position supports and transforms existing and new mission-critical and highly visible operational website(s) and applications spanning multiple technology stacks through all phases of SDLC, while working collaboratively across IT, business, and third-party suppliers from around the globe in a 24x7, fast-paced, and Agile-based environment. Assist in design and development of the account application. Influence peers, juniors, and seniors both within the organization and across the account. Coordinate and collaborate with the Product and Engineering teams to understand and solve problems, come up with creative solutions, and help with tracking and delivering within the release plan. Collaborate with Engineering and QA to resolve bugs. Skills Must have 10+ years of experience in IT. 5+ years as Sr. DevOps, with hands-on AWS experience. 7+ years of CI/CD experience. 2+ years of experience in a technical leadership role. Experience in infrastructure provisioning using Terraform and CloudFormation. Experience doing production support and able to guide the team. 
Hands-on experience in AWS provisioning and good knowledge of AWS services like ECS Fargate (mandatory) and OpenSearch. Knowledge of CloudFormation scripts for automation purposes. Experience in creating ElastiCache Redis clusters and monitoring their metrics. Experience in working with GitHub and GitHub Actions. Experience in DR setup (Active/Active or Active/Passive). Experience in monitoring with Kibana / Datadog / Dynatrace and checking for latency issues, if any. Experience in Route 53. Akamai understanding. Nice to have Experience in creating DynamoDB tables, DocumentDB, and AuroraDB. Harness

Posted 4 weeks ago

7.0 - 12.0 years

17 - 32 Lacs

Hyderabad

Hybrid

The GCP CloudOps Engineer is accountable for a continuous, repeatable, secure, and automated deployment, integration, and test solutions utilizing Infrastructure as Code (IaC) and DevSecOps techniques 8+ years of hands-on experience in infrastructure design, implementation, and delivery 3+ years of hands-on experience with monitoring tools (Datadog, New Relic, or Splunk) 4+ years of hands-on experience with Container orchestration services, including Docker or Kubernetes, GKE. Experience with working across time zones and with different cultures. 5+ years of hands-on experience in Cloud technologies GCP is preferred. Maintain an outstanding level of documentation, including principles, standards, practices, and project plans. Having experience building a data warehouse using Databricks is a huge plus. Hands-on experience with IaC patterns and practices and related automation tools such as Terraform, Jenkins, Spinnaker, CircleCI, etc., built automation and tools using Python, Go, Java, or Ruby. Deep knowledge of CICD processes, tools, and platforms like GitHub workflows and Azure DevOps. Proactive collaborator and can work in cross-team initiatives with excellent written and verbal communication skills. Experience with automating long-term solutions to problems rather than applying a quick fix. Extensive knowledge of improving platform observability and implementing optimizations to monitoring and alerting tools. Experience measuring and modeling cost and performance metrics of cloud services and establishing a vision backed by data. 
Develop tools and a CI/CD framework to make it easier for teams to build, configure, and deploy applications. Contribute to Cloud strategy discussions and decisions on overall Cloud design and the best approach for implementing Cloud solutions. Follow and develop standards and procedures for all aspects of a Digital Platform in the Cloud. Identify system enhancements and automation opportunities for installing/maintaining digital platforms. Adhere to best practices on Incident, Problem, and Change management. Implement automated procedures to handle issues and alerts proactively. Experience with debugging applications and a deep understanding of deployment architectures. Pluses: Databricks and MongoDB. Experience with a multicloud environment (GCP, AWS, Azure); GCP is the preferred cloud provider. Experience with GitHub and GitHub Actions.

Posted 4 weeks ago

8.0 - 12.0 years

20 - 25 Lacs

Bengaluru

Work from Office

Project description We're seeking a solid and creative .NET Developer eager to solve scale problems and work on cutting-edge and open-source technologies. In this project, you will have the opportunity to write code that will impact thousands of users You'll implement your critical thinking and technical skills to develop cutting-edge software, and you'll have the opportunity to interact with teams across disciplines. In Luxoft, our culture strives to solve challenging problems focusing on product engineering based on hypothesis testing to empower people to come up with ideas. We do it with a truly flexible environment, high-impact projects in Agile environments, a culture focused on results, training, and strong support to grow your career. In this project, you will be a member of the Information Technology Team, within the Information Technology Division. This position supports and transforms existing and new mission-critical and highly-visible operational website(s) and applications - spanning multiple technology stacks - through all phases of SDLC, while working collaboratively across IT, business, and third-party suppliers from around the globe in a 24x7, fast-paced, and Agile based environment. 
Responsibilities Experience in .NET (Backend) Skills Must have 8-12 Years experience in .Net Technologies Hands-on service design, schema design and application integration design Hands-on software development using C#, .Net Core Use of multiple Cloud native database platforms including DynamoDB, SQL, Elastic cache, and others Conduct Code reviews and peer reviews Unit test and Unit test automation, defect resolution and software optimization Code deployment using CI/CD processes Understand business requirements and technical limitations Ability to learn new technologies and influence the team and leadership to constantly implement modern solutions Experience in using Elasticsearch, Logstash, Kibana (ELK) stack for Logging and Analytics Experience in container orchestration using Kubernetes Knowledge and Experience working with public cloud AWS services Knowledge of Cloud Architecture and Design Patterns Ability to prepare documentation for Microservices Monitoring tools such as Datadog, Logstash Excellent Communication skills Nice to have Airline industry knowledge is preferred but not required

Posted 4 weeks ago

0 years

10 - 40 Lacs

Gurugram, Haryana, India

On-site

DevOps Engineer AiSensy Gurugram, Haryana, India (On-site) About AiSensy AiSensy is a WhatsApp based Marketing & Engagement platform helping businesses like Adani, Delhi Transport Corporation, Yakult, Godrej, Aditya Birla Hindalco., Wipro, Asian Paints, India Today Group Skullcandy, Vivo, Physicswallah, Cosco grow their revenues via WhatsApp. Enabling 100,000+ Businesses with WhatsApp Engagement & Marketing 400Crores + WhatsApp Messages done between Businesses and Users via AiSensy per year Working with top brands like Delhi Transport Corporation, Vivo, Physicswallah & more High Impact as Businesses drive 25-80% Revenues using AiSensy Platform Mission-Driven and Growth Stage Startup backed by Marsshot.vc, Bluelotus.vc & 50+ Angel Investors Now, we’re looking for a DevOps Engineer to help scale our infrastructure and optimize performance for millions of users. 🚀 What You’ll Do (Key Responsibilities) 🔹 CI/CD & Automation: Implement, manage, and optimize CI/CD pipelines using AWS CodePipeline, GitHub Actions, or Jenkins. Automate deployment processes to improve efficiency and reduce downtime. 🔹 Infrastructure Management Use Terraform, Ansible, Chef, Puppet, or Pulumi to manage infrastructure as code. Deploy and maintain Dockerized applications on Kubernetes clusters for scalability. 🔹 Cloud & Security Work extensively with AWS (Preferred) or other cloud platforms to build and maintain cloud infrastructure. Optimize cloud costs and ensure security best practices are in place. 🔹 Monitoring & Troubleshooting Set up and manage monitoring tools like CloudWatch, Prometheus, Datadog, New Relic, or Grafana to track system performance and uptime. Proactively identify and resolve infrastructure-related issues. 🔹 Scripting & Automation Use Python or Bash scripting to automate repetitive DevOps tasks. Build internal tools for system health monitoring, logging, and debugging. 
What We’re Looking For (Must-Have Skills) ✅ Version Control: Proficiency in Git (GitLab / GitHub / Bitbucket) ✅ CI/CD Tools: Hands-on experience with AWS CodePipeline, GitHub Actions, or Jenkins ✅ Infrastructure as Code: Strong knowledge of Terraform, Ansible, Chef, or Pulumi ✅ Containerization & Orchestration: Experience with Docker & Kubernetes ✅ Cloud Expertise: Hands-on experience with AWS (Preferred) or other cloud providers ✅ Monitoring & Alerting: Familiarity with CloudWatch, Prometheus, Datadog, or Grafana ✅ Scripting Knowledge: Python or Bash for automation Bonus Skills (Good to Have, Not Mandatory) ➕ AWS Certifications: Solutions Architect, DevOps Engineer, Security, Networking ➕ Experience with Microsoft/Linux/F5 Technologies ➕ Hands-on knowledge of Database servers Skills:- Amazon Web Services (AWS), GitHub, Jenkins, Terraform, Ansible, Kubernetes, prometheus, AWS Bedrock, Chef, Puppet, Docker, Google Cloud Platform (GCP) and Python

Posted 4 weeks ago

3.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Job Summary - IMMEDIATE JOINERS ONLY (ON-SITE IN CHENNAI) We are seeking an ITSM Manager to lead and evolve our change management strategy, ensuring software and infrastructure changes are delivered safely, reliably, and with minimal risk to business operations. You will collaborate with engineering, DevOps, SRE, security, and compliance teams to drive process maturity, automation, and cultural adoption of safe change practices. Key Responsibilities Change Governance: Own and continuously improve the change management framework across the organization. Lead or participate in daily/weekly Change Review Board (CRB) meetings and ensure timely approvals. Risk & Reliability Oversight: Assess the risk of planned changes and verify readiness of rollout, rollback, and validation plans. Track key reliability metrics such as change failure rate, MTTR, and deployment lead time. Incident Correlation & Analysis: Investigate change-related incidents and contribute to post-incident reviews. Identify patterns and systemic issues in failed or high-risk changes. Automation & Tooling: Partner with DevOps/SRE teams to integrate change validation, canary rollouts, and automated approvals into CI/CD pipelines. Champion the use of observability tools to monitor live changes and detect anomalies early. Stakeholder Communication: Provide clear and actionable reporting to leadership on change success, risk trends, and improvement areas. Coordinate with product, engineering, and operations teams for major releases or changes during high-risk periods. Compliance & Audit Support: Ensure adherence to regulatory or internal audit requirements (e.g., SOX, ISO, PCI-DSS). Maintain documentation and audit trails for all changes. 
Qualifications Required: 3+ years of experience in ITSM Strong knowledge of change management principles Experience with CI/CD platforms (e.g., Jenkins, Spinnaker, ArgoCD) Proficiency with monitoring and observability tools (e.g., Datadog, Splunk, Prometheus) Excellent stakeholder management and communication skills Preferred: Background in high-availability or regulated industries (e.g., fintech) Experience with automated risk scoring, canary analysis, or feature flag systems SRE training is a plus Key Metrics You’ll Drive Change Failure Rate (CFR) Successful Change Audits (SCAs) Mean Time to Recovery (MTTR) Lead Time for Changes % of Automated Change Validations Emergency Change Volume Pay: $15-17 USD

Posted 4 weeks ago

0 years

0 Lacs

Pune, Maharashtra, India

On-site

At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better, too. Join us and build an exceptional experience for yourself, and a better working world for all.

EY-Consulting – AWS Staff-Senior

The opportunity

We are looking for a skilled AWS Data Engineer to join our growing data team. This role involves building and managing scalable data pipelines that ingest, process, and store data from various sources using modern AWS technologies. You will work with both batch and streaming data and contribute to a robust, scalable data architecture to support analytics, BI, and data science use cases. As a problem-solver with a keen ability to diagnose a client’s unique needs, you should be able to see the gap between where clients currently are and where they need to be, and be capable of creating a blueprint to help them achieve their end goal.

Key Responsibilities:
Design and implement data ingestion pipelines from various sources, including on-premise Oracle databases, batch files, and Confluent Kafka.
Develop Python producers and AWS Glue jobs for batch data processing.
Build and manage Spark streaming applications on Amazon EMR.
Architect and maintain Medallion Architecture-based data lakes on Amazon S3.
Develop and maintain data sinks in Redshift and Oracle.
Automate and orchestrate workflows using Apache Airflow.
Monitor, debug, and optimize data pipelines for performance and reliability.
Collaborate with cross-functional teams, including data analysts, scientists, and DevOps.

Required Skills and Experience:
Good programming skills in Python and Spark (PySpark).
Hands-on experience with Amazon S3, Glue, and EMR.
Good SQL knowledge of Amazon Redshift and Oracle.
Proven experience in handling streaming data with Kafka and building real-time pipelines.
Good understanding of data modeling, ETL frameworks, and performance tuning.
Experience with workflow orchestration tools like Airflow.

Nice-to-Have Skills:
Infrastructure as Code using Terraform.
Experience with AWS services like SNS, SQS, DynamoDB, DMS, Athena, and Lake Formation.
Familiarity with DataSync for file movement and medallion architecture for data lakes.
Monitoring and alerting using CloudWatch, Datadog, or Splunk.

Qualifications: BTech / MTech / MCA / MBA

EY | Building a better working world

EY exists to build a better working world, helping to create long-term value for clients, people and society and build trust in the capital markets. Enabled by data and technology, diverse EY teams in over 150 countries provide trust through assurance and help clients grow, transform and operate. Working across assurance, consulting, law, strategy, tax and transactions, EY teams ask better questions to find new answers for the complex issues facing our world today.

Posted 4 weeks ago

Apply

7.0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

Job Title: Senior DevOps Engineer (GCP | DevSecOps | Monitoring)
Employment Type: Full-time
Experience: 7+ Years

Job Summary:
We are seeking a highly experienced and results-driven Senior DevOps Engineer to join our dynamic team. The ideal candidate will bring 7+ years of hands-on experience in cloud infrastructure, monitoring, security, and DevSecOps practices, especially within the Google Cloud Platform (GCP) ecosystem. This role demands strong expertise in designing, implementing, and leading complex DevSecOps and monitoring initiatives across cloud-native environments.

Key Responsibilities:
Lead the end-to-end design, implementation, and delivery of scalable and secure DevSecOps solutions.
Implement and maintain monitoring and observability tools such as New Relic, Datadog, Grafana, and Prometheus.
Manage and optimize GCP infrastructure for performance, security, and cost efficiency.
Define and enforce DevSecOps best practices, integrating security at every stage of the development lifecycle.
Work closely with Data Engineering teams to support data pipelines and infrastructure automation.
Manage CI/CD pipelines using GitLab and ensure smooth deployment workflows.
Maintain containerized environments using Docker and Kubernetes.
Collaborate with cross-functional teams to ensure system reliability, scalability, and security.

Required Skills & Experience:
7+ years of experience in a DevOps/DevSecOps role with a strong background in GCP.
Proven experience with monitoring/observability tools: New Relic, Datadog, Grafana, Prometheus.
Deep understanding of DevSecOps principles, cloud security, and compliance practices.
Strong hands-on experience with Docker and Kubernetes.
Proficiency with GitLab for CI/CD automation.
Familiarity with infrastructure-as-code and configuration management tools.
Solid scripting and automation skills (e.g., Bash, Python, Terraform).
Experience collaborating with Data Engineers and supporting data-driven applications.

Preferred Qualifications:
GCP certifications (e.g., Professional Cloud DevOps Engineer, Cloud Architect).
Experience with other cloud platforms (e.g., AWS, Azure) is a plus.
Exposure to data pipeline tools and big data platforms is advantageous.

About Encora
Encora is the preferred digital engineering and modernization partner of some of the world's leading enterprises and digital native companies. With over 9,000 experts in 47+ offices and innovation labs worldwide, Encora's technology practices include Product Engineering & Development, Cloud Services, Quality Engineering, DevSecOps, Data & Analytics, Digital Experience, Cybersecurity, and AI & LLM Engineering. At Encora, we hire professionals based solely on their skills and qualifications, and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.

Posted 4 weeks ago

Apply

5.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Job Description

As a Senior Platform Product Manager, you will be responsible for defining and managing the core platform services and APIs that power our technology ecosystem. You will work closely with Platform Engineering and Software Development teams to design and unify foundational business APIs and services that enable consistency, scalability, and efficiency across the company. Your role will focus on standardizing platform capabilities, improving system interoperability, and ensuring that all teams can leverage common infrastructure services. The ideal candidate has a strong technical background, understands API design principles, and is experienced in defining enterprise-wide platform strategies that reduce duplication and streamline engineering efforts. The role requires a strong understanding of the Fintech, Healthcare, and related business domains, including industry regulations, operational workflows, and technology trends.

Responsibilities
Define and execute the platform API strategy, ensuring a unified and scalable approach across teams.
Partner with Product Teams, DevOps, Platform Engineering, and Software Engineering teams to design, develop, and maintain foundational APIs that support critical business functions.
Implement APIs that follow the outlined governance and best practices, including security, versioning, monitoring, and lifecycle management.
Collaborate with engineering teams to identify redundant or fragmented business capabilities and drive standardization.
Own the product roadmap for core platform services, ensuring alignment with engineering and business objectives.
Develop clear API documentation, usage guidelines, and adoption strategies to ensure cross-team consistency and efficiency.
Work with stakeholders to prioritize platform improvements, balancing short-term needs with long-term scalability.

Critical Skills & Experience
5+ years of experience in Product Management, with a focus on platform, infrastructure, or API-driven products.
Strong understanding of API design, microservices architecture, and API lifecycle management.
Ability to assess organizational priorities and capabilities while maintaining a strategic mindset and a long-term product vision.
Technical background or hands-on experience with software development, DevOps, or infrastructure is highly preferred.
Strong problem-solving and analytical skills, with the ability to translate technical challenges into platform solutions.
Excellent stakeholder management skills, working across engineering, security, compliance, and business teams.
Familiarity with observability and monitoring tools (e.g., Datadog, Prometheus, New Relic) is a plus.
Experience with API management platforms (e.g., Kong, Apigee, AWS API Gateway) is a plus.
Proficiency in data models, SQL, and database fundamentals is a plus.

Posted 4 weeks ago

Apply

0 years

0 Lacs

Pune, Maharashtra, India

On-site

Job Description

Some careers shine brighter than others. If you’re looking for a career that will help you stand out, join HSBC and fulfil your potential. Whether you want a career that could take you to the top, or simply take you in an exciting new direction, HSBC offers opportunities, support and rewards that will take you further. HSBC is one of the largest banking and financial services organisations in the world, with operations in 64 countries and territories. We aim to be where the growth is, enabling businesses to thrive and economies to prosper, and, ultimately, helping people to fulfil their hopes and realise their ambitions.

We are currently seeking an experienced professional to join our team in the role of Associate Director, Software Engineering.

In this role, you will:
Be a Lead Automation Engineer with deep hands-on experience of software automation testing and performance testing tools, practices, and processes.
Have a deep understanding of desktop, web, and data warehouse applications, API development, design patterns, SDLC, IaC tools, testing, and site reliability engineering, and of related ways to design and develop automation frameworks.
Define and implement best practices for software automation testing, performance testing, frameworks, and patterns, including testing methodologies.
Be a generalist with breadth and depth of experience in CI/CD best practices and core experience in testing (i.e., TDD, BDD, automated testing, contract testing, API testing, desktop/web apps, DW test automation).
Be able to see a problem or an opportunity and engineer a solution, be respected for what you deliver and not just what you say, think about the business impact of your work, and take a holistic view of problem-solving.
Have proven industry experience of running an engineering team with a focus on optimization of processes, introduction of new technologies, solving challenges, building strategy, business planning, governance, and stakeholder management.
Apply thinking to many problems across multiple technical domains and suggest ways to solve them.
Contribute to architectural discussions by asking the right questions to ensure a solution matches the business needs.
Identify opportunities for system optimization, performance tuning, and scalability enhancements. Implement solutions to improve system efficiency and reliability.
Have excellent verbal and written communication skills to articulate technical concepts to both technical and non-technical stakeholders.
Build performance assurance procedures with the latest feasible tools and techniques, and establish a performance test automation process to improve testing productivity.
Be responsible for the end-to-end software testing, performance testing, and engineering life cycle: technical scoping, performance scripting, testing, and tuning.
Analyse the test assessment results and provide recommendations to improve performance or save infrastructure costs.
Represent performance testing at Scrum meetings and all other key project meetings, and provide a single point of accountability and escalation for it within the scrum teams.
Advise on needed infrastructure and on performance engineering and testing guidelines, and be responsible for performance risk assessment of various application features.
Work with cross-functional teams, including software product, development, and support teams, handling tasks to accelerate testing delivery and improve the quality of applications at HSBC.
Provide support in product/application design from a performance point of view.
Communicate plans, status, and results as appropriate to the target audience.
Be willing to adapt, learn innovative technologies/trades, and be flexible to work on projects as demanded by the business.
Define and implement best practices for software automation testing, including testing standards, test reviews, coverage, testing methodologies, and traceability between requirements and test cases.
Prepare, develop, and maintain a test automation framework that can be used for software testing and performance testing, write automation test scripts, and conduct reviews.
Develop and execute regression, smoke, and integration tests in a timely manner.

Requirements

To be successful in this role, you must meet the following requirements:
Experience in software testing approaches for automation testing using Tosca, Selenium, and the Cucumber BDD framework.
Experience in writing test plans and test strategies and in test data management, including test artifact management for both automation and manual testing.
Experience in setting up CI/CD pipelines and working with GitHub and Jenkins, along with integration with Cucumber and Jira.
Experience in agile methodology and proven experience in working on agile projects.
Experience in bug analysis, prioritization, and reporting with bug tracking tools.
Experience in SQL, Unix, Control-M, ETL, data testing, API testing, and API automation using Rest Assured.
Familiarity with the following performance testing tools: Micro Focus LoadRunner Enterprise (VuGen, Analysis, LRE OneLG; protocols: HTTP/HTML, Citrix), JMeter, Postman, Insomnia.
Familiarity with the following observability tools: AppDynamics, New Relic, Splunk, Geneos, Datadog, Grafana.
Knowledge of the following will be an added advantage: GitHub, Jenkins, Kubernetes, Jira & Confluence.
Programming and scripting language skills in Java, Shell, Scala, Groovy, and Python; WebLogic server administration.
Familiarity with the BMC Control-M tool.
CI/CD tools: Ansible, AWS RO, G3.
UNIX/Linux/Web monitors and performance analysis tools to diagnose and resolve performance issues.
Experience of working in an Agile environment, "DevOps" team, or a similar multi-skilled team in a technically demanding function.
Experience of working on performance testing and tuning of microservices/APIs, desktop applications, web apps, cloud services, ETL apps, and database queries.
Experience of writing/modifying performance testing scripts, and of implementing and using automated tools for result analysis.
Experience of working on performance testing and tuning of data warehouse applications doing batch processing on various stages of ETL and information delivery components.

Good to have skills:
Knowledge of the latest technologies and tools, such as Python scripting, Tricentis Tosca, Dataflow, Hive, DevOps, REST APIs, Hadoop, the Kafka framework, GCP, and AWS, will be an added advantage.

You’ll achieve more when you join HSBC. www.hsbc.com/careers

HSBC is committed to building a culture where all employees are valued, respected and opinions count. We take pride in providing a workplace that fosters continuous professional development, flexible working and opportunities to grow within an inclusive and diverse environment. Personal data held by the Bank relating to employment applications will be used in accordance with our Privacy Statement, which is available on our website.

Issued by – HSDI

Posted 4 weeks ago

Apply

4.0 years

0 Lacs

Hyderābād

On-site

Requisition Number: 101578
Cloud Engineer III - Azure Infra/Migration/IaC/DevOps
Shift: 2 PM - 11 PM IST
Location: Delhi NCR, Hyderabad, Bangalore, Pune, Mumbai, Chennai. This is a hybrid work opportunity.

Insight at a Glance
14,000+ engaged teammates globally, with operations in 25 countries
Received 35+ industry and partner awards in the past year
$9.2 billion in revenue
#20 on Fortune’s World's Best Workplaces™ list
#14 on Forbes World's Best Employers in IT – 2023
#23 on Forbes Best Employers for Women in IT – 2023

Now is the time to bring your expertise to Insight. We are not just a tech company; we are a people-first company. We believe that by unlocking the power of people and technology, we can accelerate transformation and achieve extraordinary results. As a Fortune 500 Solutions Integrator with deep expertise in cloud, data, AI, cybersecurity, and intelligent edge, we guide organizations through complex digital decisions.

About the role
As a Cloud Engineer III, you will be part of the consulting practice, utilizing cutting-edge automation tools and provisioning in public cloud providers (preferably Azure, AWS, or GCP). You will be responsible for designing and deploying well-architected cloud solutions. The ideal candidate will have experience in customer-facing roles and a proven track record of delivering cloud solutions with Infrastructure as Code (IaC) automation on various projects.

Along the way, you will:
Design scalable, secure, and resilient cloud infrastructure (primarily on Azure, AWS, or GCP).
Create architecture diagrams, deployment strategies, and cloud roadmaps.
Deploy and configure cloud resources such as VMs, storage, networking, containers, and databases.
Automate infrastructure provisioning using tools like Terraform, ARM templates, or Bicep.
Set up CI/CD pipelines using tools like Azure DevOps, GitHub Actions, or Jenkins.
Implement Infrastructure as Code (IaC) and configuration management.
Support microservices-based architecture designs.
Set up application and infrastructure monitoring with tools like Prometheus, Grafana, Datadog, New Relic, or Azure Monitor.
Perform cost optimization and performance tuning.
Implement cloud security best practices, including identity and access management (IAM), encryption, firewall rules, and network security groups.
Collaborate with Insight and client teams, following Agile/Scrum methodologies and ceremonies.
Communicate effectively and professionally with teammates, client personnel, and stakeholders.

What we’re looking for
Bachelor’s degree in Information Technology, Computer Science, or a related field preferred, or equivalent practical experience.
4-6 years of relevant experience in a similar or related role is required.
Any relevant cloud certification is a plus.
Hands-on experience with one or more cloud providers (AWS, Azure, GCP) is a must, with Azure as the primary cloud.
Familiarity with writing infrastructure as code (e.g., Terraform, Azure Bicep, ARM templates, CloudFormation) is a must.
Working experience with at least one of the CI/CD tools and version control systems (e.g., Azure DevOps, GitHub Actions, Jenkins, Git, GitHub, Azure Repos) is required.
Familiarity with Windows and Linux/Unix-based systems is a must.
Proficiency in Azure infrastructure cloud services like Azure VM, VNET, Storage, Monitoring, Azure Functions, Load Balancers, Azure AD, Azure DNS, Traffic Manager, and Application Gateway for network optimization.
Knowledge of Azure Kubernetes Service (AKS), Docker containers, and application monitoring services such as Prometheus, Grafana, Datadog, and New Relic is highly desirable.
Experience in application deployment and management within cloud environments.
Hands-on knowledge of Docker and container lifecycle management.
Experience in deploying and managing distributed applications in production-grade environments.

What you can expect
We’re legendary for taking care of you, your family, and helping you engage with your local community. We want you to enjoy a full, meaningful life and own your career at Insight. Some of our benefits include:
Freedom to work from another location (even an international destination) for up to 30 consecutive calendar days per year.
Medical Insurance
Health Benefits
Professional Development: Learning Platform and Certificate Reimbursement
Shift Allowance

But what really sets us apart are our core values of Hunger, Heart, and Harmony, which guide everything we do, from building relationships with teammates, partners, and clients to making a positive impact in our communities. Join us today, your ambitious journey starts here.

Insight is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, sexual orientation or any other characteristic protected by law.

When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process. At Insight, we celebrate diversity of skills and experience, so even if you don’t feel like your skills are a perfect match, we still want to hear from you! Today's talent leads tomorrow's success.

Learn more about Insight: https://www.linkedin.com/company/insight/

Insight India Location: Level 16, Tower B, Building No 14, DLF Cyber City IT/ITES SEZ, Sector 24 & 25A, Gurugram (Gurgaon), HR 122002, India

Posted 4 weeks ago

Apply


Featured Companies