Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
3.0 - 7.0 years
0 Lacs
karnataka
On-site
As an Observability Developer at GlobalLogic, you will play a crucial role in alert configuration, workflow automation, and AI-driven solutions within the observability stack. Your responsibilities will involve designing and implementing alerting rules, configuring alert routing and escalation policies, building workflow integrations, developing AI-based solutions, collaborating with teams, and automating alert lifecycle management. Here's a breakdown of your role: **Role Overview:** You will be a proactive and technically versatile Observability Developer responsible for ensuring actionable alerts, well-integrated workflows, and AI-enhanced issue resolution within the observability stack. **Key Responsibilities:** - Design and implement alerting rules for metrics, logs, and traces using tools like Grafana, Prometheus, or similar. - Configure alert routing and escalation policies integrated with collaboration and incident management platforms (e.g., Slack, PagerDuty, ServiceNow, Opsgenie). - Build and maintain workflow integrations between observability platforms and ticketing systems, CMDBs, and automation tools. - Develop or integrate AI-based solutions for mapping telemetry signals, porting configurations, and reducing alert fatigue. - Collaborate with DevOps, SRE, and development teams to contextualize alerts and ensure their meaning. - Automate alert lifecycle management through CI/CD and GitOps pipelines. - Maintain observability integration documentation and provide support to teams using alerting and workflows. **Qualification Required:** - 3+ years of experience in DevOps, SRE, Observability, or Integration Development roles. - Hands-on experience with alert configuration in tools like Grafana, Prometheus, Alertmanager, or similar. - Experience integrating alerts with operational tools such as Slack, PagerDuty, Opsgenie, or ServiceNow. - Solid understanding of observability concepts (metrics, logs, traces). - Scripting or development experience in Python, Bash, or similar. - Experience with REST APIs and webhooks for creating workflow integrations. - Familiarity with CI/CD and GitOps tooling (e.g., ArgoCD, GitHub Workflows). *Note: Preferred qualifications were not explicitly mentioned in the provided job description.* GlobalLogic prioritizes a culture of caring, continuous learning and development, interesting and meaningful work, balance and flexibility, and integrity. As a part of the GlobalLogic team, you will have the opportunity to work on impactful projects and collaborate with forward-thinking companies to shape the digital landscape.,
Posted 3 days ago
7.0 - 11.0 years
0 Lacs
ahmedabad, gujarat
On-site
As a Lead DevOps Engineer at our company, you will have the opportunity to shape our DevOps culture, bridge the gap between Development and Platform Engineering, and implement innovative solutions in a multi-cloud environment. You will collaborate with industry-leading technologies to solve complex problems. **Key Responsibilities:** - Define and implement DevOps best practices, including CI/CD pipelines, Infrastructure as Code (IaC), and configuration management. - Automate tasks and optimize workflows to continuously improve DevOps processes. - Partner with Development and Product teams to understand their needs and identify opportunities for DevOps adoption. - Architect and implement robust infrastructure solutions across multi-cloud environments (AWS, GCP, Azure). - Design and implement monitoring and alerting systems for infrastructure health and application performance. - Stay updated on the latest DevOps trends and best practices. **Qualifications:** - 8+ years of relevant experience as a DevOps Engineer or in a related role. - Self-motivated with outstanding communication skills and a great team-player attitude. - Strong problem-solving, critical thinking, and analytical skills. - Ability to self-manage time effectively and efficiently. - Basic programming experience with high-level languages like Python or Go. - Strong understanding of CI/CD principles and automation tools. - Expertise in Infrastructure as Code (IaC) methodologies such as Terraform and Ansible. - In-depth knowledge of cloud platforms like AWS, Azure, and GCP, including associated services. - Experience with monitoring and alerting tools like Datadog, Prometheus, and PagerDuty. - Passion for developer productivity and tooling. - Excellent communication, collaboration, and problem-solving skills. - Positive and proactive approach to work. - Programming background with DevOps strength is highly preferred. **Nice to Have:** - Experience in AWS/GCP partnership collaboration. - Professional level certification in any one of the cloud platforms.,
Posted 3 days ago
8.0 - 13.0 years
6 - 10 Lacs
mumbai, hyderabad, bengaluru
Work from Office
Your Role Set up and fine-tune thresholds, alerts, and dashboards across infrastructure layers usingGrafanaand other monitoring tools. Integrate monitoring tools with ITOM platforms such asServiceNow ITOM. Optimize tool configurations to reduce alert noise and improve signal quality. Identify and implement automation opportunities across servers, storage, backup, databases, networks, andAzureenvironments. Collaborate with infrastructure and cloud teams to ensure full-stack visibility and end-to-end observability. Your Profile 8+ yearsof experience in infrastructure monitoring, tool administration, or cloud operations. Hands-on experience withGrafana(primary skill), and secondary tools such asZabbix,PagerDuty,ELK, andServiceNow. Strong understanding of IT infrastructure componentsservers, storage, backup, databases, and networks. Familiarity with ITOM integrations, automation, and observability best practices. Scripting knowledge (e.g.,PowerShell,Python,Shell) for automation tasks. Experience withAzure Monitorand public cloud environments (IaaS, PaaS, SaaS). What You Will Love Working at Capgemini Be part of a next-gen observability and automation team driving innovation. Work on mission-critical transformation programs across hybrid infrastructure and cloud platforms. Collaborate with global clients and diverse teams. Grow your skills in tool integration, automation, and cloud observability.
Posted 5 days ago
3.0 - 6.0 years
4 - 8 Lacs
hyderabad
Work from Office
Job Purpose The Systems Operations Analyst is part of a support organization that is responsible for the daily operations of multiple industry leading trading exchanges. This is a customer-facing position, providing immediate assistance to ICE/NYSE exchanges, back office, support personnel and IT staff, to achieve the highest customer satisfaction and minimize the impact of IT related problems. This is a critical support role within the overall architecture of ICE/NYSE exchanges, divisions, and infrastructure. This is a 24x7 environment and the position requires shift rotation and/or weekend work. Responsibilities Monitoring and Incident Management Monitor systems and applications within the production environment Diagnose and fix incidents raised through monitoring tools, conference bridges and chats Work with and escalate to internal and external teams to implement incident fixes, work-around and data recovery Open and update production incident tickets according to company standards Problem Management Investigate and update incident tickets with root cause and incident description, ensuring appropriate corrective action follow-up tickets are assigned Manage incident tickets to closure, ensuring incident details are complete and accurate, and all corrective actions have been completed System and Application Production Readiness Work with internal and external teams to expand and maintain operational runbooks and other documentation Check application and infrastructure availability and tasks at scheduled times Configure monitoring tools and alarms Deployment Management Production deployments Approve and execute production deployment tasks Participate in disaster recovery, business continuity and workplace recovery events. Participate in continuous improvement programs, such as trend analysis of recurring issues. Provide and report on performance metrics of the environment. Follow the handover process documented to bring the next shift up to speed and highlight priority items or issues. Knowledge and Experience Experience with PagerDuty Experience with ServiceNow & Jira Experience with Jenkins & Git Experience in scripting Cloud (AWS) & VMware knowledge is a must Bachelors degree (IT-based) or experience within IT systems support and/or operational support of applications databases within Windows & Linux/Unix OS environment. Strong communication skills High level of general IT skills with email and MS Office Applications Able to think logically and critically. Analytical problem-solving skills with an ability to identify root cause(s) Able to work as a team player across the organization. Able to build and maintain effective relationships with individuals and the team. Ability to be organized and decisive while under pressure. Excellent time management skills Able to manage priorities and multi-task. Self-confident and assertive
Posted 6 days ago
6.0 - 9.0 years
5 - 10 Lacs
bengaluru, karnataka, india
On-site
Job description Design, develop, and maintain scalable and efficient automated test frameworks Implement automated test scripts for functional, regression, and performance testing Collaborate with software developers to integrate test automation into the continuous integration/continuous deployment (CI/CD) pipeline Develop and execute comprehensive test plans based on project requirements Collaborate with cross-functional teams to identify test scenarios and ensure adequate test coverage Contribute to the creation and maintenance of test documentation Identify, analyze, and report software defects with a high level of detail Work closely with development teams to prioritize and facilitate the resolution of identified issues Bachelors degree in computer science or engineering with 8+ years experience8+ years experience with key QA automation tools: Java, Cucumber, Rest Template, TestNGIdeally experience with the following technologies: Gherkin, Springboot, Maven, Jenkins, Gitlab,, VaultStrong understanding of basic programming concepts and data structuresExperience with cloud technologies (ideally AWS, Kafka Experience with monitoring and logging tools such as Splunk, DataDog Experience with day-to-day tools such as Jira, Gitlab, PagerDuty, ServiceNow is nice to have Experience with integrating performance testing / monitoring into GitLab CI/CD Pipelines is nice to have Acknowledge the presence of choice in every moment and take personal responsibility for your life Possess an entrepreneurial spirit and continuously innovate to achieve great results Communicate with honesty and kindness and create the space for others to do the same Lead with courage, knowing the possibility of greatness is bigger than the fear of failure Foster connection by putting people first and building trusting relationships Integrate fun and joy as a way of being and working, aka doesn t take yourself too seriously 3 must have Java 4/5Selenium 4/5BDD / Cucumber 3/5Healthcare domain 3/5
Posted 6 days ago
4.0 - 8.0 years
0 Lacs
pune, maharashtra
On-site
ZS is a place where passion changes lives. As a management consulting and technology firm focused on transforming global healthcare and beyond, our most valuable asset is our people. Here you'll work side-by-side with a powerful collective of thinkers and experts shaping solutions from start to finish. At ZS, we believe that making an impact demands a different approach; and that's why here your ideas elevate actions, and here you'll have the freedom to define your own path and pursue cutting-edge work. We partner collaboratively with our clients to develop products that create value and deliver company results across critical areas of their business including portfolio strategy, customer insights, research and development, operational and technology transformation, marketing strategy and many more. If you dare to think differently, join us, and find a path where your passion can change lives. Our most valuable asset is our people. At ZS we honor the visible and invisible elements of our identities, personal experiences, and belief systems - the ones that comprise us as individuals, shape who we are and make us unique. We believe your personal interests, identities, and desire to learn are part of your success here. As a Senior Cloud Monitoring Administrator in ZS IT Operations Center (ITOC), you will serve as a key escalation point and owner of the Incident Management lifecycle for ZS's global IT infrastructure. This includes managing major incidents (P1/P2), ensuring timely resolution, driving stakeholder communication, and leading post-incident analysis to prevent recurrence. What You'll Do: - Own and lead high-impact incident management (P1/P2) processes end-to-end. - Facilitate incident bridges and war rooms with cross-functional teams (Cloud Compute, Network, Security, Cloud Ops). - Coordinate with global stakeholders, vendors, and leadership for real-time updates and escalations. - Maintain real-time communication on Team/ServiceNow and through structured email updates. - Conduct in-depth Post-Incident Reviews (PIR) and ensure follow-ups via Problem Management. - Track incident metrics (MTTR, SLA breaches, recurrence), analyze trends, and recommend improvements. - Partner with engineering and automation teams to enhance observability and proactive detection. - Standardize and enhance ITOC's incident response processes based on ITIL best practices. - Drive improvements in incident communication protocols, documentation, and playbooks. - Mentor junior engineers (L1/L2) in handling escalations and developing response skills. - Partner with Observability Team, Cloud Compute, Network, Security and Cloud Ops to enable integrated monitoring and alerting. - Collaborate with application and business teams to minimize business disruption and align resolution priorities. - Participate in Change Advisory Board (CAB) to mitigate incident risks from changes. What You'll Bring: - 4+ years of experience in IT Operations/Incident Management roles, with at least 2-3 years handling global environments. - Prior experience in a Consultant/L3 capacity in a matrixed or client-facing IT environment. - Strong expertise in handling hybrid infrastructure (AWS + On-prem) incidents. - Proven success in independently leading major incidents and stakeholder management. - Tools: ServiceNow, JIRA, SolarWinds, Splunk, AWS CloudWatch, PagerDuty/Uptrends, Teams. - Cloud: Working knowledge of Networking technologies, VMWare, AWS (EC2, RDS, Route 53, ELB, VPC, etc.) - Concepts: ITIL (Incident, Problem, Change), Monitoring & Alerting, Automation basics (preferred) - Certifications: ITIL v4 Foundation (required); AWS Cloud Practitioner or higher (preferred) - Clear and timely communication during critical scenarios. - Strong decision-making and accountability under pressure. - Ability to influence cross-functional teams without direct authority. - Structured thinking with an eye for continuous service improvement. - Willingness to work in 24x7 support environment (via on-call rotation). Perks & Benefits: ZS offers a comprehensive total rewards package including health and well-being, financial planning, annual leave, personal growth and professional development. Our robust skills development programs, multiple career progression options and internal mobility paths and collaborative culture empower you to thrive as an individual and global team member. Travel: Travel is a requirement at ZS for client-facing ZSers; business needs of your project and client are the priority. While some projects may be local, all client-facing ZSers should be prepared to travel as needed. Considering applying At ZS, we honor the visible and invisible elements of our identities, personal experiences, and belief systems - the ones that comprise us as individuals, shape who we are, and make us unique. We believe your personal interests, identities, and desire to learn are integral to your success here. We are committed to building a team that reflects a broad variety of backgrounds, perspectives, and experiences. ZS is an equal opportunity employer and is committed to providing equal employment and advancement opportunities without regard to any class protected by applicable law. To complete your application: Candidates must possess or be able to obtain work authorization for their intended country of employment. An online application, including a full set of transcripts (official or unofficial), is required to be considered. NO AGENCY CALLS, PLEASE. Find Out More At: www.zs.com,
Posted 6 days ago
4.0 - 6.0 years
0 Lacs
pune, maharashtra, india
Remote
ZS is a place where passion changes lives. As a management consulting and technology firm focused on transforming global healthcare and beyond, our most valuable asset is our people. Here you'll work side-by-side with a powerful collective of thinkers and experts shaping solutions from start to finish. At ZS, we believe that making an impact demands a different approach and that's why here your ideas elevate actions, and here you'll have the freedom to define your own path and pursue cutting-edge work. We partner collaboratively with our clients to develop products that create value and deliver company results across critical areas of their business including portfolio strategy, customer insights, research and development, operational and technology transformation, marketing strategy and many more. If you dare to think differently, join us, and find a path where your passion can change lives. Our most valuable asset is our people. At ZS we honor the visible and invisible elements of our identities, personal experiences and belief systems-the ones that comprise us as individuals, shape who we are and make us unique. We believe your personal interests, identities, and desire to learn are part of your success here. Learn more about our diversity, equity, and inclusion efforts and the networks ZS supports to assist our ZSers in cultivating community spaces, obtaining the resources they need to thrive, and sharing the messages they are passionate about. As a Senior Cloud Monitoring Administrator in ZS IT Operations Center (ITOC) , you will serve as a key escalation point and owner of the Incident Management lifecycle for ZS's global IT infrastructure. This includes managing major incidents (P1/P2) , ensuring timely resolution, driving stakeholder communication, and leading post-incident analysis to prevent recurrence. What You'll Do: Own and lead high-impact incident management (P1/P2) processes end-to-end. Facilitate incident bridges and war rooms with cross-functional teams (Cloud Compute, Network, Security, Cloud Ops). Coordinate with global stakeholders, vendors, and leadership for real-time updates and escalations. Maintain real-time communication on Team/ServiceNow and through structured email updates. Conduct in-depth Post-Incident Reviews (PIR) and ensure follow-ups via Problem Management. Track incident metrics (MTTR, SLA breaches, recurrence), analyze trends, and recommend improvements. Partner with engineering and automation teams to enhance observability and proactive detection. Standardize and enhance ITOC's incident response processes based on ITIL best practices. Drive improvements in incident communication protocols , documentation, and playbooks. Mentor junior engineers (L1/L2) in handling escalations and developing response skills. Partner with Observability Team,Cloud Compute, Network, Security and Cloud Ops to enable integrated monitoring and alerting. Collaborate with application and business teams to minimize business disruption and align resolution priorities. Participate in Change Advisory Board (CAB) to mitigate incident risks from changes. What You'll Bring: 4+ years of experience in IT Operations/Incident Management roles, with at least 2-3 years handling global environments. Prior experience in a Consultant/L3 capacity in a matrixed or client-facing IT environment. Strong expertise in handling hybrid infrastructure (AWS + On-prem) incidents. Proven success in independently leading major incidents and stakeholder management. Tools: ServiceNow, JIRA, SolarWinds, Splunk, AWS CloudWatch, PagerDuty/Uptrends, Teams. Cloud: Working knowledge of Networking technologies, VMWare, AWS (EC2, RDS, Route 53, ELB, VPC, etc.) Concepts: ITIL (Incident, Problem, Change), Monitoring & Alerting, Automation basics (preferred) Certifications: ITIL v4 Foundation (required) AWS Cloud Practitioner or higher (preferred) Clear and timely communication during critical scenarios. Strong decision-making and accountability under pressure. Ability to influence cross-functional teams without direct authority. Structured thinking with an eye for continuous service improvement. Willingness to work in 24x7 support environment (via on-call rotation). Perks & Benefits: ZS offers a comprehensive total rewards package including health and well-being, financial planning, annual leave, personal growth and professional development. Our robust skills development programs, multiple career progression options and internal mobility paths and collaborative culture empowers you to thrive as an individual and global team member. We are committed to giving our employees a flexible and connected way of working. A flexible and connected ZS allows us to combine work from home and on-site presence at clients/ZS offices for the majority of our week. The magic of ZS culture and innovation thrives in both planned and spontaneous face-to-face connections. Travel: Travel is a requirement at ZS for client facing ZSers business needs of your project and client are the priority. While some projects may be local, all client-facing ZSers should be prepared to travel as needed. Travel provides opportunities to strengthen client relationships, gain diverse experiences, and enhance professional growth by working in different environments and cultures. Considering applying At ZS, we honor the visible and invisible elements of our identities, personal experiences, and belief systems-the ones that comprise us as individuals, shape who we are, and make us unique. We believe your personal interests, identities, and desire to learn are integral to your success here. We are committed to building a team that reflects a broad variety of backgrounds, perspectives, and experiences. about our inclusion and belonging efforts and the networks ZS supports to assist our ZSers in cultivating community spaces and obtaining the resources they need to thrive. If you're eager to grow, contribute, and bring your unique self to our work, we encourage you to apply. ZS is an equal opportunity employer and is committed to providing equal employment and advancement opportunities without regard to any class protected by applicable law. To complete your application: Candidates must possess or be able to obtain work authorization for their intended country of employment.An on-line application, including a full set of transcripts (official or unofficial), is required to be considered. NO AGENCY CALLS, PLEASE. Find Out More At:
Posted 6 days ago
6.0 - 8.0 years
0 Lacs
ahmedabad, gujarat, india
On-site
Job Description: Technical Skill . Experience working on AWS, Kubernetes Orchestration and EKS. . Must have 6+ years of Production Env. Experience. . Exp. with infra-automation using Terraform| CloudFormation | Ansible. . Exp. working on Linux environment with at least one scripting language. . Exp. of CI/CD pipeline using Jenkins, Harness l ArgoCD . Exp. in Application Performance Monitoring tools such as Datadog, Instana, Grafana, Splunk, PagerDuty, Pingdom and Cloud Watch. . Good to know and at least hands on experience on DevOps and Cloud Operation Process and Agile Release Processes, and L2/L3 Ticketing process and experience in SRE role. . Should have experience with source control and management tools like Git. Essential Duties & Responsibilities: . Should be able to work in 16 X 5 shifts for support of infrastructure, Weekly rotational shifts Morning OR evening. . Design, implement and maintain highly available, scalable AWS infrastructure and services within a managed service environment and continual re-evaluation. . Manage the deployments in env. such as QA/Stage/Production and engaging stack-holders as required, and adhering to SLA (Service Level Agreement) . Build, Deploy and Manage Kubernetes clusters through automation like Terraform. . Institute infrastructure as code, security and process automation and automation of routine maintenance tasks . Security and Vulnerability Patching. . Create and deliver knowledge-sharing presentations and documentation. . Learning on the job and exploring new technologies with little supervision. . Participating in and leading war-room/critical outage calls and ensuring the RCA process is implemented. . Cross located Team Management to create a self-managed team, support skill development, and provide growth path, performance evaluation, and attrition management. . Customer Management for daily calls to discuss progress/updates, weekly & monthly status reporting, and lead calls with cross-functional team for creating new services monitors. . Ability to work with and influence cross-functional teams of customer / eInfochips. . Delivery Management to ensure well-defined SLAs (Service Level Agreement) and adherence of the same by self/team using delivery tracking tool - JIRA / Confluence. . Quality Management to contribute that all deliverables from eInfochips are following the quality processes of eInfochips and the customer. . Documentation of processes/status reporting / SOPs for future reference. . Knowledge of AI tools, and Savant with Amazon SageMaker Services will be an added advantage. Tools and Technology specific Exp. Level: Linux : 5+ Yrs. AWS : 5+ Yrs. Kubernetes : 3+ Yrs. Terraform : 3+ yrs. Total 5+ Year Production Operations and Maintenance Experience.
Posted 1 week ago
4.0 - 6.0 years
0 Lacs
indi, karnataka
On-site
Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins. Why join Coupa Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend. Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence. Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other. Learn more on blog and hear from our employees about their experiences working at Coupa. The Impact of a Sr. Site Reliability Engineer to Coupa: If you are passionate about new technologies, have a strong technical background and you are looking for an environment where you can continuously expand your knowledge, you are the right fit for this role. At Coupa, the Site Reliability Engineering team is looking for a quality-driven engineer who is ready to constantly challenge his/her mind with a mixture of troubleshooting, code development, hacking, networking, and more. As a Senior Site Reliability Engineer, you will play a crucial role in the development of solutions for our Contract platform. Contract in a nutshell: Coupa Contract (Standard) enables customers to author, approve, and operationalize contracts, making them easily available for purchasing by employees across the organization. Contract compliance delivers savings as employees make purchases using negotiated rates and helps to mitigate risk by ensuring that appropriate terms are in place. Contract enforcement and spend visibility are provided through embedded dashboards at both the contract and summary level. Coupa Contract Advanced is an enterprise-class contract management solution to help companies improve contract visibility, risk management, and operational efficiency at scale. Contract Advanced is designed to handle the creation, storage, and optimization of any contract across any industry or department. At a business level, together with the product management and development team you will change the way our customers deal with Contracts life cycle management ecosystem and build best in class hosting infrastructure on cloud. At a technical level we will jointly drive scaling our Business Spend Management platform on public cloud by following Site reliability engineering (SRE) best practices. What You'll Do: Administration of Linux machines, Web servers, Application servers, Databases Application and cloud infrastructure support for customer environments. Owning the Dev and QE infrastructure Debug and troubleshoot Dev/QE infrastructure issues Perform various deployments for Dev and QE environment on demand Write readable, testable, maintainable, and extensible code in Bash/Ruby Work in an agile environment where quick iterations and good feedback are a way of life Continually look for opportunities to improve our platform, process, cost and business Communicate and coordinate with our Dev and QE teams to solve issues What you will bring to Coupa: Bachelor's Degree with 4+ years of professional experience handling large scale production systems. Good exposure to one of the cloud technologies, preferably AWS Hands-on with Unix, DevOps concepts, SRE concepts, CI/CD tools (Jenkins) Hands-on with configuration management tools (Chef, Ansible, Puppet) Exposure to monitoring tools like New Relic, PagerDuty etc. Exposure to infrastructure orchestration tools Exposure towards Infrastructure as a code practise Experience working collaboratively with a distributed team MySQL and general database knowledge, including performance and optimization Excellent written and verbal communication skills. Critical thinking, continuously challenging how and why we do things to help us improve What can you expect when you join Coupa During your onboarding you will: - Have a buddy that will guide you on your onboarding journey - Receive training on our products, processes and organization - Learn our stack by diving into SRE best practices with the team Once you are fully onboarded you will: - Drive feature design and implementation in your team. - Participate in our community to evolve together. - Pursue your personal development journey as part of the Coupa family. Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees. Please be advised that inquiries or resumes from recruiters will not be accepted. By submitting your application, you acknowledge that you have read and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.
Posted 1 week ago
0.0 years
0 Lacs
coimbatore, tamil nadu, india
On-site
The Opportunity: Avantor is looking for an Engineer for the IT Operations Team. As a member of the IT Service Management monitoring team, reporting to the Senior Manager of IT Services, the associate will be responsible for monitoring servers, networks, databases, storage, and backup devices for proactive identification of incidents. In this well-respected IT group, the associate will enjoy a wide variety of self-directed work within a supportive team environment. What we're looking for Education: A Bachelor's degree or equivalent experience within an enterprise-level corporate IT environment is required. Experience: 0-2 years work experience in IT monitoring is highly desirable. Preferred Qualifications: Direct experience with Jenkins, Nprinting, Cloudwatch, Qlikview, SolarWinds, Redwood, OpManager and/or PagerDuty is highly desirable. Experience with scripting languages is a plus. Certifications in AWS or ITIL is a plus. Knowledge of ITIL based Incident, Problem and Change Management processes. Strong problem solving and analytical skills. Ability to self-start and to effectively participate in a team environment. Ability to be an on-call escalation point for production support and scheduled off-hours/weekend work if/when required. Ability to focus on the customer and to adhere to processes defined for customer issue handling. Ability to examine, summarize, and effectively present data when required. Commitment to high professional and ethical standards in a diverse workplace. How will you thrive and create an impact: Monitor event alerts, acknowledge and, when appropriate, escalate to the next level support team(s). Schedule jobs in SAP for different systems, ensure successful runs and restart when required. Cleanup NAS backup server files. Prepare weekly error report and ensure tickets are created for all failed jobs. Prepare weekly & monthly Task performance/ Aging reports, drive aging calls with wider team and ensure tickets are closed on time/record justification if required. Actively participate in process improvement sessions and internal projects. Support IT changes, prioritizing change requests, assessing impact, and accepting changes which meet requirements. Track incidents to improve service level agreement adherence, create problem tickets with priorities based on impacts and track and update corrective action plans. Maintain internal knowledge repository. Manage ticketed query system and ensure queries and resolutions are tracked and kept up to date. Disclaimer: The above statements are intended to describe the general nature and level of work being performed by employees assigned to this classification. They are not intended to be construed as an exhaustive list of all responsibilities, duties and skills required of employees assigned to this position. Avantor is proud to be an equal opportunity employer. Why Avantor Dare to go further in your career. Join our global team of 14,000+ associates whose passion for discovery and determination to overcome challenges relentlessly advances life-changing science. The work we do changes people's lives for the better. It brings new patient treatments and therapies to market, giving a cancer survivor the chance to walk his daughter down the aisle. It enables medical devices that help a little boy hear his mom's voice for the first time. Outcomes such as these create unlimited opportunities for you to contribute your talents, learn new skills and grow your career at Avantor. We are committed to helping you on this journey through our diverse, equitable and inclusive culture which includes learning experiences to support your career growth and success. At Avantor, dare to go further and see how the impact of your contributions set science in motion to create a better world. Apply today! EEO Statement: We are an Equal Employment/Affirmative Action employer and VEVRAA Federal Contractor. We do not discriminate in hiring on the basis of sex, gender identity, sexual orientation, race, color, religious creed, national origin, physical or mental disability, protected Veteran status, or any other characteristic protected by federal, state/province, or local law. If you need a reasonable accommodation for any part of the employment process, please contact us by email at and let us know the nature of your request and your contact information. Requests for accommodation will be considered on a case-by-case basis. Please note that only inquiries concerning a request for reasonable accommodation will be responded to from this email address. 3rd party non-solicitation policy: By submitting candidates without having been formally assigned on and contracted for a specific job requisition by Avantor, or by failing to comply with the Avantor recruitment process, you forfeit any fee on the submitted candidates, regardless of your usual terms and conditions. Avantor works with a preferred supplier list and will take the initiative to engage with recruitment agencies based on its needs and will not be accepting any form of solicitation
Posted 1 week ago
0.0 - 1.0 years
0 - 0 Lacs
nawanshahr
Work from Office
This is a free and unpaid 6-week remote DevOps internship program designed to provide hands-on experience in cloud, automation, CI/CD, and containerization. Gain practical skills, work on real projects, and build a foundation for your DevOps career.
Posted 2 weeks ago
4.0 - 8.0 years
0 Lacs
karnataka
On-site
You have 4 years of experience developing solutions in Golang with proficiency in GO development frameworks. It is also beneficial if you have experience with Java, typescript/node.js. Additionally, you should have a background in backend software development, focusing on building microservices and event-driven architectures/solutions. Your demonstrated ability to use common industry tools for software development, including IDEs, build and continuous integration, source control management, code review tools, data storage services, and cloud infrastructure is essential. You should be capable of building software in a professional team environment and delivering it to production using these tools. Familiarity with various database technologies, encompassing both SQL and NoSQL options such as DynamoDB, Elasticsearch, and Postgres Aurora, is required. You should also possess a deep understanding of Docker, Kubernetes, and various AWS services. Having experience in building, operating, and owning services is crucial. You must be able to implement operational excellence mechanisms including alerting, metrics, and logging using tools like Prometheus, CloudWatch, Kibana, and PagerDuty. In addition, you should have experience with software engineering best practices such as unit testing, design patterns, building maintainable code, and performance optimization. DevOps experience, particularly with AWS, is preferred. You should be adept at architecting and configuring cloud technology stacks, including Compute, Network Security, API Gateways, VPCs, CDNs, Kafka/MKS, Kubernetes, Fargate/EKS, Jenkins configuration, and CI/CD configurations. Demonstrating good software development practices in various areas and showing improvement over time is expected. Providing technical documentation describing your contributions and contributing enhancements to your team's best practices is essential. You should have the ability to work with minimal instructions on day-to-day tasks and self-start with general guidance on new assignments. Participation in an inclusive and supportive engineering culture is encouraged.,
Posted 2 weeks ago
5.0 - 8.0 years
8 - 13 Lacs
bengaluru
Work from Office
Role Overview: The Cloud Operations Tools Engineer at Skyhigh Security will be responsible for the administration, customization, and optimization of Atlassian tools, primarily Jira and Confluence, with a strong focus on Jira Service Management (JSM). This role involves working closely with cross-functional teams to enhance workflows, automate processes, and ensure seamless integration with other tools and systems. This position is part of the growing Skyhigh Security Tech Ops team, playing a crucial role in improving operational efficiency and scalability. Key Responsibilities: Administer, configure, and maintain Atlassian Jira, Jira Service Management (JSM), and Confluence. Design and implement workflows, dashboards, automation rules, and reporting solutions within Jira and JSM. Collaborate with IT and business teams to understand requirements and enhance the functionality of Jira and JSM. Integrate Jira with other tools such as CI/CD pipelines, monitoring systems, and ITSM platforms. Ensure the reliability, scalability, and security of Atlassian tools. Troubleshoot and resolve issues related to Jira, JSM, and other Atlassian applications. Develop scripts and automation solutions using Groovy, Python, or other scripting languages. Maintain documentation for workflows, configurations, and best practices. Stay updated with Atlassians latest features, best practices, and industry trends. Qualifications: 5 to 8 years of experience in Atlassian tools administration, particularly Jira and Jira Service Management (JSM). Strong expertise in Jira workflow configuration, automation, and customization. Experience with Jira plugins and add-ons such as ScriptRunner, Insight (Asset Management), and Service Desk integrations. Proficiency in scripting languages such as Groovy, Python, or Shell scripting. Experience integrating Jira with cloud platforms (AWS, Azure, OCI) and other ITSM tools. Familiarity with automation and infrastructure-as-code tools (e.g., Ansible, Terraform). Strong analytical and problem-solving abilities. Excellent communication and collaboration skills. Ability to work in a fast-paced, dynamic environment. Bachelor's degree in Computer Science, Information Technology, or a related field. Preferred Qualifications: Atlassian Certifications (e.g., Atlassian Certified Jira Administrator, Jira Service Management Administrator). Experience with ITIL practices and IT Service Management workflows. Knowledge of monitoring and incident management tools such as PagerDuty, Grafana, and AI Ops. Understanding of containerization and orchestration tools (e.g., Docker, Kubernetes). Familiarity with security tools and best practices.
Posted 2 weeks ago
3.0 - 5.0 years
0 Lacs
bengaluru, karnataka, india
Remote
About the Role As a Production Quality Management Engineer at Uber, you'll play a key role in improving the reliability, safety, and operational excellence of Uber's services. You'll help ensure our systems meet production standards by working on tooling, automation, and process improvements that reduce risk and elevate the quality of engineering execution. This is a hands-on, collaborative role that offers the opportunity to work across engineering teams, contribute to real-time reliability efforts, and grow your technical and operational expertise. What You'll Do 1. Support Production Readiness: Participate in incident reviews, postmortem processes, and quality audits. Help enforce production standards across services. 2. Contribute to Metrics & Automation: Build or improve tools and dashboards (e.g., Tableau, SQL-based) to monitor reliability, measure SLA adherence, and identify improvement opportunities. 3. Collaborate Across Teams: Partner with engineers in Platform, Infrastructure, and Compliance to align on reliability practices and implement production safeguards. 4. Improve Engineering Workflows: Help maintain runbooks, lockdown policies, readiness reviews (PRR), and alerting hygiene to streamline operational practices. 5. Learn from Experience: Shadow experienced engineers, attend architecture reviews, and gain exposure to high-severity incident handling and quality programs. Basic Qualifications 1. Bachelor's degree in Computer Science, Engineering, or equivalent experience. 2. 3+ years of experience in software development, operations, incident management, or production engineering. 3. Comfortable writing basic SQL and/or Python scripts to extract and analyze production data. 4. Strong communication and documentation skills, with attention to detail. 5. Interest in systems reliability, incident management, and continuous improvement. Preferred Qualifications 1. Experience working with incident tracking systems (e.g., Jira, PagerDuty) or monitoring platforms (e.g., Tableau, Grafana, Prometheus). 2. Exposure to production support, quality assurance, or DevOps environments. 3. Familiarity with topics such as SLAs, SLOs, alerting, or postmortem workflows. What You'll Gain 1. Mentorship from senior production engineers and reliability experts. 2. Deep exposure to production systems, incident operations, and quality tooling at scale. 3. The chance to make a measurable impact on Uber's global reliability posture. 4. Career growth opportunities through learning, cross-functional collaboration, and ownership of real-world problems. About the PQM Team The Production Quality Management (PQM) team ensures Uber's services operate with high reliability and efficiency. We: 1. Build tools and dashboards to monitor and improve production quality. 2. Lead initiatives to standardize quality and reliability policies and processes. 3. Partner with teams across Uber to embed operational excellence into engineering culture. Uber's mission is to reimagine the way the world moves for the better. Here, bold ideas create real-world impact, challenges drive growth, and speed fuelds progress. What moves us, moves the world - let's move it forward, together. Offices continue to be central to collaboration and Uber's cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role. .Accommodations may be available based on religious and/or medical conditions, or as required by applicable law. To request an accommodation, please reach out to .
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
pune, maharashtra
On-site
As a valued member of our team at Bridgenext, you will be responsible for administering and supporting Atlassian instances such as JIRA and Confluence, ensuring associated user access privileges are maintained effectively. Your role will involve creating projects, workflows, and custom fields based on user requests, as well as capturing and promoting tips and best practices for optimal utilization of Atlassian tools within the company. Collaborating closely with users, you will address and resolve issues, providing recommendations on implementing solutions in alignment with best practices. Furthermore, you will gather business process requirements to enhance JIRA setup and workflows, identifying areas for optimization and improvement, and assessing the compatibility of functionality with user requests. Your responsibilities will extend to installing, integrating, and configuring JIRA & Confluence plug-ins, creating custom webhooks, and developing integrations with other systems. You will also be tasked with generating Application Documentation, including User Guides, Operations Guides, Training Manuals, and Change Management Processes. Managing migrations, upgrades, patches, fixes, and continuous improvement/delivery will be key aspects of your role. Additionally, you will engage in administrative activities for PagerDuty and Datadog applications, ensuring seamless operations and support across various platforms. To excel in this role, you should possess a minimum of 5 years of experience in administering and executing migrations on JIRA and Confluence. Familiarity with JIRA Advanced Roadmaps, developing and managing Jira workflows, schemes, and API integrations to/from Atlassian products is essential. Proficiency in creating informative JIRA dashboards, generating reports using tools like EazyBI, and installing various JIRA plug-ins will be advantageous. Moreover, a solid foundation in Windows & Unix/Linux administration, along with exceptional written and oral communication skills, is required. Your analytical, organizational, troubleshooting, and problem-solving abilities will be instrumental in fulfilling your duties effectively. You should also demonstrate strong presentation skills and be adept at establishing positive relationships with internal and external stakeholders. A willingness to learn and administer new tools & technologies, coupled with good communication skills and customer service experience, will set you up for success in this role. Exposure to PagerDuty and Datadog administrations will be considered a plus, enhancing your contributions to our dynamic and innovative team.,
Posted 2 weeks ago
8.0 - 12.0 years
0 Lacs
karnataka
On-site
As a seasoned Site Reliability Engineer (SRE) Architect with 8-10 years of experience, you will join our team to play a critical role in designing, implementing, and managing scalable, resilient, and secure systems. Your responsibilities will revolve around driving observability, incident management, and automation practices to ensure the smooth functioning of our systems. In terms of observability and monitoring, you will architect and implement solutions using tools like Grafana, Prometheus, ELK Stack, and Kibana. It will be your responsibility to define actionable alerts, dashboards, and reporting systems, as well as drive proactive monitoring strategies to detect and resolve issues before they impact customers. You will lead the implementation of infrastructure as code (IaC) using tools like Helm and Terraform, manage containerized environments with Docker and Kubernetes in Azure or hybrid cloud setups, and design and optimize CI/CD pipelines using tools like Git, Jenkins, Argo CD, and Flux CD. Additionally, you will define and establish SLIs, SLAs, and SLOs, lead the design and implementation of Incident Management and Problem Management processes, and utilize ITSM tools such as BMC Remedy for effective incident management. Setting up robust on-call processes using tools like Opsgenie or PagerDuty, coordinating and managing escalations during major incidents, collaborating effectively with development teams and other stakeholders, and fostering a culture of ownership and reliability engineering will also be part of your responsibilities. You will be expected to architect resilient and scalable systems with a focus on automation, performance optimization, and security. This includes conducting regular chaos engineering and postmortem analyses to improve system reliability. In terms of qualifications, you should have 8-10 years of hands-on experience in SRE, DevOps, or related roles, a proven track record of designing and managing highly scalable, available, and secure systems, expertise in observability tools, hands-on experience with various technologies, and strong knowledge of ITSM tools and incident management processes. Strong leadership, team management, problem-solving, communication, and collaboration skills are essential for this role. At GlobalLogic, we prioritize a culture of caring, continuous learning and development, interesting and meaningful work, balance, flexibility, and integrity. Join us to work on impactful projects, grow personally and professionally, and contribute to shaping the world through intelligent products and services.,
Posted 2 weeks ago
1.0 - 5.0 years
0 Lacs
karnataka
On-site
As a dedicated Site Reliability Engineer (SRE) - Cloud Ops, you will be a key player in ensuring the stability and scalability of our cloud infrastructure. Your responsibilities will include proactively monitoring infrastructure and application alerts, managing pipelines, and addressing environment-related issues in our dynamic 24/7 operational environment. You will work in a shift-based operation with flexible availability for rotational shifts, ensuring the smooth deployment of updates and releases while maintaining operational excellence. Key Responsibilities - Infrastructure Monitoring and Alert Response: Monitor infrastructure and application alerts to maintain uptime and performance. - Shift-Based Operations: Work in a 24/7 environment with flexible availability for rotational shifts. - Cloud Environment Management: Resolve environment-related issues to ensure stability and efficiency. - Pipeline Management: Oversee CI/CD pipelines for smooth deployment of updates and releases. - Operational Tasks: Handle day-to-day operational activities, including incident management and change management. - Tool Management: Utilize tools like Kubernetes, PagerDuty, and GCP Cloud to support operational activities. Ideal Candidate The ideal candidate should be a B.E/B.Tech graduate with 2+ years of experience in Site Reliability and Cloud Ops. You should have expertise in monitoring tools such as Prometheus, Grafana, and ELK, as well as hands-on experience with Kubernetes. Proficiency in setting up and managing alerting systems like PagerDuty, a basic understanding of GCP services and operations, and strong problem-solving skills for incident management are essential. Additionally, basic knowledge of CI/CD pipelines, automation, and infrastructure-as-code practices is preferred. Skills Required - Monitoring tools (Prometheus, Grafana, ELK) - Incident management - Infrastructure management - GCP (Google Cloud Platform) - CI/CD pipelines - Infrastructure as code - Kubernetes - PagerDuty - Automation - Cloud fundamentals,
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
hyderabad, telangana
On-site
As an experienced AppDynamics Subject Matter Expert (SME), your main responsibility will be to lead the deployment, configuration, and optimization of application performance monitoring (APM) tools. Your extensive knowledge of AppDynamics will be crucial in ensuring the seamless monitoring of application performance availability and user experience. You will collaborate closely with development operations and infrastructure teams to achieve this goal. Your expertise in AppDynamics should include installation, configuration, and management within large-scale enterprise environments. Additionally, you should possess a solid understanding of APM concepts such as transaction tracing, service monitoring, database performance, and end-user experience tracking. A strong grasp of application architectures (monolithic, microservices, containerized) and cloud technologies (AWS, Azure, GCP) will be beneficial in this role. You should also have experience in performance tuning for distributed systems and databases, along with the ability to identify and resolve application performance issues like memory leaks, thread contention, and slow database queries. Furthermore, your knowledge of integrating AppDynamics with other monitoring and incident management tools (e.g., Splunk, PagerDuty, ServiceNow) will be valuable. Experience working with CI/CD pipelines and DevOps practices is also preferred for this position.,
Posted 2 weeks ago
5.0 - 7.0 years
0 Lacs
bengaluru, karnataka, india
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed - we're here to stop breaches, and we've redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily . Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We're also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We're always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters The future of cybersecurity starts with you. About the Role: We are looking for a network software developer who has a solid track record of building network centric software applications. In this role, you will develop tools to improve automated workflows, create UI portals for partner interactions, develop REST API endpoints, and create and maintain microservices to support our network infrastructure. You will work collaboratively with other teams, network engineers, and leads to provide software solutions to decrease deployment times, improve engineer efficiency, and create self-service tools What You'll Do: Supports full lifecycle automated solutions to work with existing and new network technologies running on switches, routers, load balancers, firewalls, and servers Build new features in modern UI Single Page Applications Develop and maintain microservices using Python, Golang, and modern cloud-native technologies Build and enhance gRPC and RESTful APIs Participate in code reviews and provide constructive feedback to other developers Contribute to continuous integration and deployment pipelines Write automated tests for your code Responsible for resolving deployment or tooling outage events and will be part of an On-Call rotation What You'll Need: 5+ years of experience in software development supporting production networks at-scale Solid understanding of networking concepts and protocols. Experience with front end frameworks such as React/Next.js, Vue, Ember, and Lit Internal tooling development experience, including API development and integration experience Hands-on experience with infrastructure-as-code using tools such as Python, Golang, Ansible, YAML, or Netconf/YANG, Chef, and Terraform Experience in building end to end pipelines with CI tools such as Jenkins, Temporal, etc Hands-on experience with cloud-native technologies like Docker and Kubernetes. Knowledge of CI/CD best practices and tools for improving and maintaining performance and code quality Comprehensive knowledge of design metrics, analytics tools, benchmarking activities, and related reporting to identify best practices. Exposure to Monitoring/Observability/IPAM tools such as: Prometheus, Grafana, Splunk, Netbox, PagerDuty, etc. Experience operating in Linux environments Familiarity with Agile/Scrum methodologies Strong communication and collaboration abilities Strong problem-solving skills and the ability to work well under pressure. Bonus Points: Ability to work varying hours to interface with other teams and leads Experience with automating and deploying distributed network services on Linux servers Exposure to Streams processing using Kafka and data visualization reporting experience using Grafana etc is a plus Authored and led successful open source libraries and projects. Contributions to the open source community (GitHub, Stack Overflow, blogging). #LI-VJ1 Benefits of Working at CrowdStrike: Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at for further assistance.
Posted 2 weeks ago
1.0 - 6.0 years
8 - 12 Lacs
gurugram
Work from Office
IMEA (India, Middle East, Africa) India LIXIL INDIA PVT LTD Employee Assignment Hybrid Full Time 30 June 2025 Dealer network sales-managing & appointing new sub-dealers in a given territory by completing all formalities, Regularly meeting and developing relationships with Architects, Builders, interior designers, etc to generate inquiries and close sales, Achievement of targeted sales volumes through Retail & Distribution Sales Procurement of orders from dealers and timely supply there on, Work in coordination with the marketing team for proper and timely display, Conducting Architect & plumber meetings to promote the products, Informing management about the competitors strategy and pricing, Responsible for the payments & collections,
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
haryana
On-site
You are a skilled and experienced Senior Cloud Platform Engineer sought by Unlimit to join a dynamic team and support/enhance the AWS cloud infrastructure. Your role involves designing, deploying, and maintaining scalable, secure, and highly available cloud solutions. This position requires hands-on technical expertise, a commitment to operational excellence, and the ability to provide 24/7 support through rotational shift work. Your key responsibilities include designing, deploying, and maintaining robust, scalable, secure, and cost-effective AWS cloud infrastructure using Terraform, Ansible, Kubernetes, and other modern technologies. You will develop and enhance automation scripts and infrastructure-as-code solutions to optimize deployment, monitoring, and maintenance processes. Additionally, you will implement and manage CI/CD pipelines using GitLab and Jenkins for rapid deployment and operational resilience. Monitoring cloud infrastructure performance, availability, and reliability using observability tools such as Grafana and responding promptly to incidents via PagerDuty is crucial. Providing 24/7 operational support through rotational shift work to ensure continuous uptime and rapid incident resolution is part of your role. Collaboration with development, operations, and security teams to align cloud solutions with business requirements and best practices is essential. You will proactively identify opportunities for infrastructure improvements, efficiency gains, and enhanced system reliability. Participation in disaster recovery planning, backup management, and regular system testing is required. Supporting legacy environments and actively participating in migration efforts from Linux hosts and legacy application platforms to Kubernetes is also a part of your responsibilities. Qualifications: - Bachelor's degree in Computer Science, Information Technology, Engineering, or related field - Mandatory AWS certification (e.g., AWS Solutions Architect Associate or Professional) - Strong hands-on experience with AWS services, infrastructure deployment, and management - Proven expertise with Terraform, Ansible, Kubernetes, GitLab, Jenkins, CI/CD pipelines, and GitOps practices - Proficiency in Python scripting for automation and infrastructure management - Hands-on experience with Docker, including Dockerfiles and containerization technologies - Familiarity with observability tools like Grafana and incident management systems such as PagerDuty - Experience providing 24/7 operational support and working in shifts - Strong understanding of cloud security practices, compliance requirements, and data protection - Excellent problem-solving, analytical, and troubleshooting skills - Exceptional Linux administration skills, preference for RHEL certification holders - Strong communication skills and the ability to work collaboratively in a cross-functional team environment If you are ready to take on this challenging role, join the Unlimit team now!,
Posted 2 weeks ago
5.0 - 8.0 years
0 Lacs
bengaluru, karnataka, india
On-site
Our Mission: 6sense is on a mission to revolutionize how B2B organizations create revenue by predicting customers most likely to buy and recommending the best course of action to engage anonymous buying teams. 6sense Revenue AI is the only sales and marketing platform to unlock the ability to create, manage and convert high-quality pipeline to revenue. Our People: People are the heart and soul of 6sense. We serve with passion and purpose. We live by our Being 6sense values of Accountability, Growth Mindset, Integrity, Fun and One Team. Every 6sensor plays a part in defining the future of our industry-leading technology. 6sense is a place where difference-makers roll up their sleeves, take risks, act with integrity, and measure success by the value we create for our customers. We want 6sense to be the best chapter of your career. We are seeking a highly organized and proactive Senior Program Manager, Engineering Operations to lead critical operational programs across our engineering teams. This role will focus on release management, incident response, change control, and engineering metrics to improve reliability, scalability, and velocity of software delivery. You will also be responsible for scaling operational excellence by supporting engineering best practices, managing on-call escalation programs, and collaborating closely with technical customer support to resolve customer-impacting issues. The ideal ca ndidate thrives in fast-paced environments, communicates with clarity, and has a strong background in engineering or technical operations. Key Responsibilities Release Management Coordinate software releases across multiple engineering teams. Own and maintain a release calendar with consistent communication. Ensure all release-related artifacts (e.g., change tickets, checklists) are completed. Enforce release readiness policies including freeze periods and rollback plans. Lead post-release retrospectives and drive process improvements. Promote engineering standards and best practices to ensure consistent, high-quality deployments across teams . Incident Management Serve as facilitator during high-severity (SEV) incidents. Manage and improve incident response templates, tools, and on-call practices. Ensure timely and effective stakeholder communication during incidents. Lead incident reviews and ensure follow-up actions are completed. Analyze incident trends and recommend preventive improvements. Oversee PagerDuty configuration and escalation policies to ensure 24/7 operational coverage. Manage on-call rotation programs, track escalation health, and continuously optimize team alerting workflows. Change Management Own the change request and approval process ensuring compliance and audit readiness. Partner with engineering teams on planning and reviewing major changes. Maintain documentation for change control processes and policies. Continuously evolve frameworks for assessing change risk and rollout strategies. Engineering Metrics Define, track, and report key delivery and reliability metrics, including: DORA Metrics: Deployment Frequency, Lead Time for Changes, MTTR, Change Failure Rate Cycle Time: Issue creation to production deployment Build visibility into engineering efficiency, throughput, and incident performance. Collaborate with engineering and product leaders to ensure metrics drive action and accountability. Maintain operational dashboards and lead monthly metrics reviews. Identify gaps and support continuous improvement in engineering practices and resource allocation based on metric insights. Cross-Functional Collaboration Partner closely with Technical Customer Support to ensure customer-reported incidents are prioritized, escalated, and resolved effectively. Support readiness programs that prepare engineering teams to respond efficiently to live customer issues. Collaborate with Product Management and Product Design to ensure operational requirements, scalability considerations, and incident learnings inform product planning and user experience decisions. Required Skills & Experience 5-8 years in Engineering Operations, DevOps, or Site Reliability Engineering (SRE). Proven track record managing software releases and high-severity incidents. Strong familiarity with tools such as Jira, PagerDuty, LaunchDarkly, GitHub Actions, Confluence, and LinearB. Exceptional communication skills to interface across technical and non-technical teams. Highly organized with a continuous improvement mindset. Demonstrated experience implementing engineering best practices and on-call management programs. Preferred Qualifications Exposure to ITIL or similar operational governance frameworks. Experience using incident-related metrics (e.g., MTTA, MTTR) and dashboards for analysis. Understanding of Agile/Scrum methodologies and CI/CD pipelines. Prior participation in production readiness reviews or Change Advisory Boards (CABs). Experience collaborating with customer support or success teams to address technical escalations. Background in standardizing operational playbooks and service ownership across engineering teams. Our Benefits: Full-time employees can take advantage of health coverage, paid parental leave, generous paid time-off and holidays, quarterly self-care days off, and stock options. We'll make sure you have the equipment and support you need to work and connect with your teams, at home or in one of our offices. We have a growth mindset culture that is represented in all that we do, from onboarding through to numerous learning and development initiatives including access to our LinkedIn Learning platform. Employee well-being is also top of mind for us. We host quarterly wellness education sessions to encourage self care and personal growth. From wellness days to ERG-hosted events, we celebrate and energize all 6sense employees and their backgrounds. Equal Opportunity Employer: 6sense is an Equal Employment Opportunity and Affirmative Action Employers. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. If you require reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to We are aware of recruiting impersonation attempts that are not affiliated with 6sense in any way. A ll email communications from 6sense will originate from the @6sense.com domain . We will not initially contact you via text message and will never request payments . If you are uncertain whether you have been contacted by an official 6sense employee, reach out to
Posted 2 weeks ago
5.0 - 7.0 years
0 Lacs
bengaluru, karnataka, india
On-site
Job Requisition ID # 25WD89646 Position Overview We are seeking a Senior Cloud Engineer with deep expertise in Amazon Web Services (AWS) to join our engineering team. You will be responsible for designing, implementing, and maintaining scalable, secure, and resilient cloud infrastructure. You will build one-click deployment solutions, manage serverless and containerized applications, and ensure high availability and disaster recovery across environments. This role is highly technical and requires hands-on experience across a broad spectrum of AWS services, infrastructure as code (IaC), CI/CD, and automation tools. Familiarity with Microsoft Azure is also beneficial for hybrid or multi-cloud strategies. Key Responsibilities: Design, build, and maintain scalable and secure AWS infrastructure using services such as Lambda, EC2, ECS, S3, RDS, DynamoDB, API Gateway, CloudFront, etc Build and maintain one-click deployment pipelines using AWS SAM, CDK, CloudFormation, or Terraform Implement and maintain CI/CD pipelines using AWS Code Pipeline, Code Build, GitHub Actions, or similar tools Develop and deploy serverless applications using AWS Lambda, API Gateway, and DynamoDB Write and maintain automation scripts in Python and JavaScript/Node.js Design and implement active-passive disaster recovery (DR) strategies across AWS regions Set up comprehensive monitoring, logging, and alerting using AWS CloudWatch, X-Ray, SNS, and third-party tools like Datadog, New Relic, or PagerDuty Optimize cost, performance, and security of cloud resources. Perform routine infrastructure maintenance, updates, and security patches Collaborate with developers to support full application lifecycle from development to production Participate in on-call rotations and provide root-cause analysis for incidents Minimum Qualification: 5+ years of hands-on experience in cloud engineering, with 3+ years on AWS Strong proficiency in AWS core services, including EC2, S3, IAM, VPC, CloudFormation, Lambda, API Gateway, CloudFront, RDS, DynamoDB, and SNS Proficient with Infrastructure as Code tools such as AWS SAM, CDK, CloudFormation, or Terraform Strong experience in CI/CD pipeline creation and automation. Solid scripting and programming skills in Python and JavaScript (Node.js) Experience in designing and implementing disaster recovery plans in AWS Knowledge of networking, security best practices, and cost optimization in AWS Experience with observability tools and designing effective monitoring and alerting systems Strong problem-solving skills and ability to troubleshoot complex system issues Preferred Qualifications: AWS Certified Solutions Architect - Professional or DevOps Engineer certification Experience with containerization (Docker, ECS, EKS) and Kubernetes Experience with compliance frameworks like SOC2, HIPAA, or ISO 27001 Knowledge of other cloud providers (GCP, Azure) is a plus Additional Responsibilities (Azure): Support hybrid or multi-cloud deployments leveraging Microsoft Azure where needed Integrate Azure services such as Azure Functions, Azure App Services, and Azure Storage with AWS-based systems Contribute to architecture decisions involving Azure-native tools alongside AWS services Knowledge and experience with Microsoft Azure services such as Azure Functions, Azure DevOps, Azure Kubernetes Service (AKS), and Azure Monitor Ability to design and manage solutions in hybrid or multi-cloud environments #L!-KS2 Learn More About Autodesk Welcome to Autodesk! Amazing things are created every day with our software - from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made. We take great pride in our culture here at Autodesk - it's at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world. When you're an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future Join us! Salary transparency Salary is one part of Autodesk's competitive compensation package. Offers are based on the candidate's experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package. Diversity & Belonging We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: Are you an existing contractor or consultant with Autodesk Please search for open jobs and apply internally (not on this external site).
Posted 2 weeks ago
5.0 - 10.0 years
40 - 100 Lacs
bengaluru, delhi / ncr, mumbai (all areas)
Hybrid
Experience in Site Reliability Engineering, DevOps,managing teams, including mentoring and developing engineers.Prometheus, Grafana, ELK Stack, Splunk, Datadog, New Relic, AWS, GCP, Azure,Docker, Kubernetes,Python, Go, Bash, or simila.
Posted 2 weeks ago
3.0 - 7.0 years
0 Lacs
karnataka
On-site
Onsurity is a rapidly growing employee healthcare benefits platform that provides flexible and customized healthcare subscriptions for SMEs, start-ups, and enterprises. The belief is that access to healthcare benefits should not be a luxury, which strengthens the commitment towards making healthcare affordable and accessible for all. Subscriptions offered by Onsurity include discounts on medicine orders, health checkups, fitness plans, free doctor teleconsultations, and insurance, among other benefits. Inclusivity is a core value, ensuring that plans are not limited to full-time employees but also cover contractual workers, interns, freelancers, and more. The role involves hands-on experience in writing DevOps-as-Code to enable software application development teams to specify their DevOps pipeline flows, infrastructure configurations, and deployment settings in code that can be versioned and stored alongside application code. Responsibilities include developing scripts to automate visualization, defining deployment packages, infrastructure, environments, release templates, etc. Building software and systems to manage platform infrastructure and applications, improving reliability, quality, and time-to-market of software solutions, and understanding stakeholder needs are crucial aspects of the position. The position also entails testing and examining code written by others, analyzing results, ensuring system safety and security against cybersecurity threats, identifying technical problems, and developing software updates and fixes. Collaborating with software developers and engineers to ensure development follows best practices, building tools to reduce errors and enhance customer experience, integrating software with internal back-end systems, and performing root cause analysis of production errors are key responsibilities. In terms of measuring and monitoring, the role involves running the production environment by monitoring availability, taking a holistic view of system health, and measuring and optimizing system performance to enhance capabilities, address customer needs, and drive innovation. Gathering and analyzing metrics from operating systems and applications to assist in performance tuning and fault finding are critical tasks. Innovation plays a significant role, with responsibilities including building and implementing new development tools and infrastructure, being hands-on in containerizing and clustering using technologies like Compose, Docker, Dockerfiles, Kubernetes, Nginx, and having experience with tools such as GitHub, GitLab, Jenkins, and databases like MongoDB, MySQL, PostgreSQL, Redis, and NoSQL databases like ElasticSearch, Kafka, MongoDB, Redis. Knowledge of monitoring tools such as CloudWatch, Pagerduty, Sentry, networking concepts including Firewalls, NAT, Port, Subnetting, VPC, VPN, and operational aspects like HA backups are essential for the role at Onsurity.,
Posted 3 weeks ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
73564 Jobs | Dublin
Wipro
27625 Jobs | Bengaluru
Accenture in India
22690 Jobs | Dublin 2
EY
20638 Jobs | London
Uplers
15021 Jobs | Ahmedabad
Bajaj Finserv
14304 Jobs |
IBM
14148 Jobs | Armonk
Accenture services Pvt Ltd
13138 Jobs |
Capgemini
12942 Jobs | Paris,France
Amazon.com
12683 Jobs |