Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
9.0 - 14.0 years
30 - 35 Lacs
Pune
Work from Office
: Job TitleProduction Specialist, AVP LocationPune, India Role Description Our organization within Deutsche Bank is AFC Production Services. We are responsible for providing technical L2 application support for business applications. The AFC (Anti-Financial Crime) line of business has a current portfolio of 25+ applications. The organization is in process of transforming itself using Google Cloud and many new technology offerings. As an Assistant Vice President, your role will include hands-on production support and be actively involved in technical issues resolution across multiple applications. You will also be working as application lead and will be responsible for technical & operational processes for all application you support. Deutsche Banks Corporate Bank division is a leading provider of cash management, trade finance and securities finance. We complete green-field projects that deliver the best Corporate Bank - Securities Services products in the world. Our team is diverse, international, and driven by shared focus on clean code and valued delivery. At every level, agile minds are rewarded with competitive pay, support, and opportunities to excel. You will work as part of a cross-functional agile delivery team. You will bring an innovative approach to software development, focusing on using the latest technologies and practices, as part of a relentless focus on business value. You will be someone who sees engineering as team activity, with a predisposition to open code, open discussion and creating a supportive, collaborative environment. You will be ready to contribute to all stages of software delivery, from initial analysis right through to production support." What well offer you , 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Accident and Term life Insurance Your key responsibilities Provide technical support by handling and consulting on BAU, Incidents/emails/alerts for the respective applications. Perform post-mortem, root cause analysis using ITIL standards of Incident Management, Service Request fulfillment, Change Management, Knowledge Management, and Problem Management. Manage regional L2 team and vendor teams supporting the application. Ensure the team is up to speed and picks up the support duties. Build up technical subject matter expertise on the applications being supported including business flows, application architecture, and hardware configuration. Define and track KPIs, SLAs and operational metrics to measure and improve application stability and performance. Conduct real time monitoring to ensure application SLAs are achieved and maximum application availability (up time) using an array of monitoring tools. Build and maintain effective and productive relationships with the stakeholders in business, development, infrastructure, and third-party systems / data providers & vendors. Assist in the process to approve application code releases as well as tasks assigned to support to perform. Keep key stakeholders informed using communication templates. Approach support with a proactive attitude, desire to seek root cause, in-depth analysis, and strive to reduce inefficiencies and manual efforts. Mentor and guide junior team members, fostering technical upskill and knowledge sharing. Provide strategic input into disaster recovery planning, failover strategies and business continuity procedures Collaborate and deliver on initiatives and install these initiatives to drive stability in the environment. Perform reviews of all open production items with the development team and push for updates and resolutions to outstanding tasks and reoccurring issues. Drive service resilience by implementing SRE(site reliability engineering) principles, ensuring proactive monitoring, automation and operational efficiency. Ensure regulatory and compliance adherence, managing audits,access reviews, and security controls in line with organizational policies. The candidate will have to work in shifts as part of a Rota covering APAC and EMEA hours between 07:00 IST and 09:00 PM IST (2 shifts). In the event of major outages or issues we may ask for flexibility to help provide appropriate cover. Weekend on-call coverage needs to be provided on rotational/need basis. Your skills and experience 9-15 years of experience in providing hands on IT application support. Experience in managing vendor teams providing 24x7 support. Preferred Team lead role experience, Experience in an investment bank, financial institution. Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience/diploma/certification). Preferred ITIL v3 foundation certification or higher. Knowledgeable in cloud products like Google Cloud Platform (GCP) and hybrid applications. Strong understanding of ITIL /SRE/ DEVOPS best practices for supporting a production environment. Understanding of KPIs, SLO, SLA and SLI Monitoring ToolsKnowledge of Elastic Search, Control M, Grafana, Geneos, OpenShift, Prometheus, Google Cloud Monitoring, Airflow,Splunk. Working Knowledge of creation of Dashboards and reports for senior management Red Hat Enterprise Linux (RHEL) professional skill in searching logs, process commands, start/stop processes, use of OS commands to aid in tasks needed to resolve or investigate issues. Shell scripting knowledge a plus. Understanding of database concepts and exposure in working with Oracle, MS SQL, Big Query etc. databases. Ability to work across countries, regions, and time zones with a broad range of cultures and technical capability. Skills That Will Help You Excel Strong written and oral communication skills, including the ability to communicate technical information to a non-technical audience and good analytical and problem-solving skills. Proven experience in leading L2 support teams, including managing vendor teams and offshore resources. Able to train, coach, and mentor and know where each technique is best applied. Experience with GCP or another public cloud provider to build applications. Experience in an investment bank, financial institution or large corporation using enterprise hardware and software. Knowledge of Actimize, Mantas, and case management software is good to have. Working knowledge of Big Data Hadoop/Secure Data Lake is a plus. Prior experience in automation projects is great to have. Exposure to python, shell, Ansible or other scripting language for automation and process improvement Strong stakeholder management skills ensuring seamless coordination between business, development, and infrastructure teams. Ability to manage high-pressure issues, coordinating across teams to drive swift resolution. Strong negotiation skills with interface teams to drive process improvements and efficiency gains. How well support you . . . About us and our teams Please visit our company website for further information: https://www.db.com/company/company.htm We strive for a culture in which we are empowered to excel together every day. This includes acting responsibly, thinking commercially, taking initiative and working collaboratively. Together we share and celebrate the successes of our people. Together we are Deutsche Bank Group. We welcome applications from all people and promote a positive, fair and inclusive work environment.
Posted 3 weeks ago
6.0 - 8.0 years
5 - 9 Lacs
Pune
Work from Office
: Job Title- Production Support Analyst, AS Location- Pune, India Role Description L2 Technical Application Support What well offer you 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Accident and Term life Insurance Your key responsibilities Provide hands on technical support for a suite of applications/platforms within Deutsche Bank Build up technical subject matter expertise on the applications/platforms being supported including business flows, the application architecture, and the hardware configuration. Resolve service requests submitted by the application end users to the best of L2 ability and escalate any issues that cannot be resolved to L3. Conduct real time monitoring to ensure application SLAs are achieved and maximum application availability (up time). Assist in the process to approve all new releases and production configuration changes, keep stakeholders informed and conduct any release tasks assigned to support. Manage incidents through to resolution keeping all stakeholders abreast of the situation and working to minimize impact wherever possible. Conduct post-mortems of incidents and drive relevant feedback into Incident, Problem and Change management programs. Build and maintain effective and productive relationships with the stakeholders in business, development, infrastructure, and third-party systems / data providers & vendors. Ensure all knowledge is documented and that support runbooks and knowledge articles are kept up to date. Approach support with a proactive attitude, working to improve the environment before issues occur. The candidate may have to work in shifts as part of a rota covering APAC and EMEA hours between 06:30 IST and 10:30 PM IST (2 shifts) Weekend coverage may need to be provided on rotational basis. Your skills and experience 6 to 8 years providing hands on IT support and interacting with application end users. Preferred: Experience in an investment bank, financial institution, or large corporation. Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience or diploma). Good analytical and problem-solving skills. Exceptional written and oral communication skills, including the ability to communicate technical information to a non-technical audience and with executive levels. Understanding of ITIL / SRE best practices for supporting a production environment Preferred: Experience in Google Cloud Understanding of how to get things done in large organizations, where to use processes and how to build and operate a network. Ability to work across countries, regions, and time zones with a broad range of cultures and technical capability. TECHNICAL COMPETENCIES Experience using operating systems such as UNIX, Linux and Wintel from the command line interface. Knowledge of commands need to navigate, troubleshoot issues and provide status of these systems. Preferred: Familiarity with coding language such as JAVA and .Net or Perl/Shell scripting. Preferred: Experience with Java hosting environments like WebSphere, Tomcat etc. Ability to write SQL to extract and patch data in Oracle databases as well as monitor database health and performance. Experience of monitoring tools such as Geneos and New Relic. Experience and hands-on on IBM BPM, Camunda , WAS etc. (preferred) Experience on RPA platforms like Blueprism/Chatbots (Preferred) Experience in Devops/SRE. Knowledge and development experience in Ansible automation. Experience in shell scripting, python. How well support you
Posted 3 weeks ago
10.0 - 15.0 years
7 - 12 Lacs
Hyderabad
Work from Office
The IBM Cloud Platform Compliance team is looking for a talented, innovative and enthusiastic software development manager that will support the team building automation to make our customers succeed. IBM Cloud Platform Compliance has a global cloud presence that continues to grow and expand its reach. Our automation engineering team is responsible for delivering compliance at scale for all IBM Cloud platform services. As a trusted platform, first-rate security, fail-safe reliability and exceptional quality is of the utmost importance.As an IBM Cloud Engineering Manager, you will specialize in ensuring the reliability, resiliency and security of our systems. Bringing a unique blend of knowledge and skills in both software and systems, you will play a key role in analyzing business needs, identifying and solving problems, guiding solutions, and developing a high-performing team of developers and Site Reliability Engineers. You will work in an agile, collaborative environment where we build, deploy, configure and maintain systems for IBM. Working closely with our worldwide teams, you will have a unique opportunity to gain first-hand knowledge of the latest technologies and be supported by a global team of IBMers to grow your own skills and develop your career. Key Responsibilities: Provide guidance, coaching, and support to team members to help them grow professionally and achieve their career goals Maintain a high-performance culture through timely goal setting, feedback and regular conversations with team members Drive a culture of continuous improvement within the development team, encouraging innovation, experimentation, and knowledge sharing Ensure that projects are completed on time, and meet quality standards, supporting the team by removing blockers to progress Monitor project progress, identify risks and issues, and take proactive measures to address them Act as a focal for senior management by providing regular updates on project status, milestones, and deliverables Help manage stakeholder relationships Promote Agile and Design Thinking processes to streamline development workflows and produce technical output that delights our customers Actively participate in organization initiatives and activities to support employee engagement Work in a global team collaborating with IBMers to share recommendations, solutions and ideas Required education Bachelor's Degree Preferred education Master's Degree Required technical and professional expertise 10+ years’ experience working in software development 4+ years’ experience leading a software development team ensuring that commitments are upheld, and stakeholders are well managed Sustained experience in coaching and mentoring technical employees Demonstrated ability to set expectations in others and balance priorities to achieve desired deliverables Passion towards driving and delivering automation solutions to large, complex problems Ability to think analytically and communicate rational plans to colleagues Proven ability to lead and drive collaboration across teams to achieve desired outcomes Excellent written and verbal communication skills Flexibility to work with team members in other time zones Preferred technical and professional experience Understanding of Agile and experience coaching teams adopting the methodology and values Held a prior management position with HR responsibilities for employees Understanding of Cloud/DevSecOps/SRE Experience in Design Thinking Familiarity with any major cloud provider Familiarity with Docker, Kubernetes/OpenShift Knowledge of IT compliance frameworks, e.g. SOC2, PCI, HIPAA
Posted 3 weeks ago
16.0 - 22.0 years
0 - 0 Lacs
Kolkata, Pune, Bengaluru
Hybrid
Greetings from LTIMindtree ! We are hiring for below role : Role: SRE Architect Experience: 16-22 Years Work Location: Kolkata/ Mumbai / Pune / Chennai / Bangalore / Delhi / Noida/ Coimbatore / Hyderabad Job Description:- The SRE Architect will play a pivotal role in consulting SRE related solution across domains, designing and implementing Observable, Scalable, Reliable, and Resilient systems and applications that ensure the highest levels of availability and performance for the applications and services. This role requires a consulting mindset, deep understanding of software engineering, system architecture, and operations, along with a passion to automate repetitive tasks with GenAI tools and scripts. Key Responsibilities SRE Consulting: SRE design and architecture solutioning, capability building and customer interactions on SRE. System Design and Architecture: Lead the design and architecture of scalable and reliable systems that meet the needs of our growing user base and business requirements. Automation and Tooling: Develop and maintain automation tools and frameworks that streamline operations and improve system reliability. Monitoring and Observability: Implement and enhance monitoring, logging, and alerting systems to ensure proactive detection and resolution of issues. Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle current and future demands. Incident Management: Lead incident response efforts, perform root cause analysis, and implement corrective actions to prevent recurrence. Collaboration and Mentorship: Work closely with software engineers, DevOps, and other stakeholders to promote best practices in reliability engineering and provide mentorship to junior team members. Continuous Improvement: Identify areas for improvement in existing systems and processes, and drive initiatives to enhance system reliability and performance. Skillset: Experience: Overall 16-20 years of experience along with minimum of 10+ years of experience in site reliability engineering, DevOps, or a related field, with a proven track record of designing and implementing reliable systems at scale. Technical Skills: Strong programming skills in languages such as Python, Go, or Java/.Net. In-depth knowledge of cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes, Docker). Experience with infrastructure as code (Terraform, Ansible, Puppet). Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk, AppDynamics, Dynatrace, ELK stack). Solid understanding of networking, security, and system performance tuning. Soft Skills: Strong problem-solving and analytical skills. Excellent communication and collaboration abilities. Ability to work in a fast-paced environment and manage multiple priorities. Passion for continuous learning and staying up-to-date with industry trends and technologies. Preferred Skillset: Experience with chaos engineering and resilience testing. Familiarity with service mesh architectures (Istio, Linkerd). Certifications in cloud platforms (Azure Certified Architect, AWS Certified Architect, Google Cloud Professional Architect, etc.). If interested, please share your updated resume on Nidhi.kumari3@ltimindtree.com.
Posted 3 weeks ago
8.0 - 13.0 years
15 - 25 Lacs
Pune
Hybrid
Designation/Role: Lead - SRE DevOps Experience 8+ years Location ( India - Pune ) - Hybrid Job Description : System reliability and availability Incident response and postmortems Observability: monitoring, logging, alerting Infrastructure as Code (Terraform, Ansible, etc.) CI/CD and automation pipelines Scalability and performance tuning Cloud platforms (e.g., AWS/GCP/Azure) Scripting (Python, Bash, etc.) Qualifications: A bachelors or masters degree in computer science, Engineering, or a related discipline. Ability to communicate effectively across multiple audiences, including firm-wide business units, senior leaders, associates and clients. Exceptional interpersonal skills, including teamwork, facilitation, and negotiation. Strong planning and organizational skills Opus Technologies focuses on shaping the future of payments technology. With experience building highly innovative solutions and products, we combine our deep technology proficiency with unmatched domain expertise in Payments and Fintech, enabling us to deliver unparalleled quality and value in everything we do.Were headquartered in Alpharetta, Georgia, USA and our offshore software development center based out of Pune & Hyderabad offices in India. Please visit our website for more information. Supercharge your career with Opus https://opustechglobal.com/company/ https://opustechglobal.com/careers/ https://opustechglobal.com/resources/ Stay connected with us on social media https://www.linkedin.com/company/opustechnologies/ http://www.facebook.com/opustechglobal https://twitter.com/OpusTechGlobal
Posted 3 weeks ago
5.0 - 8.0 years
15 - 18 Lacs
Bengaluru
Work from Office
We are seeking a skilled and proactive engineer with expertise in Kubernetes, Java-based applications, and cloud platforms (AWS/Azure/GCP), along with experience in ServiceNow for support ticket management. The ideal candidate will be responsible for maintaining cloud-native applications, troubleshooting production issues, and ensuring smooth operations through effective ticket handling and resolution. Key Responsibilities: Kubernetes & Cloud Operations: Deploy, manage, and monitor containerized applications using Kubernetes. Maintain and optimize cloud infrastructure (AWS, Azure, or GCP). Automate deployments and infrastructure using CI/CD pipelines and Infrastructure as Code (IaC) tools like Terraform or Helm. Monitor system performance, availability, and security. Java Application Support: Troubleshoot and debug Java-based microservices and APIs. Collaborate with development teams to resolve application issues. Participate in code reviews and suggest performance improvements. ServiceNow (SNOW) Support: Handle incident, problem, and change management via ServiceNow. Raise, track, and resolve support tickets in coordination with internal and external teams. Document root cause analysis (RCA) and resolution steps for recurring issues. Collaboration & Documentation: Work closely with DevOps, QA, and development teams. Maintain technical documentation, runbooks, and knowledge base articles. Participate in on-call rotations and provide timely support for critical issues. Required Skills: Strong hands-on experience with Kubernetes and container orchestration. Proficiency in Java and related frameworks (Spring Boot, REST APIs). Experience with cloud platforms (AWS, Azure, or GCP). Familiarity with ServiceNow or similar ITSM tools. Good understanding of CI/CD tools (Jenkins, GitLab CI, etc.). Knowledge of monitoring tools (Prometheus, Grafana, ELK, etc.) Qualification: Bachelor's or Masters degrees in Computer Science, Computer Engineering, or related technical discipline. Ability to work independently and to adapt to a fast-changing environment. Creative, self-disciplined, and capable of identifying and completing critical tasks independently and with a sense of urgency. Driving Results: A good single contributor and a good team player. Flexible attitude towards work, as per the needs. Proactively identify & communicate issues and risks. Other Personal Characteristics: Dynamic, engaging, self-reliant developer Ability to deal with ambiguity Manage a collaborative and analytical approach Self-confident and humble Open to continuous learning Intelligent, rigorous thinker who can operate successfully amongst bright people.
Posted 3 weeks ago
10.0 - 15.0 years
12 - 17 Lacs
Pune
Work from Office
With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And we’re only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieveRead on. Here, we know that you’re more than your work. That’s why our benefits help you thrive personally and professionally, from wellness programs and tuition reimbursement to U Choose — a customizable expense reimbursement program that can be used for more than 200+ needs that best suit you and your family, from student loan repayment, to childcare, to pet insurance. Our inclusive culture, active and engaged employee resource groups, and caring leaders value every voice and support you in doing the best work of your career. If you’re passionate about our purpose — people —then we can’t wait to support whatever gives you purpose. We’re united by purpose, inspired by you. Site Reliability Engineers at UKG are team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation. Site Reliability Engineers must have a passion for learning and evolving with current technology trends. They strive to innovate and are relentless in their pursuit of a flawless customer experience. They have an “automate everything” mindset, helping us bring value to our customers by deploying services with incredible speed, consistency and availability. Primary/Essential Duties and Key Responsibilities Engage in and improve the lifecycle of services from conception to EOL, includingsystem design consulting, and capacity planning Define and implement standards and best practices related toSystem Architecture, Service delivery, metrics and the automation of operational tasks Support services, product & engineering teams by providing common tooling and frameworks to deliver increased availability and improved incident response. Improve system performance, application delivery and efficiency through automation, process refinement, postmortem reviews, and in-depth configuration analysis Collaborate closely with engineering professionals within the organization to deliver reliable services Identify and eliminate operational toil by treating operational challenges as a software engineering problem Actively participate in incident response, including on-call responsibilities Partner with stakeholders to influence and help drive the best possible technical and business outcomes Guide junior team members and serve as a champion for Site Reliability Engineering Engineering degree, or a related technical discipline, and 10+years of experience in SRE. Experience coding in higher-level languages (e.g., Python, Javascript, C++, or Java) Knowledge of Cloud based applications & Containerization Technologies Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing Ability to analyze current technology utilized and engineering practices within the company and develop steps and processes to improve and expand upon them Working experience with industry standards like Terraform, Ansible. (Experience, Education, Certification, License and Training) Must have hands-on experience working within Engineering or Cloud. Experience with public cloud platforms (e.g. GCP, AWS, Azure) Experience in configuration and maintenance of applications & systems infrastructure.Experience with distributed system design and architecture Experience building and managing CI/CD Pipelines Where we’re going UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet it’s our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow! Disability Accommodation UKGCareers@ukg.com
Posted 3 weeks ago
8.0 - 12.0 years
10 - 14 Lacs
Bengaluru
Work from Office
FICO (NYSEFICO) is a leading global analytics software company, helping businesses in 100+ countries make better decisions. Join our world-class team today and fulfill your career potential! The Opportunity "We are seeking an experienced DevOps Engineer to join our development team to assist in the continuing evolution of our Platform Orchestration product. You will be able to demonstrate the required potential and technical curiosity to work on software that utilizes a range of leading-edge technologies and integration frameworks. Staff training, investment and career growth form an important part of our team ethos. Consequently, you will gain exposure to different software validation techniques supported by industry-standard engineering processes that will help to grow your skills and experience." - VP, Software Engineering. What Youll Contribute Build and maintain CI/CD pipelines for multi-tenant deployments using Jenkins and GitOps practices. Manage Kubernetes infrastructure (AWS EKS), Helm charts, and service mesh configurations (ISTIO). Use kubectl, Lens, or other dashboards for real-time workload inspection and troubleshooting. Evaluate security, stability, compatibility, scalability, interoperability, monitorability, resilience, and performance of our software. Support development and QA teams with code merge, build, install, and deployment environments. Ensure continuous improvement of the software automation pipeline to increase build and integration efficiency. Oversee and maintain the health of software repositories and build tools, ensuring successful and continuous software builds. Verify final software release configurations, ensuring integrity against specifications, architecture, and documentation. Perform fulfillment and release activities, ensuring timely and reliable deployments. What Were Seeking A Bachelors or Masters degree in Computer Science, Engineering, or a related field. 812 years of hands-on experience in DevOps or SRE roles for cloud-native Java-based platforms. Deep knowledge of AWS Cloud Services (EKS, IAM, CloudWatch, S3, Secrets Manager), including networking and security components. Strong experience with Kubernetes, Helm, ConfigMaps, Secrets, and Kustomize. Expertise in authoring and maintaining Jenkins pipelines integrated with security and quality scanning tools. Hands-on experience with infrastructure provisioning tools such as Docker and CloudFormation. Familiarity with CI/CD pipeline tools and build systems including Jenkins and Maven. Experience administering software repositories such as Git or Bitbucket. Proficient in scripting/programming languages such as Ruby, Groovy, and Java. Proven ability to analyze and resolve issues related to performance, scalability, and reliability. Solid understanding of DNS, Load Balancing, SSL, TCP/IP, and general networking and security best practices. Our Offer to You An inclusive culture strongly reflecting our core valuesAct Like an Owner, Delight Our Customers and Earn the Respect of Others. The opportunity to make an impact and develop professionally by leveraging your unique strengths and participating in valuable learning experiences. Highly competitive compensation, benefits and rewards programs that encourage you to bring your best every day and be recognized for doing so. An engaging, people-first work environment offering work/life balance, employee resource groups, and social events to promote interaction and camaraderie. Why Make a Move to FICO At FICO, you can develop your career with a leading organization in one of the fastest-growing fields in technology today Big Data analytics. Youll play a part in our commitment to help businesses use data to improve every choice they make, using advances in artificial intelligence, machine learning, optimization, and much more. FICO makes a real difference in the way businesses operate worldwide Credit Scoring FICO Scores are used by 90 of the top 100 US lenders. Fraud Detection and Security 4 billion payment cards globally are protected by FICO fraud systems. Lending 3/4 of US mortgages are approved using the FICO Score. Global trends toward digital transformation have created tremendous demand for FICOs solutions, placing us among the worlds top 100 software companies by revenue. We help many of the worlds largest banks, insurers, retailers, telecommunications providers and other firms reach a new level of success. Our success is dependent on really talented people just like you who thrive on the collaboration and innovation thats nurtured by a diverse and inclusive environment. Well provide the support you need, while ensuring you have the freedom to develop your skills and grow your career. Join FICO and help change the way business thinks! Learn more about how you can fulfil your potential at www.fico.com/Careers FICO promotes a culture of inclusion and seeks to attract a diverse set of candidates for each job opportunity. We are an equal employment opportunity employer and were proud to offer employment and advancement opportunities to all candidates without regard to race, color, ancestry, religion, sex, national origin, pregnancy, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. Research has shown that women and candidates from underrepresented communities may not apply for an opportunity if they dont meet all stated qualifications. While our qualifications are clearly related to role success, each candidates profile is unique and strengths in certain skill and/or experience areas can be equally effective. If you believe you have many, but not necessarily all, of the stated qualifications we encourage you to apply. Information submitted with your application is subject to theFICO Privacy policy at https://www.fico.com/en/privacy-policy
Posted 3 weeks ago
5.0 - 8.0 years
7 - 10 Lacs
Hyderabad
Work from Office
Required Skills Experience5 to 8 years Shift Timings :EST Time Zome Location Hyderabad / Proven experience in a DevOps, SRE, or Systems Engineering role with a focus on automation. Strong proficiency with Ansible, including creating complex playbooks and custom roles. Direct experience with Red Hat Ansible Automation Platform (AAP) is essential. Hands-on experience with Terraform** for managing infrastructure as code, including the development of reusable Terraform modules. In-depth knowledge of VMware vSphere administration and automation (e.g., using PowerCLI or API). Experience managing and automating AWS cloud services (e.g., EC2, S3, VPC, IAM, AWS Lambda). Solid experience in Windows Server administration and management. Proficiency in Python scripting for automation and integration. Experience with the ELK Stack for centralized logging and monitoring. Familiarity with enterprise storage solutions, specifically Pure Storage. Willingness to participate in an on-call rotation and a strong sense of ownership for production systems. Excellent problem-solving, communication, and collaboration skills.
Posted 3 weeks ago
3.0 - 5.0 years
4 - 8 Lacs
Hyderabad
Work from Office
Role Purpose The purpose of the role is to resolve, maintain and manage clients software/ hardware/ network based on the service requests raised from the end-user as per the defined SLAs ensuring client satisfaction Do Ensure timely response of all the tickets raised by the client end user Service requests solutioning by maintaining quality parameters Act as a custodian of clients network/ server/ system/ storage/ platform/ infrastructure and other equipments to keep track of each of their proper functioning and upkeep Keep a check on the number of tickets raised (dial home/ email/ chat/ IMS), ensuring right solutioning as per the defined resolution timeframe Perform root cause analysis of the tickets raised and create an action plan to resolve the problem to ensure right client satisfaction Provide an acceptance and immediate resolution to the high priority tickets/ service Installing and configuring software/ hardware requirements based on service requests 100% adherence to timeliness as per the priority of each issue, to manage client expectations and ensure zero escalations Provide application/ user access as per client requirements and requests to ensure timely solutioning Track all the tickets from acceptance to resolution stage as per the resolution time defined by the customer Maintain timely backup of important data/ logs and management resources to ensure the solution is of acceptable quality to maintain client satisfaction Coordinate with on-site team for complex problem resolution and ensure timely client servicing Review the log which Chat BOTS gather and ensure all the service requests/ issues are resolved in a timely manner Deliver NoPerformance ParameterMeasure1.100% adherence to SLA/ timelines Multiple cases of red time Zero customer escalation Client appreciation emails Mandatory Skills: SRE Operations. Experience3-5 Years.
Posted 3 weeks ago
5.0 - 8.0 years
15 - 25 Lacs
Chennai, Bengaluru
Hybrid
Job Profile: Strong Knowledge in Linux internals (Preferable RHEL / Ubuntu) Essential Knowledge in Windows internals Comprehensive understanding in DevOps / SRE, IaC and 12 Factor Principles Excellent hands-on experience in configuration management, orchestration and IaC tools (Ansible, Jenkins, Terraform) Strong understanding of Virtualization Technologies (KVM / Libvirt / oVirt / KubeVirt. OVM, Openstack) Strong understanding of Software Defined Storage Technologies (CEPH, GlusterFS) Strong understanding of Repository and Artifact management Tools (Red Hat Satellite, Spacewalk, Nexus) Strong understanding of Container Technologies (Docker, Kubernetes, Openshift) Strong understanding of ELK and its beats (Auditbeat, FileBeat) Strong understanding of OS Compliance Policies (CIS Benchmark) Agile methodologies and its ceremonies Architect, write and implement software that improves the stability, scalability, availability of products. Own multiple services and have the authonomy to do what suits the business and our customers in IT. Solve occurring problems and create solutions and automation to prevent them from happen again. Plan for reliability for systems to work across multi datacenter/environment and handle the outages. Conceptual understanding about infrastructure and how it works, DNS (Authoritive and Non-Authoritive DNS, Dynamic and bind DNS, Forwarder) SSL Communication (Handshake of SSL traffic, Cipher Suites, Enc Algorithyms,) Active Directory (Security OUs, policies) Certificates (SAN, client-authentication, keystores, mutual ssl) Loadbalancers / Site Selectors / Firewall Vault Tools (Cyberark / Hashicorp) High Availability Knowledge about API communications (Rest/Soap), developing a new consumer/publisher for any API. Excellent Scripting in Groovy (writing Jenkins Files) Bash / Powershell Python GITOPS driven configuration management and deployment. Familiar and openminded to Opensource technologies Team player / quick adaptation to context change Security Awareness Strong understanding of troubleshooting. Deep dive to an issue, read logs, track the clues and identify the problems. Strategic Thinking with Research and Development minds Candidates Profile & Technical Competency: BE/B Tech, MCA/BCA with 5+ years of experience as a Linux DevOps Engineer Hands on experience in all above technologies Ready for 6 months contract role at Bengaluru/Chennai Can you join 15 days
Posted 3 weeks ago
5.0 - 8.0 years
9 - 15 Lacs
Bengaluru
Work from Office
Develop data pipeline with python Proficiency in Python, SQL, Databricks ,Mongo DB ,Azure required Write SQL Quires ,utilize Databricks for analytics ,manage MongoDB Database ,implement Azure cloud services. Exp-6 to 7 9140679821 Drop cv on whatsapp
Posted 3 weeks ago
7.0 - 12.0 years
10 - 20 Lacs
Hyderabad, Chennai, Bengaluru
Hybrid
Dear candidate, Greetings from Wipro!!! We are hiring Devops SRE with python scripting -Bangalore/Hyderabad/Chennai. Exp: 7 to 15 years. Job location: Bangalore and Hyderabad, Chennai Note: pls share only who can join in 0 to 15 days. JD SRE - Very good in Unix, Jenkins and Scripting python. Should be proficient in creating Workflows in Jenkins and Ansible playbooks Should have understanding Monitoring Tools like Grafana, Splunk, Epic and Inginx Should be able to understand of Databases like MySQL/Oracle/Cassandra Very good in DevOps process and troubleshooting Issues Experience in Production Deployment and On-Call Support. Good to have knowledge in Spinnaker Excellent Analytical, Troubleshooting and problem-solving skills Experience in solving problems and working with a team to resolve large scale production environment issues. To drive the team during Production Maintenances, Outages and Load test activities. Please share profile to kasturi.mettin@wipro.com with below details. Total exp: Rel: CTC:: ECTC:: NP: Current Location: Pref Location: Interview Time: Thanks, Kasturi Mettin kasturi.mettin@wipro.com
Posted 3 weeks ago
6.0 - 11.0 years
6 - 10 Lacs
Pune
Work from Office
Over 6 years of experience in DevOps, Infrastructure Automation, or SRE. Hands-on experience with CI/CD tools like Jenkins, GitLab CI/CD, or CircleCI. Strong expertise in Docker, Kubernetes, and Helm charts. Deep knowledge of Azure infrastructure services. Experience with Infrastructure as Code (IaC) tools Terraform, Ansible, or CloudFormation. Proficiency in Python, Bash, or Go scripting languages. Solid understanding of networking, load balancing, and security best practices. Experience with log management and monitoring tools (eg, ELK, Prometheus, Grafana). Design, implement, and manage CI/CD pipelines for application deployment. Automate infrastructure provisioning using Terraform, Ansible, or CloudFormation. Worked with cloud platforms (with deep knowledge of Azure) to optimize infrastructure. Managed Docker/Kubernetes environments for scalable deployments. Implemented monitoring, logging, and alerting using tools like Prometheus, Grafana, ELK, or Datadog. Ensured security best practices in IAM, network security, and vulnerability management. Collaborated with developers to enhance build and deployment processes. Applied SRE principles to improve system reliability. Automated and streamline IT operations for better efficiency. Good to have Experience with AWS or GCP in addition to Azure. Exposure to Splunk for log analysis. Background in incident response and disaster recovery strategies. Familiarity with Go in automation tasks. Prior experience in cross-functional teams with agile/devops workflows.
Posted 3 weeks ago
5.0 - 10.0 years
20 - 35 Lacs
Bengaluru
Remote
Role Role : Site Reliability Engineer (SRE) Location : Remote Work Hours : US Working Hours (Weekends on Rotation Basis) Upsmart Solutions At Upsmart Solutions, were focused on delivering high-performing digital solutions backed by strong engineering teams. Were looking for a skilled and proactive Site Reliability Engineer (SRE) to support and enhance the performance of systems that impact thousands of users on both buyer and seller sides. This role is ideal for someone with prior experience in high-traffic e-commerce and/or video platforms like Twitch, Whatnot, etc. You will collaborate with cross-functional teams to troubleshoot issues, build reliable systems, and maintain high availability. A strong background in Java and NodeJS is essential, along with excellent communication skills and a customer-first mindset. Objectives of this role: Ensure high availability, reliability, and performance of production systems. Handle escalated technical issues impacting users and vendors, driving quick and lasting resolutions. Collaborate with Engineering teams to improve observability, alerting, and system robustness. Own incident management, postmortems, and RCA documentation. Continuously improve automation for monitoring, deployment, and infrastructure. Key Responsibilities: Monitor system performance and troubleshoot production issues. Manage infrastructure reliability for platforms built on Java and NodeJS. Collaborate with development teams to optimize applications for scale and performance. Build internal tools for improved operational efficiency. Provide on-call support during US hours and on a rotational weekend basis. Maintain detailed records of incidents, fixes, and preventive measures. Required Skills and Qualifications: Minimum 5 years of experience in SRE or DevOps roles. Hands-on expertise in Java and NodeJS . (Mandatory) Prior experience supporting e-commerce or video streaming platforms . Proven troubleshooting experience across frontend, backend, and infrastructure layers. Strong grasp of system design, scalability, and observability. Excellent verbal and written communication skills. Preferred Skills and Qualifications: Experience with cloud platforms (AWS, GCP, or Azure). Familiarity with CI/CD pipelines, Docker, Kubernetes, and monitoring tools (Grafana, Prometheus, etc.). Incident response and RCA reporting experience.
Posted 3 weeks ago
8.0 - 13.0 years
50 - 60 Lacs
Pune
Hybrid
Responsibilities As an expert, the IT Resiliency Lead will be on a team that will deliver end-to-end technical resiliency solutions to the organization, utilizing the latest technologies and leveraging automation mechanisms for reducing recovery times. The Resiliency Architect should have a solid understanding of program management and leadership skills to engage various teams. Disaster Recovery Orchestration Tools Configure and integrate interfaces with existing systems with the DR Orchestration tool. Integrate project management, disaster recovery and functional business expertise to create a superior customized solution for the Resiliency Team. Work with teams to assess and implement high availability and seamless failover resiliency mechanisms across multiple layers of the application and infrastructure stacks. Develop documentation for onboarding of applications onto this toolset, create training modules, and apply project manager expertise to ensure project milestones are met. Apply new technologies and design of highly complex infrastructure and software solutions.
Posted 3 weeks ago
8.0 - 13.0 years
10 - 15 Lacs
Bengaluru
Work from Office
Project description We are seeking a highly skilled and motivated DevOps Engineer with 8+ years of experience to join our engineering team. You will work in a collaborative environment, automating and streamlining processes related to infrastructure, development, and deployment. As a DevOps Specialist, you will help implement and manage CI/CD pipelines, configure on-prem Windows OS infrastructure, and ensure the reliability and scalability of our systems. The system is on Windows with Microsoft SQL. Responsibilities CI/CD Pipeline ManagementDesign from scratch, implement, and manage automated build, test, and deployment pipelines to ensure smooth code integration and delivery. Infrastructure as Code (IaC)Develop and maintain infrastructure using tools for automated provisioning and management. System Monitoring & MaintenanceSet up monitoring systems for production and staging environments, analyze system performance, and provide solutions to increase efficiency. Deploy and manage configuration using fit-to-purpose tools and scripts with version controls, CI, etc. CollaborationWork closely with software developers, QA teams, and IT staff to define, develop, and improve DevOps processes and solutions. Automation & ScriptingCreate and maintain custom scripts to automate manual processes for deployment, scaling, and monitoring. SecurityImplement security practices and ensure compliance with industry standards and regulations related to cloud infrastructure. Troubleshooting & Issue ResolutionDiagnose and resolve issues related to system performance, deployments, and infrastructure. Drive DevOps thought leadership and delivery experience to the offshore client delivery team. Implement DevOps best practices based on developed patterns. SkillsMust have Total 9 to 12 years of experience as a DevOps Engineer 3+ years of experience in AWS Excellent knowledge of DevOps toolchains like GitHub Actions /GitHub Co-pilot Self-starter, capable of driving solutions from 0 to 1 and able to deliver projects from scratch Familiarity with containerization and orchestration tools (Docker, Kubernetes) Working understanding of platform security constructs Good exposure to Monitoring tools/Dashboards like Grafana, Obstack, or similar monitoring solutions Experience of working with Jira, Agile SDLC practices Expert knowledge of CI/CD Excellent written and verbal communication skills, strong collaboration, and teamwork skills Proficient in scripting languages like Python and PowerShell, and Database knowledge of MS SQL Experience with Windows or IIS, including installation, configuration, and maintenance Strong troubleshooting skills, with the ability to think critically, work under pressure, and resolve complex issues Excellent communication skills with the ability to work cross-functionally with development, operations, and IT teams Security Best PracticesKnowledge of security protocols, network security, and compliance standards Adaptability to new learning and strong attention to detail with a proactive approach to identifying issues before they arise Nice to have Cloud CertificationsAWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or equivalent. IAC pipelines and best practice Snyk, sysdiag knowledge Worked on windows OS, SRE, monitoring on Prometheus
Posted 4 weeks ago
5.0 - 10.0 years
15 - 30 Lacs
Hyderabad
Work from Office
We Advantum Health Pvt. Ltd - US Healthcare MNC looking for DevOps Analyst. We Advantum Health Private Limited is a leading RCM and Medical Coding company, operating since 2013. Our Head Office is located in Hyderabad, with branch operations in Chennai and Noida. We are proud to be a Great Place to Work certified organization and a recipient of the Telangana Best Employer Award. Our office spans 35,000 sq. ft. in Cyber Gateway, Hitech City, Hyderabad Job Title: DevOps Analyst Location: Hitech City, Hyderabad, India Work from office Ph: 9177078628, 7382307530, 9059683624 Address: Advantum Health Private Limited, Cyber gateway, Block C, 4th floor Hitech City, Hyderabad. Location: https://www.google.com/maps/place/Advantum+Health+India/@17.4469674,78.3747158,289m/data=!3m2!1e3!5s0x3bcb93e01f1bbe71:0x694a7f60f2062a1!4m6!3m5!1s0x3bcb930059ea66d1:0x5f2dcd85862cf8be!8m2!3d17.4467126!4d78.3767566!16s%2Fg%2F11whflplxg?entry=ttu&g_ep=EgoyMDI1MDMxNi4wIKXMDSoASAFQAw%3D%3D Job Summary: We are seeking a proactive DevOps Analyst to join our team, focusing on automation and product scaling support. In this role, you will play a critical part in optimizing deployment processes, ensuring system reliability, and supporting the scaling of our products to meet growing demand. You will work closely with development, QA, and operations teams to implement automation solutions and troubleshoot production issues effectively Key Responsibilities: Develop, maintain, and enhance automation scripts and tools to streamline deployment, configuration, and monitoring processes. Collaborate with engineering teams to support product scaling, ensuring high availability and performance. Monitor system health, identify bottlenecks, and proactively recommend improvements. Participate in incident management and root cause analysis for production issues. Manage CI/CD pipelines and integrate automated testing and deployment workflows. Support infrastructure as code (IaC) initiatives using tools like Terraform, Ansible, or CloudFormation. Assist in capacity planning and load testing to prepare systems for scale. Document operational procedures, automation workflows, and troubleshooting guides. Stay current with DevOps best practices, tools, and emerging technologies. Required Skills and Qualifications: Bachelors degree in Computer Science, Information Technology, or related field, or equivalent experience. Proven experience in a DevOps or SRE role with a focus on automation and system scaling. Proficiency with scripting languages such as Python, Bash, or PowerShell. Hands-on experience with CI/CD tools like Jenkins, GitLab CI, or Azure DevOps. Familiarity with container orchestration platforms like Kubernetes or Docker Swarm. Experience with cloud platforms such as AWS, Azure, or Google Cloud. Strong understanding of Linux/Unix system administration. Knowledge of infrastructure automation tools like Terraform, Ansible, or Puppet. Experience monitoring tools such as Prometheus, Grafana, ELK stack, or similar. Excellent problem-solving skills and ability to work in a fast-paced environment. Strong communication skills to collaborate effectively across teams Follow us on LinkedIn, Facebook, Instagram, Youtube and Threads for all updates: Advantum Health Linkedin Page: https://www.linkedin.com/showcase/advantum-health-india/ Advantum Health Facebook Page: https://www.facebook.com/profile.php?id=61564435551477 Advantum Health Instagram Page: https://www.instagram.com/reel/DCXISlIO2os/?igsh=dHd3czVtc3Fyb2hk Advantum Health India Youtube link: https://youtube.com/@advantumhealthindia-rcmandcodi?si=265M1T2IF0gF-oF1 Advantum Health Threads link: https://www.threads.net/@advantum.health.india HR Dept, Advantum Health Pvt Ltd Cybergateway, Block C, Hitech City, Hyderabad Ph: 9177078628, 7382307530, 9059683624
Posted 4 weeks ago
12.0 - 20.0 years
45 - 55 Lacs
Hyderabad
Work from Office
Incharge Run position Job Description: Head Software Maintenance Position Overview The Head of Software Maintenance will be responsible for overseeing the maintenance, support, and continuous improvement of the organizations software systems. This role ensures optimal performance, security, and reliability of all software applications while managing a team of engineers and collaborating with other departments to address business needs. Key Responsibilities 1. Software Maintenance & Support Oversee the maintenance and troubleshooting of enterprise applications, ensuring high availability and performance. Develop and implement strategies for proactive software monitoring, issue resolution, and system optimization. Lead the identification and resolution of bugs, security vulnerabilities, and performance bottlenecks. 2. Team Leadership & Management Manage and mentor a team of software maintenance engineers and support specialists. Define clear roles, responsibilities, and KPIs for the maintenance team. Foster a culture of continuous learning and process improvement. 3. Process Improvement & Automation Establish and optimize software maintenance processes, including version control, patch management, and rollback strategies. Identify opportunities for automation to improve efficiency and reduce downtime. Ensure adherence to ITIL best practices and industry standards. 4. Collaboration & Stakeholder Management Work closely with software development, infrastructure, and business teams to align maintenance strategies with organizational goals. Act as the escalation point for critical software issues impacting business operations. Communicate effectively with leadership on system health, risks, and improvement plans. 5. Security & Compliance Ensure compliance with data security regulations and industry standards. Oversee the implementation of security patches and software updates. Conduct periodic audits to assess system vulnerabilities and risks. 6. Vendor & Third-Party Management Manage relationships with third-party software vendors and service providers. Oversee software licensing, renewals, and support contracts. Evaluate vendor performance and negotiate service-level agreements (SLAs). Key Requirements 1. Education & Experience Bachelors or Masters degree in Computer Science, Information Technology, or a related field. 10+ years of experience in software maintenance, IT operations, or application support. 5+ years of leadership/management experience in a similar role. 2. Technical Skills Strong expertise in software maintenance methodologies, troubleshooting, and debugging. Proficiency in cloud platforms (AWS, Azure, or GCP), databases, and enterprise applications. Experience with monitoring tools, IT service management (ITSM) tools, and automation frameworks. Understanding of cybersecurity best practices and compliance frameworks. 3. Leadership & Soft Skills Excellent leadership, communication, and stakeholder management skills. Strong analytical and problem-solving capabilities. Ability to work in a fast-paced and high-pressure environment. Preferred Qualifications •ITIL certification or relevant IT service management experience. •Experience in fintech, banking, or high-availability system environments. •Exposure to DevOps, CI/CD, and Agile methodologies. Note: Position is for incharge BHIM RUN and not HEAD Skill set - Dev Ops, CI, CD, SRE
Posted 1 month ago
5.0 - 8.0 years
14 - 24 Lacs
Kochi
Hybrid
We are looking for someone who thrives in automation, system observability, and high-scale operations, while also supporting CI/CD and deployment pipelines. You will blend operational execution with engineering rigor to support system reliability, incident response, and automation at scale. This role provides a unique opportunity to grow into full-fledged SRE responsibilities while working in tight coordination with our global reliability strategy. Responsibilities: Maintain, standardize, and enhance CI/CD pipelines (GitHub Actions, Azure Pipelines, GitLab). Automate testing, deployment, and rollback processes. Champion end-to-end CI/CD workflow reliabilityincluding build validation, environment consistency, and deployment rollbacks. Deploy and manage observability tools (Datadog, Grafana, Prometheus, ELK). Assist in root cause analysis using telemetry and logs. Maintain alerting systems and participate in incident drills. Shadow and support Houston-based SRE team during follow-the-sun incident response. Create postmortem documentation for incidents and track remediation tasks. Develop scripts and tooling to reduce operational toil. Contribute to performance tuning of PostgreSQL and containerized services. Assist in distributed system optimization efforts (AKKA.NET knowledge is a bonus). Participate in rollout strategies, canary releases, and availability planning. Requirements: 5+ years in DevOps, SRE, or Infrastructure Engineering. Strong scripting ability (Python, Bash, PowerShell). Experience in managing Kubernetes clusters and container-based deployments. Working knowledge of SQL databases and performance optimization. Hands-on experience with CI/CD tools and source control systems (GitHub, GitLab). Exposure to monitoring and observability platforms (Datadog, Prometheus, ELK). Experience with incident management and postmortems. Familiarity with distributed systems (bonus: AKKA.NET or similar frameworks). Infrastructure as Code (Terraform) and GitOps practices. Exposure to global operations teams and 24/7 handover workflows.
Posted 1 month ago
10.0 - 16.0 years
30 - 45 Lacs
Hyderabad
Work from Office
Position Overview The Head of Software Maintenance will be responsible for overseeing the maintenance, support, and continuous improvement of the organizations software systems. This role ensures optimal performance, security, and reliability of all software applications while managing a team of engineers and collaborating with other departments to address business needs. Key Responsibilities 1. Software Maintenance & Support Oversee the maintenance and troubleshooting of enterprise applications, ensuring high availability and performance. Develop and implement strategies for proactive software monitoring, issue resolution, and system optimization. Lead the identification and resolution of bugs, security vulnerabilities, and performance bottlenecks. 2. Team Leadership & Management Manage and mentor a team of software maintenance engineers and support specialists. Define clear roles, responsibilities, and KPIs for the maintenance team. Foster a culture of continuous learning and process improvement. 3. Process Improvement & Automation Establish and optimize software maintenance processes, including version control, patch management, and rollback strategies. Identify opportunities for automation to improve efficiency and reduce downtime. Ensure adherence to ITIL best practices and industry standards. 4. Collaboration & Stakeholder Management Work closely with software development, infrastructure, and business teams to align maintenance strategies with organizational goals. Act as the escalation point for critical software issues impacting business operations. Communicate effectively with leadership on system health, risks, and improvement plans. 5. Security & Compliance •Ensure compliance with data security regulations and industry standards. •Oversee the implementation of security patches and software updates. •Conduct periodic audits to assess system vulnerabilities and risks. 6. Vendor & Third-Party Management Manage relationships with third-party software vendors and service providers. Oversee software licensing, renewals, and support contracts. Evaluate vendor performance and negotiate service-level agreements (SLAs). Key Requirements 1. Education & Experience Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field. 10+ years of experience in software maintenance, IT operations, or application support. 5+ years of leadership/management experience in a similar role. 2. Technical Skills Strong expertise in software maintenance methodologies, troubleshooting, and debugging. Proficiency in cloud platforms (AWS, Azure, or GCP), databases, and enterprise applications. Experience with monitoring tools, IT service management (ITSM) tools, and automation frameworks. Understanding of cybersecurity best practices and compliance frameworks. 3. Leadership & Soft Skills •Excellent leadership, communication, and stakeholder management skills. •Strong analytical and problem-solving capabilities. •Ability to work in a fast-paced and high-pressure environment. Preferred Qualifications •ITIL certification or relevant IT service management experience. •Experience in fintech, banking, or high-availability system environments. •Exposure to DevOps, CI/CD, and Agile methodologies. Skill set - Dev Ops, CI, CD, SRE
Posted 1 month ago
2.0 - 7.0 years
7 - 14 Lacs
Hyderabad
Work from Office
Position: Site Reliability Engineer Place of posting: Hyderabad OR Chennai Qualification: BTech or MTech Role Description A site reliability engineer (SRE) creates a bridge between development and IT operations by taking on the tasks typically done by operations. Instead, such tasks are given to these types of engineers who use automation tools to solve problems by creating scalable and reliable software systems. Standardization and automation are at the heart of what an SRE does. Responsibilities Developing software systems and automated solutions for operational aspect Seek to bridge the gap between operations and development teams to deliver software faster Writing and developing code to automate processes, such as analyzing logs, testing production environments and responding to any issues Able to shift between development and operations work and maintain a balance Participates in planning delivery time, code quality, and process efficiency improvement projects Identifies bottlenecks in development and deployment processes and designs automation solutions to mitigate Maintains and grows knowledge of platform configuration management, monitoring, and troubleshooting Building and integrating software tools to enhance an organizational system’s reliability and scalability Domain Experience - 3+ years of proven tech experience with deployment of automation solutions Bachelors in Computer Science (or related field) Experience with Data center operations (DCOps) Proven work experience in installing, configuring, and troubleshooting Linux based environments Experience with deploying application on containers (Docker and Kubernetes) in CI/CD environments Experience on Automation (Configuration Management) using Shell Scripting, Python and Ansible Experience with continuous integration and related tools such as GitlabCI, Jenkins, Hudson, Maven, Ant, Git, Sonar, etc. Familiar with security automation tools such as static application security testing etc. Comfortable using tracking tools e.g. Jira, Trello Basin understanding on monitoring and log management tools like, Splunk, Grafana, Prometheus, Kibana e.t.c., Agile/Digital Experience Experience as a SRE Engineer on a cross-functional agile team preferred Proven experience across testing, integration, source code management, deployment and containerization in agile team Individual Skills Strong communication skills with ability to communicate complex technical concepts and align organization on decisions Utilizes team collaboration to create innovative solutions efficiently
Posted 1 month ago
10.0 - 15.0 years
35 - 40 Lacs
Noida
Work from Office
Job Summary: We are seeking a highly skilled and experienced DevOps Architect / Senior DevOps Engineer with 10+ years of expertise in designing, implementing, and managing robust DevOps ecosystems across AWS , Azure , and GCP . The ideal candidate will possess a deep understanding of cloud infrastructure, automation, CI/CD pipelines, container orchestration, and infrastructure as code. This role is both strategic and hands-ondriving innovation, scalability, and operational excellence in cloud-native environments. Key Responsibilities: Architect and manage DevOps solutions across multi-cloud platforms (AWS, Azure, GCP) . Build and optimize CI/CD pipelines and release management processes. Define and enforce cloud-native best practices for scalability, reliability, and security. Design and implement Infrastructure as Code (IaC) using tools like Terraform , Ansible , CloudFormation , or ARM templates . Deploy and manage containerized applications using Docker and Kubernetes . Implement monitoring, logging, and alerting frameworks (e.g., ELK, Prometheus, Grafana, CloudWatch). Drive automation initiatives and eliminate manual processes across environments. Collaborate with development, QA, and operations teams to integrate DevOps culture and workflows. Lead cloud migration and modernization projects. Ensure compliance, cost optimization, and governance across environments. Required Skills & Qualifications: 10+years of experience in DevOps / Cloud / Infrastructure / SRE roles. Strong expertise in at least two major cloud platforms ( AWS , Azure , GCP ) with working knowledge of the third. Advanced knowledge of Docker , Kubernetes , and container orchestration. Deep understanding of CI/CD tools (e.g., Jenkins, GitLab CI, Azure DevOps, ArgoCD). Hands-on experience with IaC tools : Terraform, Ansible, Pulumi, etc. Proficiency in scripting languages like Python , Shell , or Go . Strong background in networking , cloud security , and cost optimization . Experience with DevSecOps and integrating security into DevOps practices. Bachelor's/Master's degree in Computer Science, Engineering, or related field. Relevant certifications preferred (e.g., AWS DevOps Engineer, Azure DevOps Expert, Google Professional DevOps Engineer). Preferred Skills: Multi-cloud or hybrid cloud experience. Exposure to service mesh , API gateways , and serverless architectures . Familiarity with GitOps , policy-as-code , and site reliability engineering (SRE) principles. Experience in high-availability, disaster recovery, and compliance (SOC2, ISO, etc.). Agile/Scrum or SAFe experience in enterprise environments.
Posted 1 month ago
8.0 - 13.0 years
14 - 24 Lacs
Hyderabad, Pune, Bengaluru
Work from Office
Location Hyderabad , Pune , Bangalore Job Description - 8 years of experience in Java/.NET based application support like Issues Resolution and Incident management . RCA Creation. Strong trouble shooting skills in debugging multiarchitecture systems and experience with microservices architecture patterns Devops and Cloud computing (GCP/AWS) Very strong communication and stakeholder coordination skill Experience in Altering and Monitoring which includes thousand eyes monitoring, Splunk alerts monitoring, google cloud alerts monitoring. Experience in Managing CI/CD pipeline deployments using harness and bamboo. GIT. Experience working with containers e.g., Docker, Kubernetes, Cloud Foundry, etc Deep knowledge of Internet protocols and web services technologies e.g., HTTP, DNS, TCP/UDP, SOAP, JSON and REST Unix Shell Scripting or any programming language mandatory
Posted 1 month ago
4.0 - 9.0 years
15 - 30 Lacs
Chennai
Hybrid
ACV Auctions is looking for an experienced Site Reliability Engineer III with a systems and software engineering background to focus on site reliability. We believe in taking a software engineers approach to operations by providing standards and software tools to all engineering projects. As a Site Reliability Engineer, you will split your time between developing software that improves overall reliability and providing operational support for production systems. What you will do: Maintain reliability and performance for your particular infrastructure area while working with software engineers to improve service quality and health. Develop, design, and review new software tools in Python & Java to improve infrastructure reliability and provide services with better monitoring, automation, and product delivery. Practice efficient incident response through on-call rotations alongside software engineers and document incidents through postmortems. Support service development with capacity plans, launch/deployment plans, scalable system design, and monitoring plans. What you will need: BS degree in Computer Science or a related technical discipline or equivalent practical experience. Experience building/managing infrastructure deployments on Google Cloud Platform 3+ years managing cloud infrastructure. Experience programming in at least one of the following: Python or Java You are experienced in Linux/Unix systems administration, configuration management, monitoring, and troubleshooting. You are comfortable with production systems including load balancing, distributed systems, microservice architecture, service meshes, and continuous delivery. Experience building and delivering software tools for monitoring, management, and automation that support production systems. Comfortable working with teams across multiple time -zones and working flexible hours as needed. Preferred Qualifications Experience maintaining and scaling Kubernetes clusters for production workloads is a plus
Posted 1 month ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
39581 Jobs | Dublin
Wipro
19070 Jobs | Bengaluru
Accenture in India
14409 Jobs | Dublin 2
EY
14248 Jobs | London
Uplers
10536 Jobs | Ahmedabad
Amazon
10262 Jobs | Seattle,WA
IBM
9120 Jobs | Armonk
Oracle
8925 Jobs | Redwood City
Capgemini
7500 Jobs | Paris,France
Virtusa
7132 Jobs | Southborough