Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
3.0 - 7.0 years
7 - 11 Lacs
pune
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
3.0 - 7.0 years
7 - 11 Lacs
gurugram
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
3.0 - 7.0 years
7 - 11 Lacs
lucknow
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
3.0 - 7.0 years
7 - 11 Lacs
coimbatore
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
3.0 - 7.0 years
7 - 11 Lacs
bengaluru
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
3.0 - 7.0 years
7 - 11 Lacs
jaipur
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
3.0 - 7.0 years
7 - 11 Lacs
kochi
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
3.0 - 7.0 years
7 - 11 Lacs
kolkata
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
3.0 - 7.0 years
7 - 11 Lacs
noida
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
3.0 - 7.0 years
7 - 11 Lacs
mumbai
Work from Office
Ensure system reliability and uptime Develop monitoring and alerting tools Automate operational tasks
Posted 4 days ago
10.0 - 12.0 years
10 - 20 Lacs
chennai
Work from Office
Requirements Elicitation, Understanding, Analysis, & Management • Understand the project's Vision and requirements, and contribute to the creation of the supplemental requirements, building the low-level technical specifications for a particular platform and/or service solution. Project Planning, Tracking, & Reporting • Estimate the tasks and resources required to design, create (build), and test the code for assigned module(s). • Provide inputs in creating the detailed schedule for the project. • Support the team in project planning activities, in evaluating risks, and shuffle priorities based on unresolved issues. • During development and testing, ensure that assigned parts of the project/modules are on track with respect to schedules and quality. • Note scope changes within the assigned modules and work with the team to shuffle priorities accordingly. • Communicate regularly with the team about development changes, scheduling, and status. • Participate in project review meetings. • Tracking and reporting progress for assigned modules Design: • Create a detailed (LLD) design for the assigned piece(s) with possible alternate solutions. • Ensure that LLD design meets business requirements. • Submit the LLD design for review. • Fix the detailed (LLD) design for the assigned piece(s) for the comments received from team. Development & Support • Build the code of high-priority and complex systems according to the functional specifications, detailed design, maintainability, and coding and efficiency standards. • Use code management processes and tools to avoid versioning problems. • Ensure that the code does not affect the functioning of any external or internal systems. • Perform peer reviews of code to ensure it meets coding and efficiency standards. • Act as the primary reviewer to review the application code created by software engineers to ensure compliance to defined standards. Recommend changes to the code as required. Testing & Debugging • Attend the Test Design walkthroughs to help verify that the plans and conditions will test all functions and features effectively. • Perform impact analysis for issues assigned to self and Software Engineers /Sr Engineers. • Actively assist with project- and code-level problem solving, such as suggesting paths to explore when testing engineers or software engineers encounter a debugging problem, and escalate urgent issues. Documentation • Review technical documentation for the code for accuracy, completeness, and usability. • Document and maintain the reviews conducted and the unit test results. Process Management • Adhere to the project and support processes. • Adhere to best practices and comply with approved policies, procedures, and methodologies, such as the SDLC cycle for different project sizes. • Shows responsibility for corporate funds, materials and resources. • Ensure adherence to SDLC and audits requirements. • Adhere to best practices and comply with approved policies, procedures, and methodologies. Coaching and Mentoring • Act as a technical subject matter expert for the internal team on areas such as system functionality and approach including solving systems operations issues, performance initiatives. Leverage existing knowledge and expertise in multiple ways. • Build team skills using formal and/or informal training sessions. • Create and maintain knowledge repositories for lessons learnt and developments in the respective domains. Lead CI/CD design and standardization across projects (Jenkins, Argo CD, GitHub) Own end-to-end cloud infra with Terraform, AWS, EKS Improve automation coverage using Python/Shell/Puppet Manage deployment strategies: rolling, blue/green, canary Set up and optimize dashboards, alerts, and log pipelines in Splunk Perform advanced troubleshooting and root cause analysis Participate in on-call with full L2/L3 ownership Lead operational reviews and provide RCA after critical incidents Mentor juniors and review their code/scripts/infra designs Location: This position can be based in any of the following locations: Chennai For internal use only: R000107396
Posted 4 days ago
4.0 - 9.0 years
11 - 21 Lacs
hyderabad, pune, bengaluru
Hybrid
We are seeking skilled and proactive Infra Security Engineers to join our growing team. This role is critical to ensuring the security and integrity of our infrastructure through scripting, automation, and compliance monitoring. The ideal candidate will have a strong foundation in Unix/Linux systems, scripting languages, and a passion for security automation. Key Responsibilities: Develop and maintain shell scripts and Python programs to automate security tasks and system checks. Administer and secure Unix/Linux environments , ensuring best practices are followed. Implement and manage security automation frameworks to streamline compliance and vulnerability management. Perform vulnerability assessments and policy compliance checks using tools like Qualys PC . Collaborate with cross-functional teams to identify and remediate security gaps. Support incident response activities and contribute to forensic investigations when required. Stay updated with the latest security trends, threats, and technologies to proactively enhance infrastructure security. Mandatory Skills: Shell Scripting for automation and system-level tasks. Unix/Linux Administration strong hands-on experience with system configuration and security. Python Programming – for building scalable automation and integration scripts. Security Automation Tools – experience with frameworks or custom-built solutions. Qualys Policy Compliance (PC) – hands-on experience with compliance scanning and reporting. Exposure to DevSecOps practices and integration of security into CI/CD pipelines. Ideal Candidate Profile: 3 to 9 years of relevant experience in infrastructure security roles. Strong analytical and problem-solving skills. Ability to work independently and in a team-oriented, collaborative environment. Excellent communication skills to interact with technical and non-technical stakeholders.
Posted 4 days ago
4.0 - 5.0 years
9 - 14 Lacs
kochi
Work from Office
Job Summary We are seeking a skilled Linux Administrator to manage, maintain, and optimize Linux-based systems and servers. The ideal candidate should have strong expertise in Linux administration, system monitoring, and troubleshooting to ensure smooth operations and high availability of enterprise systems. Key Responsibilities Install, configure, and maintain Linux servers and applications. Monitor system performance, troubleshoot issues, and ensure high availability. Manage user accounts, permissions, and security policies. Perform system backups, patch management, and upgrades. Automate routine tasks using scripting (Shell, Python, or Perl). Collaborate with IT, DevOps, and application teams for seamless operations. Maintain documentation for system configurations, procedures, and troubleshooting steps.
Posted 4 days ago
3.0 - 8.0 years
3 - 7 Lacs
chennai
Work from Office
About The Role Project Role : Application Support Engineer Project Role Description : Act as software detectives, provide a dynamic service identifying and solving issues within multiple components of critical business systems. Must have skills : EPIC Systems Good to have skills : NAMinimum 3 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Application Support Engineer, you will act as software detectives, providing a dynamic service that identifies and resolves issues within various components of critical business systems. Your typical day will involve collaborating with team members to troubleshoot software problems, ensuring that systems operate smoothly and efficiently, and contributing to the overall improvement of application performance. You will engage with users to understand their challenges and work diligently to implement effective solutions, all while maintaining a focus on delivering high-quality service to support business operations. Roles & Responsibilities:- Expected to perform independently and become an SME.- Required active participation/contribution in team discussions.- Contribute in providing solutions to work related problems.- Assist in the documentation of processes and solutions to enhance team knowledge.- Engage with stakeholders to gather requirements and feedback for continuous improvement. Professional & Technical Skills: - Must To Have Skills: Proficiency in EPIC Systems.- Must To Have Skills: Willow Inpatient- Good To Have Skills: Experience with application performance monitoring tools.- Strong analytical skills to diagnose and resolve software issues.- Familiarity with database management and query optimization.- Ability to work collaboratively in a team-oriented environment. Additional Information:- The candidate should have minimum 3 years of experience in EPIC Systems.- This position is based at our Chennai office.- A 15 years full time education is required. Qualification 15 years full time education
Posted 4 days ago
3.0 - 8.0 years
1 - 5 Lacs
bengaluru
Work from Office
About The Role Project Role : Infra Tech Support Practitioner Project Role Description : Provide ongoing technical support and maintenance of production and development systems and software products (both remote and onsite) and for configured services running on various platforms (operating within a defined operating model and processes). Provide hardware/software support and implement technology at the operating system-level across all server and network areas, and for particular software solutions/vendors/brands. Work includes L1 and L2/ basic and intermediate level troubleshooting. Must have skills : Red Hat OS Administration Good to have skills : NAMinimum 3 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Infra Tech Support Practitioner, you will engage in the ongoing technical support and maintenance of production and development systems and software products. Your typical day will involve addressing various technical issues, providing both remote and onsite assistance, and ensuring that configured services operate smoothly across multiple platforms. You will work within a defined operating model and processes, focusing on delivering effective solutions to enhance system performance and reliability. Roles & Responsibilities:- Expected to perform independently and become an SME.- Required active participation/contribution in team discussions.- Contribute in providing solutions to work related problems.- Assist in the implementation of technology at the operating system level across all server and network areas.- Conduct basic and intermediate level troubleshooting for hardware and software issues. Professional & Technical Skills: - Must To Have Skills: Proficiency in Red Hat OS Administration.- Strong understanding of server and network management.- Experience with system monitoring and performance tuning.- Familiarity with virtualization technologies and cloud services.- Knowledge of scripting languages for automation tasks. Additional Information:- The candidate should have minimum 3 years of experience in Red Hat OS Administration.- This position is based at our Bengaluru office.- A 15 years full time education is required. Qualification 15 years full time education
Posted 4 days ago
1.0 - 4.0 years
3 - 7 Lacs
bengaluru
Work from Office
About The Role Project Role : Application Support Engineer Project Role Description : Act as software detectives, provide a dynamic service identifying and solving issues within multiple components of critical business systems. Must have skills : AWS Architecture Good to have skills : NAMinimum 5 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Application Support Engineer, you will act as software detectives, providing a dynamic service that identifies and solves issues within multiple components of critical business systems. Your typical day will involve collaborating with various teams to troubleshoot and resolve application-related challenges, ensuring the smooth operation of essential services and systems. You will engage in problem-solving activities, analyze system performance, and contribute to the continuous improvement of application support processes, all while maintaining a focus on delivering high-quality service to stakeholders. Roles & Responsibilities:- Expected to be an SME.- Collaborate and manage the team to perform.- Responsible for team decisions.- Provide extended coverage to address global operational issues across EMEA, NA, LA, and APJ regions.- Onboard & align upcoming Cloud Providers CDC, Ali Cloud and Stack IT into existing harmonized processes.- Act as backup support for current platform (AWS, GCP, Azure, SAP DC) vertical owners to ensure continuity.- Coordinate with AWS, GCP for capacity reservations, rescheduling, and cost tracking.- Contribute to MSR and QBR reporting with relevant system performance data.- Track and close decommissioning and cleanup tasks, including unused VMs and storage.- Engage with multiple teams and contribute on key decisions.- Provide solutions to problems for their immediate team and across multiple teams.- Facilitate knowledge sharing sessions to enhance team capabilities.- Monitor application performance and proactively address potential issues. Professional & Technical Skills: - Must To Have Skills: Proficiency in AWS Architecture.- Strong understanding of cloud computing principles and best practices.- Coordinate with AWS, GCP for capacity reservations, rescheduling, and cost tracking.- Knowledge of networking concepts and security best practices in cloud environments. Additional Information:- The candidate should have minimum 5 years of experience in AWS Architecture.- This position is based at our Bengaluru office.- A 15 years full time education is required. Qualification 15 years full time education
Posted 4 days ago
4.0 - 7.0 years
9 - 13 Lacs
bengaluru
Work from Office
As an Associate level Infrastructure Specialist at IBM you will support the infrastructure running industries likes transportation, energy, insurance, banking or healthcare which are rapidly changing as the worlds relationship with technology evolves. Ready to help our clients take the next step forward? Companies have more choices than ever before between on-premise, off-premise, or a hybrid approach. As an Associate Infrastructure Specialist, you'll be responsible for keeping up with the latest changes, using your expertise to deliver solutions that meet the needs of our clients and products. In your role you may be responsible for: Define, analyze and review technical architecture on required platform and coming up with architecture options and recommendations Define, detail, and scope the technical requirements into solutions Work across key activities in configuration, systems management tools and backup and recovery Support Technical Consultants and lead in building solutions and providing technical mentoring and guidance Required education Bachelor's Degree Preferred education Master's Degree Required technical and professional expertise Minimum 5years hands on experience in implementing and administering AWS Ability to work both analytically using data and systems with a logical sequence to complex tasks Strong knowledge on server migration from legacy to Cloud AWS Familiarity with various cloud service providers such as AWS, Azure, IBM, Google, Alibaba, Oracle and with cloud computing concepts Experience provisioning, operating, and maintaining systems running on AWS Preferred technical and professional experience Initiative to actively seek new knowledge and improve skills Exposure to basic data science concepts Ability to handle multiple tasks concurrently and meet deadlines, while maintaining focus in an environment with conflicting demands
Posted 4 days ago
4.0 - 6.0 years
9 - 14 Lacs
gurugram
Work from Office
Job Profile Summary Provides administration for cloud computing platforms, networks, and systems. Responsible for delivering a great customer experience. Serves as an escalation point to provide technical support to customers over chat, phone and via support tickets. Responsible for responding to the Rackspace global support ticket queues and completing first line resolution to issues in scope. Expected to follow process, display good judgment in decisions and to create and maintain customer loyalty by going above and beyond the customers expectation. Responsible for adhering to company security policies and procedures and any other relevant policies and standards as directed. Career Level Summary Requires working knowledge and skills to perform a defined set of analytical scientific or operational processes Applies experience and skills to complete assigned work within own area of expertise Leverages standard operating procedures and/or scientific methods Works with a moderate degree of supervision Critical Competencies Service Delivery Effectiveness: Understands where service gaps can occur within scope of own work Value Analysis: Provides customers with basic, standard information regarding products/offerings Knowledge: Developing OS troubleshooting knowledge for Linux and Windows Developing expertise in a cloud computing platform, such as AWS Developing knowledge to provide increased level of investigation into issues such as application servers, distributions, hosting servers, database servers, user audits, patches, and upgrades. Developing understanding of OS specific webhosts and database technologies, such as MSSQL/IIS for Windows or MYSQL/APACHE for Linux Basic ability with cross platform troubleshooting tasks such as virtualization, containers, disk storage, encryption, security, network connectivity, NFS, DNS, SSL/TLS, firewalls, and load balancers Basic knowledge of DevOps and/or Micro-services with at least one technology including Chef, Puppet, Ansible, Docker, Kubernetes, Azure Container Service etc Basic understanding of patching - documents changes based on requests for change Basic ability to apply change control procedures Requires broadened technical skills in analytical/ scientific methods or operational processes to perform a defined array of activities Understands how the team integrates with others to accomplish the team objectives Key Responsibilities Other Incidental tasks related to the job, as necessary. Resolve or escalate level-appropriate technical issues for customers in accordance with team playbook guidelines via phone and ticketing Secure, administer, and improve customer technical issues which can include cloud platform and infrastructure services, user management and permissions, or other software issues Troubleshoot monitoring alerts and create tickets accordingly Act as an escalation point for techs inside and outside the team encouraging peers to participate in problem solving Escalate support requests according to escalation procedures Perform incident management identification, assist in managing and escalation Ensure adherence to customer & SLA commitments Manage personal ticket cue and monitor ticket response times and take appropriate actions to ensure team response time targets are met Collaborate with Account Managers and Business Development Consultants to build strong customer relationships Collaborate and share knowledge with other administrators on the support floor Provide Fanatical Experience to customers in all the above Skills: Critical Competencies 4-6 years of Cloud or System Operations Administration experience in a client-centric ticket queue environment Self-motivated with a strong desire to learn and improve both technical and people skills Strong verbal and written communication skills and the ability to communicates basic technical information with team members Strives for performance improvements in oneself and peers Leads by example and motivates team members Organizational skills with the ability to provide quality at pace Ability to handle multiple tasks and prioritize work under pressure Ability to work at a team level as well as an individual level Ability to interact confidently with more senior and/or skilled areas of the business Able to communicate constructive feedback effectively Ability to adapt to changing business and technology requirements. Sound problem solving and troubleshooting skills
Posted 5 days ago
8.0 - 13.0 years
15 - 22 Lacs
vadodara
Work from Office
Job Summary: We are looking for a skilled and proactive Linux Engineer with strong communication skills and hands-on experience in Linux systems. The ideal candidate should also possess expertise in at least one of the following areas: Storage systems (Isilon, NetApp, Pure), Backup solutions (Commvault), Virtualization platforms (VMware), or Hyperconverged infrastructure (Nutanix). You will be responsible for maintaining, troubleshooting, and optimizing enterprise Linux environments. Must Have Skills: Excellent verbal and written communication skills Strong proficiency in Linux system administration (RHEL and Ubuntu) Basic scripting (Bash or equivalent) VMware ESXi and vCenter administration Good To Have Skills: Nutanix cluster operations and troubleshooting Experience with Storage systems: Dell EMC Isilon, NetApp, PureStorage Commvault Backup configuration and maintenance ITIL process familiarity (Incident, Change, and Problem Management) Automation with Ansible or similar tools Key Responsibilities: Manage and maintain Linux-based systems and servers in an enterprise environment Troubleshoot and resolve system issues, performance bottlenecks, and hardware failures Collaborate with storage, backup, and virtualization teams on cross-platform integrations Participate in patching, upgrades, and routine system maintenance Document procedures, configurations, and changes accurately Support 24x7 production environment as part of an on-call rotation Ensure system security, backup, and redundancy strategies are followed Certifications (Preferred but not Mandatory: Red Hat Certified System Administrator (RHCSA) / Engineer (RHCE) VMware Certified Professional (VCP) Commvault Certified Professional Nutanix Certified Professional NetApp Certified Data Administrator (NCDA)
Posted 5 days ago
8.0 - 12.0 years
15 - 18 Lacs
bengaluru
Remote
Reporting to the Cloud Team Leader and will be working with a team of systems administrators to manage and maintain the client infrastructure using cutting-edge technologies and real-time data. Collaborate with the teams to ensure best practices are maintained, good technical solutions are in place and provide third level user support for all network and PC related issues. You will be part of a fast-paced agile environment and required to deliver quality projects and services in accordance with company strategic timelines You will be working for our Australia based client. Established in 2002, they strive to modernize the movement of goods and provide supply chain participants the best on the go IT solutions and services. They support organizations across the globe, connecting people, goods & technology and their mission is to deliver seamless, secure, real-time data fueled connections that power the logistics of delivery. REQUIRED COMPETENCIES: Technical knowledge of AWS namely, RDS, VPC, EKS, ECS, Ec2, AWS CLI, S3, CloudWatch, etc Experience with Containerisation and operating Kubernetes (EKS) Superior troubleshooting and problem-solving skills AWS network configuration and security best practice Python/bash Linux experience Experience with CI/CD, automation, and configuration management Experience in infrastructure orchestration using CloudFormation or Terraform Ability to troubleshoot and diagnose issues within mixed software, hardware, network, and database environments Ability to communicate effectively both written and verbal to other teams on security related matters Amenable to provide after-hours support. DESIRED COMPETENCIES: AWS solution architect certification Linux Certification Experience working for an offshore company Ansible QUALIFICATIONS: Candidate must possess at least a Bachelors/College Degree in Computer Science, Information Technology, Engineering (Computer/Telecommunication), or equivalent experience. Solid background in developing and supporting the base infrastructure in AWS and managing Linux environment.
Posted 5 days ago
6.0 - 10.0 years
0 Lacs
thiruvananthapuram, kerala
On-site
Are you passionate about backend operations, Linux systems, and enterprise infrastructure Claidroid Technologies is hiring a dynamic Linux & Backend Systems Specialist to support critical application and server operations for our customer. As a Backend Systems Specialist, you'll play a key role in managing Linux-based servers and backend applications across environments, contributing to system upgrades, performance monitoring, automation, and ticket resolution. You will be responsible for the administration of Linux servers both on-premises and in AWS, including patching and upgrades. Additionally, you will provide support for backend systems deployments across Test, Pre-prod, and Production environments. Monitoring system performance using tools like BMC Patrol, True Sight, Xmatters, and acting on alerts will be a crucial part of your role. Managing Control-M and Microfocus job scheduling, along with user account and identity management, will also be within your responsibilities. Automation of operational workflows using Ansible and custom bash/Python scripts will be part of your daily tasks. You will also be required to prepare technical documentation and handle incidents via Jira and ServiceNow. Working on L1/L2/L3 tickets, performing system setups, and supporting business continuity during shifts are essential aspects of this position. To be successful in this role, you should have 6+ years of hands-on experience in Linux environments and server administration. Proficiency with Control-M, Ansible, and Linux scripting (bash/Python) is required. Knowledge of Jira, ServiceNow, and Confluence is also necessary. Flexibility for 24x7 shift work, with a strong teamwork and ownership mindset, is crucial. Excellent communication, documentation, and troubleshooting skills are highly valued. Having an RHSE/RHCSA Certification and exposure to AWS/Azure, Jenkins, Kubernetes, Microfocus tools, and Agile methodologies (Scrum/KanBan) would be advantageous. Familiarity with Git/SVN and automation development experience are considered good to have. Join Claidroid Technologies and be part of mission-critical backend ops supporting a global financial services leader. Thrive in a team-driven, automation-focused culture with hybrid flexibility. Access growth through career development, international exposure, and continuous learning. We offer performance-based bonuses, healthcare, work-from-abroad options, and more. If you are ready to own the backend, enable transformation, and care for tomorrow together, apply now. To apply, kindly share: - Full Name - Contact Details (Email, Mobile) - Total & Relevant Experience - Current Employer & CTC - Expected CTC - Location Preference - Notice Period Email your details to talent.acquisition@claidroid.com or visit www.claidroid.com for more information.,
Posted 5 days ago
2.0 - 6.0 years
0 Lacs
hyderabad, telangana
On-site
As a Technology Operations Engineer at our organization, you will play a crucial role in supporting our Technology Operations Center (TOC). Your main responsibilities will include monitoring, maintaining, and troubleshooting our platforms. You will collaborate closely with various teams, such as product management, engineering, information technology, customer success, and security, to drive innovation and enhance efficiency in our SaaS operations. Your duties will involve operating and supporting both private and public cloud platforms. Within the Technical Operations Center, you will handle day-to-day activities like incident management, monitoring, and escalation procedures. You will be part of a 24/7 schedule rotation and coordinate with other teams during critical incidents and outages. Additionally, you will perform deployments, maintenance tasks, data updates, and other necessary duties. Managing our incident management system will be a key aspect of your role. You will identify and troubleshoot anomalies, performance bottlenecks, and connectivity issues. As the primary point of contact for incidents, you will follow standard operating procedures to restore services and escalate complex issues to the appropriate teams and stakeholders. Collaboration with cross-functional teams will be essential to implement best practices, standards, and policies related to technical operations at scale. To be successful in this role, you should have a Bachelor's degree in computer science, information technology, or a related field, OR a CompTIA A+ or Net+ certification, OR at least 4 years of relevant experience. You should also possess 2+ years of experience in operating mission-critical software and services, as well as in 24x7x365 technology operations. An understanding of DevOps principles, experience with on-call and incident management systems, and proficiency with modern observability tools are also required. Preferred qualifications include proficiency with Infrastructure as Code (IaC) tools like Terraform or CloudFormation, experience with Security Operations or working in a SOC, and familiarity with administering Linux and Windows servers in a cloud or datacenter environment. This role involves working in 24/7 rotational shifts, including evenings, late nights, and weekends. The position will follow a hybrid work mode, with at least two days required in the office each week, based in Hyderabad or Gurugram. Please note that this job description may not encompass all activities, duties, or responsibilities required of the employee. Duties and responsibilities are subject to change at any time with or without notice.,
Posted 5 days ago
9.0 - 13.0 years
0 Lacs
karnataka
On-site
At Capgemini Engineering, a leading global provider of engineering services, you will join a diverse team of engineers, scientists, and architects dedicated to empowering the world's most innovative companies. From cutting-edge technologies like autonomous vehicles to life-saving robots, our digital and software experts are committed to delivering unique R&D and engineering solutions across various industries. A career at Capgemini Engineering offers you a multitude of opportunities to make a difference every day. As a Site Reliability Engineer (SRE) within our organization, your primary responsibility will be to design, implement, and maintain scalable and reliable compute infrastructure. You will focus on Wintel, Linux, VMWare, and Redhat KVM environments, collaborating closely with development teams to ensure that applications are optimized for performance and reliability across different operating systems and virtualization platforms. Key responsibilities include automating repetitive tasks to enhance efficiency and reduce manual intervention, particularly within Wintel and Linux systems. You will also be tasked with monitoring system performance, identifying bottlenecks, and implementing solutions to enhance overall system reliability within VMWare and Redhat KVM environments. Additionally, you will develop and maintain tools tailored to deployment, monitoring, and operations specific to Wintel, Linux, VMWare, and Redhat KVM systems. As part of your role, you will troubleshoot and resolve issues in development, test, and production environments with a focus on compute-related challenges. Participation in on-call rotations and prompt incident response to ensure high availability of compute resources is crucial. You will be expected to implement best practices for security, compliance, and data protection within Wintel, Linux, VMWare, and Redhat KVM systems. Documenting processes, procedures, and system configurations related to the compute infrastructure will also be an essential aspect of your role. The ideal candidate will possess proficiency in Wintel Administration, Linux Administration, VMWare Administration, and Redhat. Strong scripting skills in languages such as Python, Java, C/C++, and Bash are required, along with experience in infrastructure tools like Terraform and Ansible. Familiarity with monitoring and logging tools such as Prometheus, Grafana, and ELK stack is essential. A solid understanding of networking, security, and system administration within Wintel and Linux environments is expected, along with experience in CI/CD pipelines and tools like Jenkins and GitLab CI. Knowledge of database management systems like MySQL and PostgreSQL will be advantageous. Join us at Capgemini, a global leader in business and technology transformation, and be a part of a responsible and diverse team of over 340,000 professionals in more than 50 countries. With a rich heritage spanning over 55 years, Capgemini is trusted by clients worldwide to leverage technology to meet their business objectives comprehensively. Our end-to-end services and solutions encompass strategy, design, engineering, and more, driven by our expertise in AI, generative AI, cloud, and data, complemented by deep industry knowledge and a robust partner ecosystem.,
Posted 5 days ago
4.0 - 8.0 years
8 - 12 Lacs
navi mumbai
Work from Office
Job Description : IT & Network Engineer Location: Navi Mumbai About the Role: As a Linux, VMware, Storage & SAN Administrator, you will be responsible for managing and maintaining enterprise-level systems including Linux servers, VMware virtual environments, storage arrays, and SAN switches. You will ensure high availability, performance, and security across the infrastructure while supporting business continuity and disaster recovery strategies. Key Responsibilities: Server Installation, Configuration & Hardware Monitoring & Maintenance Rack, stack, and cable physical servers in data center environments. Install and configure server hardware (Dell, HP, Lenovo, Cisco UCS, etc.). Perform BIOS/firmware updates and configure RAID/storage controllers. Install operating systems (Linux, Windows Server) and hypervisors (VMware ESXi). Monitor server health using tools like iDRAC, HP iLO, or Lenovo XClarity. Replace faulty components (RAM, HDD/SSD, power supplies, fans). Maintain inventory of server hardware and spares. Schedule and perform preventive maintenance. Diagnose hardware issues and coordinate with vendors for RMA or support. Respond to alerts and incidents related to physical server failures. Work with network and storage teams to resolve infrastructure issues. Implement access controls and audit logs for server access. Maintain compliance with organizational and regulatory standards (ISO, PCI-DSS). Generate reports on server uptime, performance, and incidents. Collaborate with IT teams for capacity planning and hardware lifecycle management. Linux Administration Install, configure, and maintain Linux servers (Red Hat). Perform system updates, patching, and kernel upgrades. Automate tasks using scripting languages (Bash, Python, Perl). Monitor system performance and troubleshoot OS-level issues. Harden Linux systems by configuring firewalls, SELinux, auditd, and secure SSH practices. Monitor logs and system activity for suspicious behaviour. Ensure patch compliance and secure configurations across all Linux distributions. Manage user accounts, permissions, and security policies. VMware Administration Manage VMware vSphere environments including ESXi hosts and vCenter (distributed switch & standard switch). Ensure compliance with security and performance standards. Deploy, configure, and monitor virtual machines. Perform capacity planning and resource allocation. Apply patches and firmware upgrades to VMware infrastructure. Secure vSphere environments including ESXi hosts and vCenter. Implement role-based access control (RBAC), secure VM templates, and encrypted vMotion. Conduct configuration reviews and vulnerability scans on virtual infrastructure. Ensure compliance with security and performance standards 2. Storage Management Administer SAN/NAS storage systems (Dell EMC). Provision and optimize storage volumes and arrays. Implement and manage backup and recovery solutions. Monitor storage performance and conduct regular audits. Ensure compliance with data protection standards (e.g., NIST, ISO 27001). Document configurations and storage policies. SAN Switch Administration Configure and manage SAN switches (Brocade). Perform zoning, provisioning, and troubleshooting. Monitor SAN fabric health and performance. Support firmware upgrades and patching. Monitor storage traffic and audit logs for anomalies. Ensure compliance with data protection standards (e.g., NIST, ISO 27001). Collaborate with vendors for support and escalations 3. Qualifications: Certifications: VMware Certified Professional (VCP), Red Hat Certified Engineer (RHCE), CompTIA Storage+, ITIL (preferred). Strong understanding of virtualization, storage protocols (iSCSI, NFS, SMB), and networking basics. Skills Required: Linux OS (Red Hat) VMware vSphere, ESXi, vCenter SAN/NAS technologies Scripting: Bash, Shell & PowerShell Backup & Recovery Tools System monitoring and performance tuning. SAN switch configuration and troubleshooting. Hands-on experience with server hardware (Dell, HP, Lenovo, Cisco UCS). Familiarity with server monitoring tools and remote management interfaces. Basic understanding of networking, storage, and virtualization.
Posted 5 days ago
5.0 - 10.0 years
6 - 10 Lacs
bengaluru
Work from Office
The role of Engineering Manager - Site Reliability, is to primarily manage, mentor and develop a team of Site Reliability Engineers, ensuring the development of both (the individual and team as a whole) are in line with organizational objectives and direction. Manages all activities in scope through the direction of activities, to design new products and modify existing designs, ensuring that deliverables are on time and with acceptable quality. The role holder is required to analyze technology trends, human resource needs, and market demand to plan projects to ensure resilience in line with current demand and future ambition. In addition to this, the role will confer with leaders, production, key stakeholders and marketing teams to determine engineering feasibility, cost effectiveness, scalability and time-to-market for new and existing products. FinTech is a complex, competitive and exciting industry. To accomplish Booking.coms mission (making it easier for everyone to experience the world), we aim to offer frictionless payment experiences to our guests and partners. The FinTech business unit creates best in class payment products that offer choice to guests and help Bookings business partners grow their business. What youll be doing: Managing People Inspire, grow and develop individuals by helping the creation of their personal development plan, leveraging available learning resources and offering stretch opportunities. Get things done in the right way by taking ownership, being proactive and collaborating with business counterparts, peers, other craft managers and stakeholders. Ensure delivery by tracking team health metrics and KPIs, monitoring roadmap progress, identifying blockers and resolving or escalating them. End to End System Ownership Own a service end to end by actively monitoring application health and performance, setting and monitoring relevant metrics and act accordingly when violated. Reduce business continuity risks and bus factor by applying state-of-the-art practices and tools, and writing the appropriate documentation such as runbooks and OpDocs. Independently manage an application or service by working through deployment and operations in production and guide more junior members of the team in this topic. Technical Incident Management Address and resolve live production issues by mitigating the customer impact within SLA. improve the overall reliability of systems by producing long term solutions through root cause analysis. Keep track of incidents by contributing to postmortem processes and logging live issues. Building software applications Build software applications by using relevant development languages and applying knowledge of systems, services and tools appropriate for the business area. Write readable and reusable code by applying standard patterns and using standard libraries. Refactor and simplify code by introducing design patterns when necessary. Ensure the quality of the application by following standard testing techniques and methods that adhere to the test strategy. Maintain data security, integrity and quality by effectively following company standards and best practices. Architectural Guidance Has sufficient knowledge to advise product teams towards a technical solution that meets the functional, nonfunctional & architectural requirements by challenging the rationale for an application design and providing context in the wider architectural landscape Set a clear direction for a technical capability by evaluating and aligning the target architecture improvements, reframing architectural designs and decisions for varied stakeholders. What youll bring: Strong people management skills and experience; Excellent communicator with strong stakeholder management experience, good commercial awareness and technical vision; You are a humble and thoughtful technology leader, you lead by example and gain your teammates respect through actions, not the title; Experience in software development, building complex and scalable solutions; Proven experience leading and managing a team of engineers in a fast-paced and complex environment; Solid experience in at least one programming language (Java, C/C++, Python, Go) Ability to formulate software solutions from scratch Solid understanding of Service Oriented Architecture, Microservices & OOP patterns Hands-on experience in Linux administration and troubleshooting Creative approach to problem-solving Practical experience in understanding and defining SLIs and SLOs Past experience with Payments or FinTech and working in a regulated environment is a plus; Strong analytical skills and data-driven mindset.
Posted 5 days ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
62336 Jobs | Dublin
Wipro
24848 Jobs | Bengaluru
Accenture in India
20859 Jobs | Dublin 2
EY
18920 Jobs | London
Uplers
13736 Jobs | Ahmedabad
IBM
12924 Jobs | Armonk
Bajaj Finserv
12820 Jobs |
Accenture services Pvt Ltd
11998 Jobs |
Amazon
11950 Jobs | Seattle,WA
Oracle
11422 Jobs | Redwood City