Jobs
Interviews

930 Failover Jobs - Page 3

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 5.0 years

5 - 7 Lacs

Gurgaon

Remote

Key Responsibilities · Manage and maintain Microsoft SQL Server databases (2016 and later) across development, UAT, and production environments. · Monitor and improve database performance using Query Store, Extended Events, and Dynamic Management Views (DMVs). · Design and maintain indexes, partitioning strategies, and statistics to ensure optimal performance. · Develop and maintain T-SQL scripts, views, stored procedures, and triggers. · Implement robust backup and recovery solutions using native SQL Server tools and third-party backup tools (if applicable). · Ensure business continuity through high-availability configurations such as Always On Availability Groups, Log Shipping, or Failover Clustering. · Perform database capacity planning and forecast growth requirements. · Ensure SQL Server security by managing logins, roles, permissions, and encryption features like TDE. · Collaborate with application developers for schema design, indexing strategies, and performance optimization. · Handle deployments, patching, and version upgrades in a controlled and documented manner. · Maintain clear documentation of database processes, configurations, and security policies. Required Skills & Qualifications · Bachelor’s degree in Computer Science, Engineering, or related field. · 3–5 years of solid experience with Microsoft SQL Server (2016 or later). · Strong command of T-SQL including query optimization, joins, CTEs, window functions, and error handling. · Proficient in interpreting execution plans, optimizing long-running queries, and using indexing effectively. · Understanding of SQL Server internals such as page allocation, buffer pool, and lock escalation. · Hands-on experience with backup/restore strategies and consistency checks (DBCC CHECKDB). · Experience with SQL Server Agent Jobs, alerts, and automation scripts (PowerShell or T-SQL). · Ability to configure and manage SQL Server high-availability features. · Exposure to tools like Redgate SQL Monitor, SolarWinds DPA, or similar is a plus. Nice to Have · Exposure to Azure SQL Database or cloud-hosted SQL Server infrastructure. · Basic understanding of ETL workflows using SSIS. · Microsoft Certification: MCSA / Azure Database Administrator Associate or equivalent. · Experience with database deployments in CI/CD pipelines. Job Types: Full-time, Permanent Pay: ₹500,000.00 - ₹700,000.00 per year Benefits: Paid sick time Paid time off Provident Fund Work from home Education: Bachelor's (Preferred) Experience: SQL: 3 years (Required) Location: Gurgaon, Haryana (Required) Work Location: In person

Posted 3 days ago

Apply

2.0 years

3 - 6 Lacs

Ahmedabad

On-site

Job Title: Information Technology (IT) Executive Company: Safebooks Global Location: Ahmedabad Job Type: Full-Time Industry: US Accounting Outsourcing Department: Information Technology About Safebooks Global Safebooks Global is a fast-growing US accounting outsourcing firm offering bookkeeping, payroll, and tax support services to CPAs, EAs, and accounting firms across the United States. We help our clients reduce overhead, improve turnaround times, and increase profitability through skilled offshore support. Position Overview: The IT Executive will be responsible for end-to-end management of the organization’s IT infrastructure, including system configuration, user support, server administration, network and security management, backup operations, and client-side IT support. The role demands strong technical expertise, proactive problem-solving, excellent documentation skills, and cross-departmental coordination to ensure smooth IT operations and business continuity. Key Responsibilities: 1. System Configuration & User Support Configure operating systems (Windows, Linux, macOS) on user machines with 100% accuracy. Respond to IT tickets within 30 minutes during business hours; resolve 90%+ within SLA. Troubleshoot hardware/software issues (printers, applications, OS) with ≥ 95% resolution efficiency. 2. Server, Backup & Data Security Management Monitor server health and ensure ≥ 99.5% uptime. Execute daily, weekly, and monthly backups for critical systems with 100% success rate and logs. Manage firewall rules, perform daily security checks and backups, ensuring 100% uptime. Conduct monthly test restores to verify backup integrity with zero data loss tolerance. 3. Network, Domain & Security Management Maintain stable networks (routers, switches, VPNs) with ≥ 98% uptime. Administer Google Workspace (G Suite) for users, access, and email controls with 100% accuracy. Ensure biometric and CCTV systems are always operational; complete daily checklists. Perform daily internal network and security inspections to detect and mitigate risks. 4. IT Asset, License & Vendor Management Maintain up-to-date and accurate inventory using Snipe IT or equivalent (≥ 98% accuracy). Track, manage, and renew all software licenses before expiry. Identify and propose IT cost-saving strategies with demonstrable impact per quarter. 5. Project Implementation, Testing & Client Support Test new tools, applications, and upgrades with full documentation and reporting. Deploy and configure new servers with correct domain integration and failover mechanisms. Provide IT support to clients like Ratanakar and ABJ, ensuring ≥ 95% satisfaction levels. 6. Interdepartmental & Field Support Assist HR/Admin teams with IT setup for campaigns, employee onboarding, and events. Attend to out-of-office/client-side issues with 100% resolution of assigned tasks. 7. Reporting & Documentation Complete all daily IT checklists including CCTV, backup, server and network logs. Maintain accurate records of users, devices, licenses, and credentials. Submit monthly KPI reports and internal IT summaries within designated timelines. Qualifications: Bachelor’s Degree in IT, Computer Science, or a related field. 2+ years of experience in a similar IT support/administrator role. Strong knowledge of operating systems, networking, firewalls, and Google Workspace. Experience with server management, backups, and IT asset tracking tools (e.g., Snipe IT). Excellent problem-solving, multitasking, and documentation skills. Preferred Skills: Hands-on experience with FortiGate firewalls and Ubiquiti/TP-Link networking equipment. Familiarity with backup and recovery tools. Exposure to cloud and SaaS tools used by small to medium businesses. Work Conditions: Must be available for on-site and occasional client visits. Willing to support out-of-office hours in case of urgent issues or deployments. To Apply: Please send your resume and a brief note on your past sales or client acquisition wins to shailesh@safebooksglobal.com and jobs@safebooksglobal.com Immediate joiners preferred! These positions are urgent, and we are looking for candidates who are available to join immediately . We would appreciate it if you could send me the details below. Name : Phone : Email : Current Location : No. Of Years of Experience in Relevant : Current CTC : Expected CTC : Designation: Current Company : Notice Period : Relocation : Additional Comments : Job Type: Full-time Pay: ₹25,000.00 - ₹55,000.00 per month Benefits: Provident Fund Work Location: In person

Posted 3 days ago

Apply

8.0 years

0 Lacs

Kochi, Kerala, India

On-site

Experience: 6–8+ years Role Overview We are seeking a seasoned and hands-on Database Administrator (DBA) with deep expertise in Oracle (Core/EBS/Exadata) and PostgreSQL administration. This role demands the ability to operate across a heterogeneous and high-availability database ecosystem, proactively tune performance, drive automation, and ensure enterprise-grade uptime and compliance. Key Responsibilities Manage, tune, and optimize performance across 220+ database instances using tools such as Oracle Enterprise Manager (OEM), Quest Spotlight, and custom observability scripts. Implement and maintain robust backup and recovery strategies for production and non-production environments. Participate in and support Disaster Recovery (DR) tests, failover drills, and maintain High Availability (HA) setups (e.g., Oracle DataGuard, Oracle RAC, SQL Clustering). Build and maintain system health dashboards, track performance baselines, and provide capacity planning insights. Collaborate cross-functionally with application, BI, and infrastructure teams to identify and resolve performance bottlenecks and database-related issues. Administer database patching, access control, and provisioning activities, ensuring alignment with SOX compliance standards. Contribute to infrastructure automation using Shell scripting, Python, or Ansible. Support and maintain replication and integration tools including Oracle GoldenGate, Qlik Replicate, and Oracle Data Integrator (ODI). Required Skills & Qualifications 6–8+ years of enterprise DBA experience across Oracle and PostgreSQL environments. Strong Oracle DBA experience (including RAC, DataGuard, Exadata, EBS administration, and patching via Rimini Support). Hands-on experience with PostgreSQL (on-prem and AWS-hosted) — including deployment, scaling, and tuning. Proven expertise in monitoring, diagnosing, and optimizing high-throughput transactional databases. Experience managing replication tools like Oracle GoldenGate and Qlik Replicate. Proficiency in shell scripting and at least one automation/configuration management tool (Python, Ansible, etc.). Familiarity with compliance frameworks (e.g., SOX) and implementation of access controls and audit logs. Comfortable working in a 24x7 support model, including enhanced responsiveness during period-close cycles.

Posted 3 days ago

Apply

10.0 years

0 Lacs

Kochi, Kerala, India

On-site

Experience: 10+ years Role Type: Full-time Role Overview We are seeking a Senior Oracle and SQL Server SME to serve as the Track Lead within a global database managed services engagement. This is a hands-on leadership role involving performance tuning, optimization, HA/DR planning, and managing operations across a heterogeneous enterprise DB environment, including Oracle (EBS/Exadata), SQL Server, PostgreSQL (on-prem and AWS). The ideal candidate will have a deep understanding of enterprise performance diagnostics, observability tooling, compliance standards, and team coordination in a 24x7 support model. Key Responsibilities Track Leadership & Oversight. Lead and mentor the DBA team delivering 24x7 on-desk support across all platforms. Ensure SLA compliance for all database support tickets. Serve as the escalation point for P1/P2 incidents and drive root cause analysis. Coordinate daily operations, performance health checks, and scheduled activities including DR drills, code deployments, and database cloning. Performance Tuning & Optimization Perform deep-dive performance analysis using tools like AWR, ASH, ADDM, and SQL Trace/Monitor. Review and interpret AWR/ASH reports to identify inefficient SQL, wait events, I/O bottlenecks, and system load issues. Tune problematic queries, optimize indexes, analyze execution plans, and make schema design recommendations. Partner with development and application teams to implement long-term performance improvements. Monitoring, Automation & Observability Monitor database health using OEM, Quest Spotlight, and custom scripts. Establish and continuously refine baselines, KPIs, and automated alerts for availability and performance anomalies. Drive automation of routine DBA tasks including backups, patching, and reporting. Administration & Lifecycle Management Oversee patching, cloning, upgrades, and regular maintenance across ~220 databases. Manage backup/recovery strategies, database provisioning, and access control in accordance with SOX compliance. Maintain DR and HA setups, including Oracle DataGuard, SQL Server clustering, and storage replication for EBS. EBS & Middleware Stack Administer Oracle E-Business Suite (EBS) Database environments Ensure database support for applications like ODI, STAT, and replication tools like Oracle GoldenGate and Qlik. Required Skills & Experience 10+ years DBA experience, including Oracle 19c/Exadata/EBS and SQL Server administration. Hands-on tuning expertise using AWR, ADDM, ASH, Statspack, and advanced troubleshooting techniques. Strong knowledge of PostgreSQL (AWS-hosted administration is preferred). Expertise in GoldenGate, ODI, Qlik Replicate, and replication troubleshooting. Experience in HA/DR architecture, capacity planning, and SOX-compliant auditing. Scripting/automation using Shell, Python, Ansible, or similar tools. Soft Skills Proven leadership in managing global delivery models and multi-vendor teams. Strong communication skills and ability to interface with business, security, and application owners. Structured thinker with a focus on continuous improvement and automation-first mindset. Work Conditions & Expectations Responsible for 24x7 support, including failover drills, backups, and code deployment cycles. Coordination with OEM, Rimini, and Oracle Support for escalations and patching.

Posted 3 days ago

Apply

8.0 years

0 Lacs

Trivandrum, Kerala, India

On-site

Experience: 6–8+ years Role Overview We are seeking a seasoned and hands-on Database Administrator (DBA) with deep expertise in Oracle (Core/EBS/Exadata) and PostgreSQL administration. This role demands the ability to operate across a heterogeneous and high-availability database ecosystem, proactively tune performance, drive automation, and ensure enterprise-grade uptime and compliance. Key Responsibilities Manage, tune, and optimize performance across 220+ database instances using tools such as Oracle Enterprise Manager (OEM), Quest Spotlight, and custom observability scripts. Implement and maintain robust backup and recovery strategies for production and non-production environments. Participate in and support Disaster Recovery (DR) tests, failover drills, and maintain High Availability (HA) setups (e.g., Oracle DataGuard, Oracle RAC, SQL Clustering). Build and maintain system health dashboards, track performance baselines, and provide capacity planning insights. Collaborate cross-functionally with application, BI, and infrastructure teams to identify and resolve performance bottlenecks and database-related issues. Administer database patching, access control, and provisioning activities, ensuring alignment with SOX compliance standards. Contribute to infrastructure automation using Shell scripting, Python, or Ansible. Support and maintain replication and integration tools including Oracle GoldenGate, Qlik Replicate, and Oracle Data Integrator (ODI). Required Skills & Qualifications 6–8+ years of enterprise DBA experience across Oracle and PostgreSQL environments. Strong Oracle DBA experience (including RAC, DataGuard, Exadata, EBS administration, and patching via Rimini Support). Hands-on experience with PostgreSQL (on-prem and AWS-hosted) — including deployment, scaling, and tuning. Proven expertise in monitoring, diagnosing, and optimizing high-throughput transactional databases. Experience managing replication tools like Oracle GoldenGate and Qlik Replicate. Proficiency in shell scripting and at least one automation/configuration management tool (Python, Ansible, etc.). Familiarity with compliance frameworks (e.g., SOX) and implementation of access controls and audit logs. Comfortable working in a 24x7 support model, including enhanced responsiveness during period-close cycles.

Posted 3 days ago

Apply

0.0 - 3.0 years

11 - 12 Lacs

Gurugram, Haryana

On-site

Job Title: Pre-Sales & Solutioning Engineer – Data Center & Integration Technologies Location: [Gurgaon] Experience: 3+ years Department: Sales Engineering / Solutions Reporting to: Director -Sales About Giniminds: Giniminds is a technology-driven innovation company at the forefront of Digital Transformation, AI, and Enterprise Architecture. We partner with global enterprises to co-create and implement impactful solutions that integrate modern platforms, intelligent infrastructure, and deep domain knowledge. Role Summary: We are looking for a dynamic Pre-Sales & Solutioning Engineer to lead the technical discovery, design, and presentation of integrated infrastructure and data solutions. You will work closely with customers, sales teams, and engineering partners to position Giniminds' server and data center offerings along with advanced integration platforms like Confluent Kafka, StreamSets, and Software AG WebMethods. Key Responsibilities: Pre-Sales Engagement: Participate in customer meetings, understand business needs, and translate them into technical solutions. Prepare and deliver technical presentations, demos, POCs, and RFP responses. * Collaboration: Act as a bridge between sales, product, and engineering teams to align solution offerings with customer expectations. Work with OEM partners (Dell, Cisco, NetApp, Pure Storage, etc.) for sizing, licensing, and compatibility. Client Advisory: Consult customers on infrastructure modernization, integration strategy, and data fabric architecture. Advise on hybrid/multi-cloud deployment strategies, high availability, and failover architectures. * Documentation & Enablement: Create solution blueprints, BOMs, integration workflows, and architecture documents. Support knowledge transfer to internal teams and customers post-deployment. Required Skills & Experience: Infrastructure Expertise: Strong understanding of server platforms, compute virtualization, hyperconverged infrastructure, and data center networking. Experience with Dell PowerEdge, Cisco UCS, HPE servers, and enterprise storage (Pure, NetApp, etc.). Integration Platforms: Exposure with Confluent Kafka (topics, brokers, schema registry, KSQL, connectors). Exposure to StreamSets, WebMethods, or other enterprise integration tools. Presales & Client Facing: Experience in creating RFP/RFI responses, presentations, and POCs. Strong communication and stakeholder engagement skills. Cloud & DevOps (preferred): Knowledge of Kubernetes, container orchestration, CI/CD pipelines. Experience with cloud infrastructure (AWS/GCP/Azure) is a plus. Qualifications: Bachelor's or Master's in Computer Science, Engineering, or related field. 3 + years in IT infrastructure, integration, or pre-sales engineering roles. Certifications in relevant domains (e.g., Cisco DC, VMware, Openshift) are a plus. Why Join Giniminds? Work on cutting-edge digital transformation programs across industries. Be part of a collaborative, innovation-led team. Opportunities to work with global partners in cloud, data, and AI ecosystems. Job Type: Full-time Pay: ₹1,100,000.00 - ₹1,200,000.00 per year Ability to commute/relocate: Gurugram, Haryana: Reliably commute or planning to relocate before starting work (Required) Experience: Pre sales : 3 years (Required) Solution engineering: 3 years (Required) IT Infrastructure : 3 years (Required) Integration: 3 years (Required) Work Location: In person

Posted 3 days ago

Apply

8.0 years

0 Lacs

Gujarat, India

On-site

Job Summary : We are seeking a highly skilled and motivated Lead DevOps Engineer with Solution Architect expertise to manage end-to-end infrastructure projects across cloud, hybrid, and dedicated server environments. This role demands hands-on experience with WHM/cPanel, OpenPanel, load balancers , and deep knowledge of modern DevOps practices. The ideal candidate will also lead a team of DevOps engineers, drive technical excellence, and serve as the go-to expert for scalable, secure, and high-availability infrastructure solutions. Key Responsibilities : DevOps & Infrastructure Management Architect, implement, and maintain scalable infrastructure solutions across cloud and dedicated server environments. Manage hosting infrastructure including WHM/cPanel, OpenPanel , Apache/Nginx, MySQL, DNS, mail servers, and firewalls. Design and configure load balancing strategies using HAProxy, NGINX, or cloud-native load balancers. Automate provisioning, configuration, deployment, and monitoring using tools like Ansible , Terraform , CI/CD (Jenkins, GitLab CI) . Ensure infrastructure reliability, security, and disaster recovery processes are in place. Solution Architecture Translate business and application requirements into robust infrastructure blueprints. Lead design reviews and architectural discussions for client and internal projects. Create documentation and define architectural best practices for hosting and DevOps. Team Management & Leadership Lead and mentor a team of DevOps engineers across multiple projects. Allocate resources, manage project timelines, and ensure successful delivery. Foster a culture of innovation, continuous improvement, and collaboration. Conduct performance reviews, provide training, and support career development of team members. Monitoring, Security & Optimization Set up and maintain observability systems (e.g., Prometheus, Grafana, Zabbix). Conduct performance tuning, cost optimization, and environment hardening. Ensure compliance with internal policies and external standards (ISO, GDPR, SOC2, etc.). Required Skills & Experience : 8+ years of experience in DevOps, systems engineering, or cloud infrastructure management . 3+ years of experience in team leadership or technical management . Proven expertise in hosting infrastructure , including WHM/cPanel, OpenPanel, Plesk, DNS, and mail configurations. Strong experience with Linux servers , networking , security , and automation scripting (Bash, Python). Hands-on experience with cloud platforms (AWS, Azure, GCP) and hybrid environments. Deep understanding of CI/CD pipelines , Docker/Kubernetes , and version control (Git). Familiarity with load balancing , high availability, and failover strategies. Preferred Qualifications : Certifications such as AWS Solutions Architect , RHCE , CKA , or Linux Foundation Certified Engineer . Experience in IT services or hosting/cloud consulting environments. Knowledge of compliance frameworks (e.g., ISO 27001, SOC 2, PCI-DSS). Familiarity with agile methodologies and DevOps lifecycle management tools.

Posted 4 days ago

Apply

3.0 years

0 Lacs

Gurugram, Haryana, India

On-site

Job Title: Senior DevOps Engineer (SRE2) Location: Gurugram Experience: 3+ Years About HaaNaa HaaNaa is a skill-based opinion trading platform that lets users trade their opinions on diverse topics using simple Yes/No choices. From politics, crypto, and finance to sports, entertainment, and current affairs—HaaNaa transforms opinions into assets. With a gamified interface, users get rewarded for informed predictions, while tracking real-time trends, analyzing insights, and engaging with a vibrant community. Role Overview We are looking for a Senior DevOps Engineer (SRE2) to lead and scale our infrastructure as we grow our real-time trading platform. This role demands a mix of hands-on DevOps skills and strong ownership of system reliability, scalability, and observability. Key Responsibilities Design, deploy, and manage scalable, secure, and resilient infrastructure on AWS, focusing on EKS (Elastic Kubernetes Service) for container orchestration. Implement and manage service mesh using Istio, enabling traffic control, observability, and security across microservices. Drive Infrastructure-as-Code (IaC) using Terraform for consistent and repeatable provisioning of cloud resources. Build and maintain robust CI/CD pipelines (GitHub Actions, Jenkins, or CircleCI) to ensure efficient and automated delivery workflows. Ensure high system availability, performance, and reliability—taking ownership of SLIs/SLOs/SLAs, alerts, and dashboards. Implement observability practices using tools like Prometheus, Grafana, ELK/EFK, or OpenTelemetry. Manage incident response, root cause analysis (RCA), and drive postmortem culture. Collaborate with cross-functional teams (engineering, QA, product) to ensure DevOps and SRE best practices are followed. Harden platform against security threats (including DDoS) using Cloudflare, Akamai, or equivalent. Automate repetitive tasks using scripting (Python, Bash) and tools like Ansible. Contribute to platform cost optimization, auto-scaling, and multi-region failover strategies. Requirements 3+ years of hands-on DevOps/SRE experience including team mentorship or leadership. Proven expertise in managing AWS cloud-native architecture, especially EKS, IAM, VPC, ALB/NLB, S3, RDS, CloudWatch. Hands-on with Istio for service mesh and microservice observability/security. Deep experience with Terraform for managing cloud infrastructure. Proficiency in CI/CD and automation tools (GitHub Actions, Jenkins, CircleCI, Ansible). Strong scripting skills in Python, Bash, or equivalent. Familiar with Kubernetes administration, Helm charts, and container orchestration. Strong understanding of monitoring, alerting, and logging systems. Experience handling DDoS mitigation, WAF rules, and CDN configuration. Excellent problem-solving and incident management skills with a proactive mindset. Strong collaboration and communication skills. Nice to Have Experience in high-growth startups or gaming platforms. Understanding of security best practices, IAM policies, and compliance frameworks (SOC2, ISO, etc.). Experience in backend performance tuning, horizontal scaling, and chaos engineering. Familiarity with progressive delivery techniques like Canary deployments or Blue/Green strategies. Why Join HaaNaa? Ownership: Play a key role in shaping the platform’s infrastructure and reliability. Innovation: Work on scalable, low-latency systems powering real-time gamified trading. Teamwork: Join a dynamic, talented team solving complex engineering challenges. Growth: Be part of a rapidly expanding company with leadership growth opportunities. Perks & Benefits: Competitive salary, health insurance, and the freedom to experiment with the latest cloud-native tools. Skills: devops,terraform,ci/cd,cloudformation,go,networking,datadog,aws,grafana,sre,kubernetes,azure,security,prometheus,infrastructure-as-code,gcp,bash,docker,python,linux system administration,elk stack

Posted 4 days ago

Apply

12.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

About Aeris: For more than three decades, Aeris has been a trusted cellular IoT leader enabling the biggest IoT programs and opportunities across Automotive, Utilities and Energy, Fleet Management and Logistics, Medical Devices, and Manufacturing. Our IoT technology expertise serves a global ecosystem of 7,000 enterprise customers and 30 mobile network operator partners, and 80 million IoT devices across the world. Aeris powers today’s connected smart world with innovative technologies and borderless connectivity that simplify management, enhance security, optimize performance, and drive growth. Built from the ground up for IoT and road-tested at scale, Aeris IoT Services are based on the broadest technology stack in the industry, spanning connectivity up to vertical solutions. As veterans of the industry, we know that implementing an IoT solution can be complex, and we pride ourselves on making it simpler. Our company is in an enviable spot. We’re profitable, and both our bottom line and our global reach are growing rapidly. We’re playing in an exploding market where technology evolves daily and new IoT solutions and platforms are being created at a fast pace. A few things to know about us: We put our customers first . When making decisions, we always seek to do what is right for our customer first, our company second, our teams third, and individual selves last We do things differently. As a pioneer in a highly competitive industry that is poised to reshape every sector of the global economy, we cannot fall back on old models. Rather, we must chart our own path and strive to out-innovate, out-learn, out-maneuver and out-pace the competition on the way We walk the walk on diversity. We’re a brilliant and eclectic mix of ethnicities, religions, industry experiences, sexual orientations, generations and more – and that’s by design. We see diverse perspectives as a core competitive advantage Integrity is essential. We believe in doing things well – and doing them right. Integrity is a core value here: you’ll see it embodied in our staff, our management approach and growing social impact work (we have a VP devoted to it). You’ll also see it embodied in the way we manage people and our HR issues: we expect employees and managers to deal with issues directly, immediately and with the utmost respect for each other and for the Company We are owners. Strong managers enable and empower their teams to figure out how to solve problems. You will be no exception, and will have the ownership, accountability and autonomy needed to be truly creative Job Title: Senior Oracle Database Administrator (DBA) – GCP Location: Noida, India We are seeking a highly skilled and experienced Senior Oracle DBA to manage and maintain our critical Oracle 12c, 18c, 19c, 21c single instance with DG and RAC databases, hosted on Google Cloud Platform (GCP). The ideal candidate will possess deep expertise in Oracle database administration, including installation, configuration, patching, performance tuning, security, and backup/recovery strategies within a cloud environment. They will also have expertise and experience optimizing the underlying operating system and database parameters for maximum performance and stability. Responsibilities: Database Administration: Install, configure, and maintain Oracle 12c, 18c, 19c, 21c single instance with DG and RAC databases on GCP Compute Engine. Implement and manage Oracle Data Guard for high availability and disaster recovery, including switchovers, failovers, and broker configuration. Perform database upgrades, patching, and migrations. Develop and implement backup and recovery strategies, including RMAN configuration and testing. Monitor database performance and proactively identify and resolve performance bottlenecks. Troubleshoot database issues and provide timely resolution. Implement and maintain database security measures, including user access control, auditing, and encryption. Automate routine database tasks using scripting languages (e.g., Shell, Python, PL/SQL). Create and maintain database documentation. Database Parameter Tuning: In-depth knowledge of Oracle database initialization parameters and their impact on performance, with a particular focus on memory management parameters. Expertise in tuning Oracle memory structures (SGA, PGA) for optimal performance in a GCP environment. This includes: Precisely sizing the SGA components (Buffer Cache, Shared Pool, Large Pool, Java Pool, Streams Pool) based on workload characteristics and available GCP Compute Engine memory resources. Optimizing PGA allocation (PGA_AGGREGATE_TARGET, PGA_AGGREGATE_LIMIT) to prevent excessive swapping and ensure efficient SQL execution. Understanding the interaction between SGA and PGA memory regions and how they are affected by GCP instance memory limits. Tuning the RESULT_CACHE parameters for optimal query performance, considering the available memory and workload patterns. Proficiency in using Automatic Memory Management (AMM) and Automatic Shared Memory Management (ASMM) features and knowing when manual tuning is required for optimal results. Knowledge of how GCP instance memory limits can impact Oracle's memory management and the appropriate adjustments to make. Experience with analysing AWR reports and identifying areas for database parameter optimization, with a strong emphasis on identifying memory-related bottlenecks (e.g., high buffer busy waits, excessive direct path reads/writes). Proficiency in tuning SQL queries using tools like SQL Developer and Explain Plan, particularly identifying queries that consume excessive memory or perform inefficient memory access patterns. Knowledge of Oracle performance tuning methodologies and best practices, specifically as they apply to memory management in a cloud environment. Experience with database indexing strategies and index optimization, understanding the impact of indexes on memory utilization. Solid understanding of Oracle partitioning and its benefits for large databases, including how partitioning can affect memory usage and query performance. Ability to perform proactive performance tuning based on workload analysis and trending, with a focus on memory usage patterns and potential memory-related performance issues. Expertise in diagnosing and resolving memory leaks or excessive memory consumption issues within the Oracle database. Deep understanding of how shared memory segments are managed within the Linux OS on GCP Compute Engine and how to optimize them for Oracle. Data Guard Expertise: Deep understanding of Oracle Data Guard architectures (Maximum Performance, Maximum Availability, Maximum Protection). Expertise in configuring and managing Data Guard broker for automated switchovers and failovers. Experience in troubleshooting Data Guard issues and ensuring data consistency. Knowledge of Data Guard best practices for performance and reliability. Proficiency in performing Data Guard role transitions (switchover, failover) with minimal downtime. Experience with Active Data Guard is a plus. Operating System Tuning: Deep expertise in Linux operating systems (e.g., Oracle Linux, Red Hat, CentOS) and their interaction with Oracle databases. Performance tuning of the Linux operating system for optimal Oracle database performance, including: Kernel parameter tuning (e.g., shared memory settings, semaphores, file descriptor limits). Memory management optimization (e.g., HugePages configuration). I/O subsystem tuning (e.g., disk scheduler selection, filesystem optimization). Network configuration optimization (e.g., TCP/IP parameters). Monitoring and analysis of OS performance metrics using tools like vmstat, iostat, top, and sar. Identifying and resolving OS-level resource contention issues (CPU, memory, I/O). Good to Have: GCP Environment Management: Provision and manage GCP Compute Engine instances for Oracle databases, including selecting appropriate instance types and storage configurations. Configure and manage GCP networking components (VPCs, subnets, firewalls) for secure database access. Utilize GCP Cloud Monitoring and Logging for database monitoring and troubleshooting. Implement and manage GCP Cloud Storage for database backups. Experience with Infrastructure as Code (IaC) tools like Terraform or Cloud Deployment Manager to automate GCP resource provisioning. Cost optimization of Oracle database infrastructure on GCP. Other Products and Platforms Experience with other cloud platforms (AWS, Azure). Experience with NoSQL databases. Experience with Agile development methodologies. Experience with DevOps practices and tools (e.g., Ansible, Chef, Puppet). Experience with GoldenGate. Qualifications: Bachelor's degree in Computer Science or a related field. Minimum 12+ years of experience as an Oracle DBA. Proven experience managing Oracle 12c, 18c, 19c, and 21c single instance with DG and RAC databases in a production environment, with strong Data Guard expertise. Extensive experience with Oracle database performance tuning, including OS-level and database parameter optimization. Hands-on experience with Oracle databases hosted on Google Cloud Platform (GCP). Strong understanding of Linux operating systems. Excellent troubleshooting and problem-solving skills. Strong communication and collaboration skills. Oracle Certified Professional (OCP) certification is highly preferred. GCP certifications (e.g., Cloud Architect, Cloud Engineer) are a plus. Aeris may conduct background checks to verify the information provided in your application and assess your suitability for the role. The scope and type of checks will comply with the applicable laws and regulations of the country where the position is based. Additional detail will be provided via the formal application process. Aeris walks the walk on diversity. We’re a brilliant mix of varying ethnicities, religions, cultures, sexual orientations, gender identities, ages and professional/personal/military experiences – and that’s by design. Diverse perspectives are essential to our culture, innovative process and competitive edge. Aeris is proud to be an equal opportunity employer.

Posted 4 days ago

Apply

5.0 years

3 - 9 Lacs

Delhi, Delhi

On-site

Job Title: Telecom Development Engineer – FreeSWITCH & Kazoo Department: Engineering / VoIP Platform Location: On-Site Delhi Employment Type: Full-time Experience Level: 5+ years in VoIP/Telecom Development Role Summary: We are seeking a highly skilled Telecom Development Engineer with hands-on experience in FreeSWITCH and Kazoo , alongside strong programming skills in Go , Python , and familiarity with Cloud Databases , RabbitMQ , REST APIs , Ansible , Prometheus , Grafana , and Git The ideal candidate will be responsible for developing and maintaining VoIP applications and modules in FreeSWITCH and integrating them into the Kazoo multi-tenant telephony platform using Monster UI. Key Responsibilities: Design and Develop Custom FreeSWITCH Modules: Create scalable, high-performance modules and dialplans in FreeSWITCH using Lua, Go, or C. Work with ESL (Event Socket Library) and mod_xml_curl to extend call handling logic Kazoo Integration and Configuration: Deploy FreeSWITCH modules and services into Kazoo via Monster UI and Kazoo APIs. Customize and extend Kazoo applications using Kazoo’s AMQP and REST API interfaces. Application Development: Build automation tools and microservices using Go and Python to manage telecom workflows. Develop backend services that interface with SIP, RTP, and Kazoo/FreeSWITCH subsystems. Infrastructure Automation & Monitoring: Automate deployments with Ansible . Monitor system health using Prometheus and Grafana . Implement scalable logging, alerting, and system health-checks. DevOps & Source Control: Use Git for version control and CI/CD workflows. Collaborate on code reviews and participate in agile sprints. API Integration: Consume and expose RESTful APIs to support user interface functionality and backend logic. Integrate with third-party systems and internal services using RabbitMQ message queues. Troubleshooting and Optimization: Investigate and resolve SIP signaling issues, one-way audio, NAT traversal, and codec mismatches. Optimize RTP stream handling, failover, load balancing, and call quality. Required Skills & Qualifications: VoIP Expertise: Deep understanding of SIP, RTP, SDP, NAT , and SIP tracing tools (e.g., sngrep, Wireshark). Experience building and maintaining VoIP platforms using FreeSWITCH and Kazoo . Programming Languages: Proficiency in Go (Golang) and Python . Familiarity with Lua scripting and C for FreeSWITCH module development. Messaging & Databases: Experience with RabbitMQ (AMQP) and Cloud DBs like CouchDB/Couchbase (used by Kazoo). Infrastructure Tools: Strong skills in Ansible , Git , and CI/CD pipelines. Proficient in Prometheus and Grafana for system observability. Web & API Skills: Proficient in designing and consuming RESTful APIs . Experience with Kazoo REST APIs and Monster UI for provisioning and monitoring. Preferred Qualifications: Experience working in multi-tenant VoIP platforms . Familiarity with WebRTC , STUN/TURN, and SBCs (Session Border Controllers). Previous contributions to open-source VoIP projects. Knowledge of Docker or containerization for telecom applications. Key Attributes: Strong problem-solving skills and ability to work independently. Excellent communication and documentation skills. Passion for scalable systems, performance optimization, and clean architecture. Collaborative mindset and proactive in a team environment. Job Types: Full-time, Permanent Pay: ₹311,015.97 - ₹900,000.00 per year Benefits: Cell phone reimbursement Internet reimbursement Paid time off Work Location: In person Expected Start Date: 11/08/2025

Posted 4 days ago

Apply

5.0 years

0 Lacs

Ayodhya, Uttar Pradesh, India

On-site

Title: Backend Engineer – Scalable APIs Location: Ayodhya - Uttar Pradesh Experience: 3–5 years Responsibilities: Design and develop scalable REST APIs using Node.js or Go. Implement JWT-based authentication and user management. Build integrations with Stripe/Razorpay for payment flows. Optimize database queries and caching for high performance. Implement microservices architecture for scalability. Requirements: Proficiency in Node.js (Express/Fastify) or Go . Strong knowledge of PostgreSQL and query optimization. Experience with Redis caching and async queues . Familiarity with Docker , NGINX , and API security best practices . Nice to Have: Experience in Kubernetes deployments. Knowledge of event-driven architecture. Title: DevOps & Cloud Engineer – AWS/Kubernetes Location: Ayodhya - Uttar Pradesh Experience: 4–6 years Responsibilities: Set up and manage AWS infrastructure (EKS, RDS, S3, CloudFront). Implement CI/CD pipelines using GitHub Actions or Jenkins. Deploy and manage containerized apps with Kubernetes. Configure auto-scaling , load balancing, and failover strategies. Monitor system performance and ensure 99.9% uptime. Requirements: Strong knowledge of AWS services : EC2, EKS, RDS, S3, ALB. Proficiency in Kubernetes , Helm , and Docker . Experience with monitoring tools (Prometheus, Grafana). Familiarity with Terraform / IaC (Infrastructure as Code). Nice to Have: Experience with multi-region deployments . Knowledge of cost optimization on AWS. Title: QA & Performance Engineer – Load Testing & Automation Location: Ayodhya - Uttar Pradesh Experience: 3–5 years Responsibilities: Design automated test cases for functional and regression testing. Implement load and stress testing using k6 / JMeter / Locust. Analyze performance bottlenecks in backend APIs and DB. Monitor key metrics: response time, throughput, error rate. Work closely with DevOps for scaling strategies. Requirements: Proficiency in load testing tools (k6, JMeter, Locust). Experience with API testing (Postman, REST Assured). Familiarity with CI/CD integration for testing . Knowledge of Grafana / Prometheus dashboards . Nice to Have: Understanding of Kubernetes and containerized test environments. Prior experience with high concurrency systems .

Posted 4 days ago

Apply

0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Business Unit Cubic Transportation Systems Company Details When you join Cubic, you become part of a company that creates and delivers technology solutions in transportation to make people’s lives easier by simplifying their daily journeys, and defense capabilities to help promote mission success and safety for those who serve their nation. Led by our talented teams around the world, Cubic is committed to solving global issues through innovation and service to our customers and partners. We have a top-tier portfolio of businesses, including Cubic Transportation Systems (CTS) and Cubic Defense (CD). Explore more on Cubic.com. Job Details The Systems Administrator plays an integral role in the deployment team. Assists in system architecture, design, integration, and development. The administrator assures systems are well-behaved and all system platform operating systems (Windows, AIX, UNIX, LINUX, etc.) remain current and secure. In addition, the administrator role ensures that upgrades and installations are well-rehearsed and documented prior to conducting official installations. The administrator is the key liaison who works closely with colleagues and customers to ensure systems in-house or deployed Nextfare suite products software, and peripherals are kept current and functional. This position works under general supervision and direction. “This role requires an employee to work on a rotational basis (24/7) that includes night shifts and fixed weekend, including Saturday and Sunday shifts (12 hours a day or 12 hours evening/night during weekends) and the rest of the weekdays 9 hours, total of 40 hours a week. Such an employee would have 4 days working and 3 days weekly OFF” Essential Job Duties And Responsibilities Performs day-to-day system administration Monitors and manages system health checks, OSs, and system software. Assists the Network administrator as needed with LAN, WAN, and Internet. Manages and controls Software licenses. Maintains secure backend systems and LANs. Provides guidance and recommendations on all backend OS’s. Provides Windows, UNIX, LINUX, AIX, and NT-based platforms. Installs and configures system backup/restore/failover software (NetBackup/Veritas/Legato, etc) and hardware. Conducts performance tuning; optimization of resource configuration – All platforms and LAN. Supports the configuration of Routers, Firewalls, and Load Balancers. Assists in the installation and configuration of databases. Assists in installing and configuring monitoring software such as Big Brother, etc. Applies the system OS and DB. patch sets Installs Oracle database software General Duties And Responsibilities Comply with Cubic’s Quality Management System Comply with Cubic Occupational Health, Safety, and Environment policies and procedures Comply with security in accordance with established policies and procedures of the organisation Comply with Cubic Human Resources Procedures Other duties as requested Minimum Job Requirements Three-year/Four-year college degree in computer science, or a related technical field. Two years of systems administration experience. Knowledge and experience administering various Windows and UNIX Operating Systems. Extensive knowledge and experience in LAN network engineering – TCP/IP, internet. Must have extensive knowledge and experience with HP and Sun UNIX platforms, as well as experience implementing UNIX and LAN security measures (including firewalls). In-depth understanding of System Administration methodology and principles. Must be a self-motivator, good working knowledge of common programming languages (C/C++, Java, PERL, RUBY). Worker Type Employee

Posted 4 days ago

Apply

4.0 years

3 - 7 Lacs

Gurgaon

On-site

About Alphanext Alphanext is a global talent solutions company with offices in London, Pune, and Indore. We connect top-tier technical talent with forward-thinking organizations to drive innovation and transformation through technology. Position Summary Alphanext is hiring an experienced SQL Server Database Administrator to manage and optimize Microsoft SQL Server environments hosted in AWS Cloud. The ideal candidate should be proficient in high availability configurations (Always On), automation (PowerShell, Tidal Workload Automation), and cloud-native operational tasks. Key Responsibilities Administer, configure, and maintain SQL Server environments 2019 and above. Manage Always On Availability Groups , including automated failover/failback during patching or OS upgrades. Automate DBA tasks and compliance monitoring using PowerShell scripting . Develop, schedule, and maintain SQL jobs via Tidal Workload Automation and REST APIs . Plan and execute SQL Server upgrades and migrations (2012/2014 to 2019), including project planning and downtime coordination. Monitor and optimize database performance, manage capacity, and resolve space issues, especially in AWS-hosted environments. Maintain version control for scripts using tools like Visual Studio Code and Bitbucket . Manage SSRS report migration , subscriptions, and deployment using tools like RSS Scripter . Collaborate with CDO and infrastructure teams to develop and maintain purging scripts and data archiving policies. Required Skills 4–6 years of SQL Server database administration experience. Hands-on experience with Always On Availability Groups , PowerShell scripting , and Tidal Workload Automation . Experience in SQL Server upgrade projects and familiarity with AWS RDS or EC2-based SQL deployments . Exposure to SSRS administration and report migrations. Strong analytical, troubleshooting, and performance tuning skills. Qualifications Bachelor//'s degree in Computer Science, Information Technology, or related field. 4+ years of relevant experience in database administration and infrastructure support.

Posted 4 days ago

Apply

5.0 years

6 - 7 Lacs

Greater Noida

On-site

Job Summary: We are seeking a skilled and passionate Azure Engineer L2 to join our growing team. should possess strong technical skills in Azure cloud services, particularly IaaS and PaaS, along with excellent problem-solving and communication abilities. They need hands-on experience in designing, implementing, and managing Azure solutions, including virtual machines, storage, networking, and potentially Windows Virtual Desktop. Experience with DevOps practices, automation, and troubleshooting is also highly desirable. Responsibilities: Users, OU, Security Groups & Permissions, Group Policy Creation and Management - BU Wise SSO integration and changes for applications Involve in all App and SAP cases related to AD Local DNS Management DHCP related issues, we have configured DHCP Failover (Active-Active) for DHCP load balancing Handson experience in powercell for bulk changes in Onpremise & Azure AD and Reports Left Users Data and server backup management Requirements: Bachelor's degree in Computer Science, IT, or a related field (preferred). Minimum 5 years of professional Azure Engineer L2 experience. Job Type: Full-time Pay: ₹600,000.00 - ₹700,000.00 per year Education: Bachelor's (Preferred) Experience: total work: 5 years (Preferred) Work Location: In person

Posted 4 days ago

Apply

5.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Hi All, Greetings! We are urgently hiring for one of our reputed clients in Pune- kharadi. Looking for a 5+ years of experience in below skills : Core expert of Windows Server 2016, 2019, 2021 Should be strong with Installation, Configuration and Management of Windows Servers Should have strong expertise with Cluster Management and Failover Clusters Identify Event Logs and Analyze these Logs for any incidents and problems Should have experience with concepts related to DHCP, DNS and basics of Active Directory (AD) Should be very strong with Implementing security policies, patch management, identifying Vulnerabilities, endpoint protection and isolation. Should have some experience with RDS: Managing and Maintaining Should have experience with PowerShell Scripting for Automation of tasks. Short joiners preferred please apply on : alisha.sh@peoplefy.com

Posted 5 days ago

Apply

5.0 - 8.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Role And Responsibilities EXP required - 5 to 8 years. Reporting to Engineering, the Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions, Payments and Capital Markets business. In this role, the candidate will have the opportunity to make a lasting impact on the company's transformation journey, drive customer-centric innovation and automation, and position the organization as a leader in the competitive banking, payments and investment landscape. Specifically, the Site Reliability Engineer will be responsible for the following: Design and maintain monitoring solutions and alerting mechanisms for infrastructure, application performance, and user experience metrics, enabling proactive issue detection and mitigation Implement automation tools and processes to automate routine tasks, scale infrastructure, and ensure seamless deployments, updates, and rollbacks with minimal user impact Ensure the reliability, availability, and performance of applications and services, focusing on minimizing downtime, optimizing response times, and maintaining high availability for users Lead incident response efforts for incidents, including identification, triage, resolution, and post-incident analysis to prevent recurrence and improve system resilience Conduct capacity planning, performance tuning, and resource optimization for environments, collaborating with development and operations teams to meet scalability and performance goals Collaborate with security teams to implement security best practices, perform vulnerability assessments, and ensure compliance with security standards and regulatory requirements for applications Manage deployment pipelines, release processes, and configuration management for app deployments, ensuring consistency, reliability, and version control across environments Identify areas for improvement in reliability, performance, and efficiency through data analysis, root cause analysis, and trend analysis, and drive initiatives to enhance system reliability and operational efficiency Create and maintain documentation, runbooks, and knowledge base articles for operational procedures, troubleshooting guides, and best practices, and promote knowledge sharing within the team Develop and test disaster recovery plans, backup strategies, and failover mechanisms for app services, ensuring business continuity and data integrity in case of failures or disasters Collaborate with development, QA, DevOps, and product teams to ensure alignment on reliability goals, performance metrics, release schedules, and incident response processes Participate in on-call rotations and provide 24/7 support for critical incidents, troubleshoot issues, and coordinate with teams for resolution, escalation, and follow-up actions as per defined SLAs Professional Qualifications Proficient in development technologies, architectures, and platforms (web, api) to understand system complexities and performance considerations Experience in cloud platforms (e.g., AWS, Azure, Google Cloud) and infrastructure as code (IaC) tools for managing app infrastructure and deployments Knowledge of monitoring tools (e.g., Prometheus, Grafana, DataDog, New Relic) and logging frameworks (e.g., Splunk, SumoLogic, ELK Stack) for real-time visibility into system health, performance metrics, and user experience Experience in incident management, including incident response, triage, root cause analysis (RCA), and post-mortem reviews to prevent recurring issues Strong troubleshooting skills to diagnose complex technical issues in app environments, infrastructure, networking, and performance bottlenecks Proficiency in scripting languages (e.g., Python, Bash) and automation tools (e.g., Terraform, Ansible) for automating routine tasks, deployments, and infrastructure management Experience in implementing continuous integration/continuous deployment (CI/CD) pipelines for apps using tools like Jenkins, GitLab CI/CD, or Azure DevOps Expertise in setting up monitoring solutions, configuring alerts, and creating dashboards to monitor system performance, application metrics, and user experience Familiarity with APM (Application Performance Monitoring) tools to analyze app performance, identify bottlenecks, and optimize resource utilization Familiarity with RUM (Real User Monitoring) for tracking and analyzing user interaction and system performance Commitment to continuous learning, staying updated with industry trends, new technologies, and best practices in app reliability, performance, and operations Adaptability to evolving requirements, technologies, and business needs, with a focus on driving continuous improvement and operational excellence Personal Characteristics Demonstrates judgment and flexibility; thinks about issues and develops solutions that thoughtfully take the broader context into account - positively deals with a shifting demand for time, priorities, and the rapid change of environments Takes an ownership approach to engineering and product outcomes Action-oriented self-starter who can set strategy and drive execution with a "roll up the sleeves" approach Excellent interpersonal communication, negotiation and influencing skills to work effectively with all stakeholders (internal & external), making information-based decisions Penchant for excellence, both personally and professionally, demonstrated by intellectual curiosity, record of accomplishment, and reputation; shows strong attention to detail and implementation of best practices with an inclination for continuous improvement Ability to quickly establish strong credibility with employees, business partners and external resources Embodies and delivers the firm's values and culture towards colleagues, clients, and communities: Win as one team Lead with integrity Be the change Benefits Talent Worx Is a emerging recruitment firm. we are hiring for our client who is in advance the way the world pays, banks, and invests. With decades of expertise, we provide financial technology solutions to financial institutions, businesses, and developer

Posted 5 days ago

Apply

7.0 years

0 Lacs

Kolkata, West Bengal, India

On-site

About the Organisation We are one of India’s leading AMISP (Advanced Metering Infrastructure Service Providers), manufacturing over 5 Lac smart energy meters monthly, supported by in-house teams for Design, Development, Validation, Software Engineering, and Managed Software Services. With a turnover of ₹600 Cr and rising, we are expanding into smart water and gas metering solutions. This position is based in Kolkata and offers a unique opportunity to be part of a data-intensive product ecosystem at scale. Position Overview We are looking for a hands-on, technically mature **Lead Data Platform Engineer** who thrives on architecting and optimizing time-series and high-throughput data platforms. You will own the end-to-end database architecture and engineering function with a sharp focus on PostgreSQL (TimescaleDB), data lifecycle performance, and advanced query optimization. Candidates from high-scale, fast-paced environments such as e-commerce, travel-tech, or dynamic startups will find this role familiar and challenging in the right measure. Suggested Designation Lead Data Platform Engineer – PostgreSQL & Big Data Solutions Key Responsibilities Design and optimize scalable PostgreSQL (Time Series) data architectures to manage billions of telemetry records. Develop and maintain high-performance data models, schemas, indexing strategies, and time-series data workflows. Ensure superior performance for time-bound analytics and search queries across large and partitioned datasets. Work with DevOps and cloud engineering teams to provision AWS-native or hybrid DB environments with cost efficiency. Collaborate closely with product and engineering teams to optimize DB interactions, ingestion pipelines, and data lifecycle policies. Champion coding standards, Postgre SQL practices, and peer reviews across the backend data layer. Act as the go-to expert for database architecture decisions and high-availability, failover strategies. Orchestrate & closely work with the Deployment & / or the Solutions teams for Optimisation of Resources as per the Project Needs. Required Skills & Experience 5–7 years of strong PostgreSQL (TimescaleDB) development experience in data-heavy environments. Prior experience as a Database Architect designing data platforms handling high-volume ingestion and query loads. Hands-on expertise in query optimization, indexing, and partitioning strategies. Sound scripting knowledge in SQL, Python or Bash for automation and integration. 1–2 years’ experience working on AWS RDS, Aurora, or equivalent managed DB platforms. Exposure to ElasticSearch, Redis, Kafka, or other supporting high-throughput technologies is a plus. Strong grounding in techniques relevant to large data platforms. Proficiency in schema evolution, data archival techniques, and long-term retention architecture. Good understanding of security, access control, and encryption best practices in cloud-hosted environments. Preferred Background Hands-on developer-oriented DBA, not just a database manager or administrator. Experience in companies like E-commerce, Travel, Logistics, Food Tech industries, allied-Startups etc. Flipkart, Amazon, MakeMyTrip, Yatra, or other high-scale startups preferred. Bachelor’s or Master’s degree in Computer Science, IT, or allied fields from a reputed institution. Values-driven individual with attention to data integrity, performance, and scalability. Authority & Strategic Impact Own the data platform’s performance, uptime, and design direction. Make authoritative calls on data modeling, indexing, and schema management. Collaborate with software architects and customer IT teams for scalable DB strategies. Contribute to the cloud migration and optimization roadmap with cross-functional stakeholders. Mentor junior developers and database engineers within the platform team.

Posted 5 days ago

Apply

0 years

0 Lacs

Gurugram, Haryana, India

On-site

Backend & MLOps Engineer – Integration, API, and Infrastructure Expert 1.⁠ ⁠Role Objective: Responsible for building robust backend infrastructure, managing ML operations, and creating scalable APIs for AI applications. Must excel in deploying and maintaining AI products in production environments with high availability and security standards. The engineer will be expected to build secure, scalable backend systems that integrate AI models into services (REST, gRPC), manage data pipelines, enable model versioning, and deploy containerized applications in secure (air-gapped) Naval infrastructure. 2.⁠ ⁠Key Responsibilities: 2.1. Create RESTful and/or gRPC APIs for model services. 2.2. Containerize AI applications and maintain Kubernetes-compatible Docker images. 2.3. Develop CI/CD pipelines for model training and deployment. 2.4. Integrate models as microservices using TorchServe, Triton, or FastAPI. 2.5. Implement observability (metrics, logs, alerts) for deployed AI pipelines. 2.6. Build secured data ingestion and processing workflows (ETL/ELT). 2.7. Optimize deployments for CPU/GPU performance, power efficiency, and memory usage 3.⁠ ⁠Educational Qualifications Essential Requirements: 3.1. B.Tech/ M.Tech in Computer Science, Information Technology, or Software Engineering. 3.2. Strong foundation in distributed systems, databases, and cloud computing. 3.3. Minimum 70% marks or 7.5 CGPA in relevant disciplines. Professional Certifications: 3.4. AWS Solutions Architect/DevOps Engineer Professional 3.5. Google Cloud Professional ML Engineer or DevOps Engineer 3.6. Azure AI Engineer or DevOps Engineer Expert. 3.7. Kubernetes Administrator (CKA) or Developer (CKAD). 3.8. Docker Certified Associate Core Skills & Tools 4.⁠ ⁠Backend Development: 4.1. Languages: Python, FastAPI, Flask, Go, Java, Node.js, Rust (for performance-critical components) 4.2. Web Frameworks: FastAPI, Django, Flask, Spring Boot, Express.js. 4.3. API Development: RESTful APIs, GraphQL, gRPC, WebSocket connections. 4.4. Authentication & Security: OAuth 2.0, JWT, API rate limiting, encryption protocols. 5.⁠ ⁠MLOps & Model Management: 5.1. ML Platforms: MLflow, Kubeflow, Apache Airflow, Prefect 5.2. Model Serving: TensorFlow Serving, TorchServe, ONNX Runtime, NVIDIA Triton, BentoML 5.3. Experiment Tracking: Weights & Biases, Neptune, ClearML 5.4. Feature Stores: Feast, Tecton, Amazon SageMaker Feature Store 5.5. Model Monitoring: Evidently AI, Arize, Fiddler, custom monitoring solutions 6.⁠ ⁠Infrastructure & DevOps: 6.1. Containerization: Docker, Podman, container optimization. 6.2. Orchestration: Kubernetes, Docker Swarm, OpenShift. 6.3. Cloud Platforms: AWS, Google Cloud, Azure (multi-cloud expertise preferred). 6.4. Infrastructure as Code: Terraform, CloudFormation, Pulumi, Ansible. 6.5. CI/CD: Jenkins, GitLab CI, GitHub Actions, ArgoCD. 6.6. DevOps & Infra: Docker, Kubernetes, NGINX, GitHub Actions, Jenkins. 7.⁠ ⁠Database & Storage: 7.1. Relational: PostgreSQL, MySQL, Oracle (for enterprise applications) 7.2. NoSQL: MongoDB, Cassandra, Redis, Elasticsearch 7.3. Vector Databases: Pinecone, Weaviate, Chroma, Milvus 7.4. Data Lakes: Apache Spark, Hadoop, Delta Lake, Apache Iceberg 7.5. Object Storage: AWS S3, Google Cloud Storage, MinIO 7.6. Backend: Python (FastAPI, Flask), Node.js (optional) 7.7. DevOps & Infra: Docker, Kubernetes, NGINX, GitHub Actions, Jenkins 8.⁠ ⁠Secure Deployment: 8.1. Military-grade security protocols and compliance 8.2. Air-gapped deployment capabilities 8.3. Encrypted data transmission and storage 8.4. Role-based access control (RBAC) & IDAM integration 8.5. Audit logging and compliance reporting 9.⁠ ⁠Edge Computing: 9.1. Deployment on naval vessels with air gapped connectivity. 9.2. Optimization of applications for resource-constrained environment. 10.⁠ ⁠High Availability Systems: 10.1. Mission-critical system design with 99.9% uptime. 10.2. Disaster recovery and backup strategies. 10.3. Load balancing and auto-scaling. 10.4. Failover mechanisms for critical operations. 11.⁠ ⁠Cross-Compatibility Requirements: 11.1. Define and expose APIs in a documented, frontend-consumable format (Swagger/OpenAPI). 11.2. Develop model loaders for AI Engineer's ONNX/ serialized models. 11.3. Provide UI developers with test environments, mock data, and endpoints. 11.4. Support frontend debugging, edge deployment bundling, and user role enforcement. 12.⁠ ⁠Experience Requirements 12.1. Production experience with cloud platforms and containerization. 12.2. Experience building and maintaining APIs serving millions of requests. 12.3. Knowledge of database optimization and performance tuning. 12.4. Experience with monitoring and alerting systems. 12.5. Architected and deployed large-scale distributed systems. 12.6. Led infrastructure migration or modernization projects. 12.7. Experience with multi-region deployments and disaster recovery. 12.8. Track record of optimizing system performance and cost

Posted 5 days ago

Apply

10.0 years

0 Lacs

Gurugram, Haryana, India

On-site

Purpose of the Role Yum! Brands’ Administration Division is looking for a dynamic candidate who is responsible for the efficient operation and maintenance of the JDE EnterpriseOne ERP system at Yum. This role involves managing JDE system administration, JDE security/Sox, JDE installing/updates, and providing JDE technical support for users globally. Responsibilities System Administration: Manage Yum's JDE EnterpriseOne ERP for all system administration and CNC activities, including managing SQL servers and Windows servers in Azure along with server and DBA teams. Technical Support: Provide overall JDE technical support for the JDE functional team and end-users during specific hours. Collaboration: Work closely with JDE functional team, customers, and external partners to ensure seamless integrations between JDE and external applications. System Monitoring and Performance Tuning: Ensuring the efficient operation of JD Edwards EnterpriseOne systems by monitoring system performance and tuning it as necessary Project Involvement: Lead projects working with in-country subject matter experts and third-party consultants. Identify the best practice solutions to broaden the functionality and benefits derived from the JDE E1 ERP system install base. Provide business process analysis and JDE E1 application configuration expertise. Technical Expertise: Provide conversion and interface expertise for new market and new module installations on E1. Develop and implement the best practice solutions for business processes and integration through the utilization of E1 functionality. Mandatory Skills 4 – 10 years of experience in Database Administration and System Management. Hands on experience in JDE installation, updates, and upgrades. Extensive experience in JDE Security administration. Proficiency in MS SQL Server Administration and T-SQL Scripts Exposure on managing Microsoft Windows servers in Azure Cloud. Extensive experience in Oracle WebLogic server installation, patching, and management. Strong knowledge of JDE Orchestrator and Rest APIs. Proficiency in JDE development and functional knowledge. Exposure on developing and deploying SQL SSIS/ETL packages. Hands on experience in Disaster Recovery and Failover best practices. Proficiency in Networking and Firewalls. Knowledge with ServiceNow, ReportsNow, Krise, and Jams Scheduler is a plus.

Posted 5 days ago

Apply

0 years

0 Lacs

India

On-site

The System Engineer is responsible for designing, implementing, maintaining, and supporting IT infrastructure systems. This includes servers, networks, and cloud environments, ensuring systems are optimized for performance, security, and reliability. The role involves both hands-on technical work and collaboration with other IT and business units. Design, configure, and manage server and network infrastructure (physical and virtual environments). Install, upgrade, and maintain operating systems (Windows, Linux, etc.) and system software. Monitor system performance, identify issues, and implement solutions to ensure high availability and performance. Manage system backups, disaster recovery plans, and failover procedures. Implement and manage security protocols, access controls, and compliance measures. Coordinate with development, IT, and support teams to ensure system compatibility and efficiency. Automate system tasks using scripting tools (PowerShell, Bash, etc.). Participate in capacity planning, performance tuning, and future system upgrades. Create and maintain detailed documentation of system configurations and procedures. Stay updated with the latest industry trends and technologies. Proficiency in managing Windows and/or Linux server environments. Experience with virtualization technologies (VMware, Hyper-V). Familiarity with cloud platforms (AWS, Azure, Google Cloud). Solid understanding of networking concepts (DNS, DHCP, TCP/IP, VPN). Knowledge of cybersecurity principles and system hardening. Experience with monitoring tools (Nagios, Zabbix, SolarWinds). Strong analytical, problem-solving, and troubleshooting skills. Excellent communication and documentation skills.

Posted 5 days ago

Apply

3.0 years

0 Lacs

Gurugram, Haryana, India

On-site

Project Role : Operations Engineer Project Role Description : Support the operations and/or manage delivery for production systems and services based on operational requirements and service agreement. Must have skills : Microsoft Windows Server Administration Good to have skills : NA Minimum 3 Year(s) Of Experience Is Required Educational Qualification : 15 years full time education Summary: As an Operations Engineer, you will support the operations and/or manage delivery of production systems and services based on operational requirements and service agreement. Your typical day will involve ensuring the smooth functioning of production systems and services, addressing operational requirements, and adhering to service agreements. Roles & Responsibilities: 1. Windows Clustering Setup and Configuration Cluster Monitoring Failover management Resource management Vertical and Horizontal Scaling Troubleshoot issues 2. Windows storage management skills 3. Microsoft Windows Server Administration (OS Windows 2016, 2019, 2022) 4. Required active participation/contribution in team discussions. 5. Manage and monitor production systems to ensure optimal performance. 6. Maintain SLA. 7. Implement and maintain system configurations. 8. Inter and Intra team Collaborations for service delivery. 9. Document operational processes and procedures for future reference. Professional & Technical Skills: Strong knowledge of Windows Clusters hosted on Public and Private cloud infrastructures. Strong skill to read cluster logs, diagnose and resolve problems related to cluster communication, storage access, and application failover. Understanding of storage technologies and how to configure shared storage for a cluster. Operational Knowledge of Public Cloud Technologies – AWS / Azure / OCI Must Have Skills: Proficiency in Microsoft Windows Server Administration. Strong understanding of system administration principles. Experience with system monitoring and performance tuning. Knowledge of network protocols and security measures. Good To Have Skills: Experience with cloud platforms like Azure or AWS. Additional Information: The candidate should have a minimum of 3 years of experience in Microsoft Windows Server Administration. Team Player, Good Communication skills, Ability to multitask and adapt to shifting priorities, 24X7 A 15 year full-time education is required., 15 years full time education

Posted 5 days ago

Apply

0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Sr.Disaster Recovery and Backup Engineer Exp : 7+ Yrs Level : L3 Location : Hyderabad Overview: The Disaster Recovery and Backup Engineer is responsible for planning, implementing, and overseeing the organization's disaster recovery (DR) and backup strategies to ensure the integrity, availability, and security of critical data and systems. This role will ensure appropriate measures are in place to recover systems and data during disruptions, disasters, or data loss. The individual will work closely with IT teams, management, and external vendors to ensure compliance with industry standards and internal policies. Responsibilities: Include, but not limited to: • Disaster Recovery Planning: o Develop, maintain, and update disaster recovery plans (DRP) for all critical systems and data. Conduct risk assessments and business impact analyses to identify potential threats and vulnerabilities o Coordinate with IT teams to ensure DR plans are aligned with infrastructure and network configurations o Implement and update disaster recovery strategies to minimize downtime in a disaster • Backup Strategy Management: o Design and implement comprehensive backup and restoration strategies for all systems, data, and applications o Ensure regular backups are scheduled, performed, and validated for accuracy and completeness o Maintain backup retention policies in line with organizational and regulatory requirements. Exercises backup systems regularly to ensure data can be restored promptly and efficiently • Exercises and Audits: o Coordinate and execute regular disaster recovery and backup exercises, including failover and restoration procedure o Ensure exercise results are documented, and issues are addressed to improve processes o Maintain compliance with relevant laws and regulations through regular audits and reporting o Ensure backup and disaster recovery solutions are always audit-ready • Collaboration and Communication: o Collaborate with cross-functional teams to identify critical systems and applications for DR planning o Work with vendors, external consultants, and third-party service providers to enhance disaster recovery capabilities o Communicate with senior management and key stakeholders to ensure alignment on recovery strategies o Provide training and awareness sessions to staff regarding disaster recovery and backup protocols • Incident Response: o Act as the primary IT point of contact during disaster recovery and backup-related incidents o Coordinate disaster recovery efforts to ensure minimal downtime and data loss o Work closely with IT teams to troubleshoot and resolve backup failures and system outages • Documentation and Reporting: o Maintain comprehensive documentation of disaster recovery and backup plans, exercise results, and recovery efforts o Create detailed reports for management outlining the status of disaster recovery readiness and any gaps in current strategies—track and report on recovery time objectives (RTOs) and recovery point objectives (RPOs) Please share your cv to annapurna.t@locuz.com

Posted 5 days ago

Apply

5.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Velotio Technologies is a product engineering company working with innovative startups and enterprises. We are a certified Great Place to Work® and recognized as one of the best companies to work for in India. We have provided full-stack product development for 110+ startups across the globe building products in the cloud-native, data engineering, B2B SaaS, IoT & Machine Learning space. Our team of 400+ elite software engineers solves hard technical problems while transforming customer ideas into successful products. Requirements Design and implement secure cloud infrastructure Hands on expertise in conducting Business Impact Analysis, creating Business Continuity & Disaster Recovery Plans, conducting Tabletop Exercises, and developing failover solutions in public cloud (Azure) Expertise in Cloud Solutions, Azure AD, Azure WVD, Cloud Run, Cloud IAM, Kubernetes, Containers, Terraform, Azure DevOps, Python Hands-on experience with developing automation scripts/pipelines for DR failover and recovery Working experience of data replication strategies, virtual platforms, and other technologies vital to recovery and continuity goals Production experience in Azure IaaS, PaaS, networking, Azure functions, Azure automation and runbooks, insights, Security Center, Azure Monitor, and Log Analytics Hands-on experience with IaaC and infrastructure deployment and configuration using automated tools such as Terraform, Ansible, or CloudFormation, ADO, ARM, Bicep, Ansible, PowerShell, Python, and Azure CLI Technical and operational expertise in Windows/Linux/AKS, SQL and No-SQL DB's, IaaS, PaaS, Data, BCDR, Security, Management, Storage, Networking, Monitoring, Identity, and Connectivity Good understanding of Azure Virtual Network, VWAN, Express route, Load Balancer (L4/L7), Traffic Manager, CDN, Azure DNS, routing & routing protocols, firewall concepts Desired Skills More than 5 years of experience as a Devops/SRE focused on distributed global infrastructure Experience in Azure Governance, Security, Monitoring, Workbooks, Compliance, and cost awareness Experience in Azure Virtual Machines, Containers, and/or Kubernetes (infrastructure perspective) Good understanding of Azure Storage Account, Disk, Snapshot, Backup, Site Recovery, file sync, Data Lake Automate and optimize continuous integration and delivery (CI/CD) pipelines Experience working with modern DevOps tooling, understanding concepts/tooling such as Infrastructure as Code (Terraform), Docker Orchestration, service discovery, secrets management, etc Work with cross-functional teams that include developers, site reliability engineers, Azure administrators, and security engineers Continuously monitor and troubleshoot application performance using Azure Monitoring tools Benefits Our Culture : We have an autonomous and empowered work culture encouraging individuals to take ownership and grow quickly Flat hierarchy with fast decision making and a startup-oriented "get things done" culture A strong, fun & positive environment with regular celebrations of our success. We pride ourselves in creating an inclusive, diverse & authentic environment At Velotio, we embrace diversity. Inclusion is a priority for us, and we are eager to foster an environment where everyone feels valued. We welcome applications regardless of ethnicity or cultural background, age, gender, nationality, religion, disability or sexual orientation.

Posted 6 days ago

Apply

6.0 - 8.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Job Summary As a DevOps Engineer, following primary and secondary skills are required. Strategy Adhere to technology roadmap for CEE Hive delivery Adopt the bank’s technology strategy and drive within our programs/ projects. Guide new ideas through the ideation and design process to ensure they are sufficiently defined to address and meet strategic goals. Build strong relationship with production support teams and SRE Ensure tech Obsolescence across CEE Hive is remediated with no impact to stability. Business Manage good relationship with stakeholders in Development, Quality Assurance, PSS, Technology Services and Architecture teams. Work with Development team and develop ADO CI/CD pipeline, deploy applications in non-production environments Work with Quality Assurance Function and Non-Functional teams and ensure deployments are completed in non-production (SIT, UAT, Regression and PT) within half-a-day Grant access in RBAC based on tickets raised by various teams Ensure Certificates in CEE hive applications are update to date and renewal of certificate 1 month before expiry Processes Adhere to ADO and Bank defined principles and guidelines on all Program delivery. Compliance on ICS guidelines, Security and Data protection Compliant to SDF/SIA process and drive bank towards automating process areas removing redundancies. Compliant SCB Group code of conduct and standards Compliance on CCIB Design and architecture guidelines, Data governance and policies Key Responsibilities People & Talent Be the face of the bank to your teams and communicate on things happening in department, bank, locale to drive the best results Be the first person to learn modern technologies proposed by SCB and implement the same in existing application CI/CD pipelines Risk Management Identity and highlight risks in the process to Squad Lead Governance Must be aware of the Group’s regulatory framework and adhere to it. Must understand the oversight and controls related to Business Unit, Job Function and deliver. Regulatory & Business Conduct Display exemplary conduct and live by the Group’s Values and Code of Conduct. Take personal responsibility for embedding the highest standards of ethics, including regulatory and business conduct, across Standard Chartered Bank. This includes understanding and ensuring compliance with, in letter and spirit, all applicable laws, regulations, guidelines and the Group Code of Conduct. Lead to achieve the outcomes set out in the Bank’s Conduct Principles: [Fair Outcomes for Clients; Effective Financial Markets; Financial Crime Compliance; The Right Environment.] Effectively and collaboratively identify, escalate, mitigate and resolve risk, conduct and compliance matters. Serve as a Director of the Board of [insert name of entities] Exercise authorities delegated by the Board of Directors and act in accordance with Articles of Association (or equivalent) Key stakeholders CEE Hive ITO, Solution Architect, Development teams, QA teams, SRE / PSS, Vendors related to CLDM and Interfacing systems Other Responsibilities Embed Here for good and Group’s brand and values in ; Perform other responsibilities assigned under Group, Country, Business or Functional policies and procedures; Multiple functions (double hats). Skills And Experience PRIMARY SKILLS 6 to 8 years of experience in DevOps Hands-on experience in Ansible for automating software provisioning, configuration management, and application deployments Hands-on experience in windows PowerShell scripting to automate deployments in Windows OS Strong OS fundamentals and hands-on skills in ANY - Linux/Unix/Windows Excellent python/bash scripting fundamentals High proficiency with application containerization and cluster management (docker, Kubernetes, OpenShift) Experience in scalability, failover, high-availability, memory, IO and CPU profiling Hands-on experience in installation/configuration/administration in any web servers and J2EE compliant servers In-depth knowledge of build/release systems and hands-on experience in developing and managing CI/CD pipelines – Bitbucket, Jenkins, Sonarqube, artifactory, App Scans tools Hands-on experience in implementing monitoring tools in ANY - AppDynamics, sysdig, Elasticsearch, Grafana, prometheus Understanding common network protocols and services (DNS, HTTP(S), SSH, FTP, SMTP) Hands-on experience on any major cloud platforms (AWS, Azure) Hands-on experience in implementing infrastructure-as-a-code with terraform Hands-on experience in supporting and manage database deployments (Oracle, postgres) Secondary Skills Hands-on experience in Kubernetes Internals and Administration. Experience with OpenShift Platform (Deployments, Objects creation, Storage, Kube Administration) Strong Observability Skills (Logging, Monitoring, Troubleshooting, Alert Notifications related aspects) – EFK/ELK Stack, Prometheus/Grafana Dashboard creation and visualization of metrics Qualifications SKILLS AND COMPETENCIES Ansible Kubernetes and Helm Chart Unix Shell Scripting PowerShell Scripting Core Java SQL in one of the databases (Oracle, Postgres, MySQL, etc) Python Any Web / App Server Azure DevOps CI/CD pipeline Git Jenkins ElasticSearch / LogStash / Kibana Grafana Prometheus About Standard Chartered We're an international bank, nimble enough to act, big enough for impact. For more than 170 years, we've worked to make a positive difference for our clients, communities, and each other. We question the status quo, love a challenge and enjoy finding new opportunities to grow and do better than before. If you're looking for a career with purpose and you want to work for a bank making a difference, we want to hear from you. You can count on us to celebrate your unique talents and we can't wait to see the talents you can bring us. Our purpose, to drive commerce and prosperity through our unique diversity, together with our brand promise, to be here for good are achieved by how we each live our valued behaviours. When you work with us, you'll see how we value difference and advocate inclusion. Together We Do the right thing and are assertive, challenge one another, and live with integrity, while putting the client at the heart of what we do Never settle, continuously striving to improve and innovate, keeping things simple and learning from doing well, and not so well Are better together, we can be ourselves, be inclusive, see more good in others, and work collectively to build for the long term What We Offer In line with our Fair Pay Charter, we offer a competitive salary and benefits to support your mental, physical, financial and social wellbeing. Core bank funding for retirement savings, medical and life insurance, with flexible and voluntary benefits available in some locations. Time-off including annual leave, parental/maternity (20 weeks), sabbatical (12 months maximum) and volunteering leave (3 days), along with minimum global standards for annual and public holiday, which is combined to 30 days minimum. Flexible working options based around home and office locations, with flexible working patterns. Proactive wellbeing support through Unmind, a market-leading digital wellbeing platform, development courses for resilience and other human skills, global Employee Assistance Programme, sick leave, mental health first-aiders and all sorts of self-help toolkits A continuous learning culture to support your growth, with opportunities to reskill and upskill and access to physical, virtual and digital learning. Being part of an inclusive and values driven organisation, one that embraces and celebrates our unique diversity, across our teams, business functions and geographies - everyone feels respected and can realise their full potential.

Posted 6 days ago

Apply

8.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Avant de postuler à un emploi, sélectionnez votre langue de préférence parmi les options disponibles en haut à droite de cette page. Découvrez votre prochaine opportunité au sein d'une organisation qui compte parmi les 500 plus importantes entreprises mondiales. Envisagez des opportunités innovantes, découvrez notre culture enrichissante et travaillez avec des équipes talentueuses qui vous poussent à vous développer chaque jour. Nous savons ce qu’il faut faire pour diriger UPS vers l'avenir : des personnes passionnées dotées d’une combinaison unique de compétences. Si vous avez les qualités, de la motivation, de l'autonomie ou le leadership pour diriger des équipes, il existe des postes adaptés à vos aspirations et à vos compétences d'aujourd'hui et de demain. Job Summary Fiche de poste : We are seeking a skilled and proactive Site Reliability Engineer (SRE) with 5–8 years of experience and deep expertise in Google Cloud Platform (GCP) . The ideal candidate will be responsible for the reliability, availability, and performance of cloud-based applications and infrastructure. You will collaborate with development, operations, and security teams to build and maintain scalable, secure, and highly available systems. Key Responsibilities Design, develop, and maintain reliable, scalable, and highly available systems on GCP. Build and manage CI/CD pipelines, infrastructure as code (IaC), and monitoring solutions. Proactively monitor and manage system performance, uptime, and capacity using observability tools. Troubleshoot and resolve infrastructure and application-level issues in real-time. Implement and maintain disaster recovery, failover mechanisms, and backup strategies. Automate repetitive tasks and processes to improve efficiency and reduce toil. Participate in on-call rotations, incident management, and root cause analysis (RCA). Ensure compliance with security standards, privacy regulations, and governance policies. Collaborate with cross-functional teams to support DevOps and SRE best practices. Drive improvements in SLAs, SLOs, and error budgets through data-driven insights. Required Qualifications 5–8 years of relevant experience as an SRE, DevOps Engineer, or Cloud Infrastructure Engineer. Strong hands-on experience with Google Cloud Platform (GCP) – Compute Engine, GKE, Cloud Functions, Cloud Storage, IAM, BigQuery, etc. Proficiency in Infrastructure as Code tools like Terraform, Deployment Manager, or CloudFormation. Experience with Kubernetes, Docker, and container orchestration. Proficiency in scripting languages like Python, Shell, or Go. Deep understanding of monitoring and logging tools such as Prometheus, Grafana, Stackdriver, or Datadog. Knowledge of CI/CD tools such as Jenkins, GitLab CI, or Cloud Build. Experience with incident response, postmortem analysis, and site reliability principles. Strong problem-solving and communication skills. Preferred Qualifications GCP certifications (e.g., Professional Cloud DevOps Engineer, Cloud Architect). Exposure to multi-cloud environments or hybrid cloud infrastructure. Familiarity with Agile and ITIL frameworks. Experience working in regulated environments with compliance standards (e.g., ISO, SOC2). Type De Contrat en CDI Chez UPS, égalité des chances, traitement équitable et environnement de travail inclusif sont des valeurs clefs auxquelles nous sommes attachés.

Posted 6 days ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies