Key Responsibilities
System Architecture & Deployment:
- Design, implement, and manage highly available, scalable, and secure Linux infrastructure solutions (e.g., clustered services, load balancers, container orchestration platforms).
- Lead the migration and upgrade of complex production environments.
- Capacity planning and performance tuning for critical systems and applications.
Advanced Troubleshooting & Incident Management
- Serve as the final escalation point for complex, system-wide performance, security, or stability issues that Level 1/2 staff cannot resolve.
- Conduct deep-dive root cause analysis (RCA) for major incidents, including kernel-level tracing, memory/disk/network analysis, and performance bottlenecks.
- Implement proactive monitoring and alerting strategies.
Security & Compliance
- Implement and maintain robust security hardening standards (e.g., SELinux/AppArmor, firewalls, intrusion detection).
- Ensure compliance with industry standards (e.g., HIPAA, PCI-DSS, ISO 27001).
3rd Party Application Lifecycle Management
- Lead the installation, configuration, and optimization of complex, multi-tier third-party applications (e.g., monitoring suites like Prometheus/Grafana, logging platforms like ELK/Loki, middleware, or proprietary vendor software).
- Securely integrate new applications into the existing infrastructure, managing dependencies, library requirements, and centralized authentication mechanisms (SSO/SAML/OAuth).
- Develop and manage application configuration templates and deployment scripts to ensure consistency across development, staging, and production environments.
- Troubleshooting and Performance Tuning: Deeply analyze application performance using tools like strace, lsof, tcpdump, and application-specific metrics. Work directly with application vendors to troubleshoot complex bugs, apply patches, and coordinate necessary system-level changes.
Automation & Infrastructure As Code (IaC)
- Develop, maintain, and enforce automation standards using tools like Ansible (a plus)
Required Skills And Qualifications
Category
Must-Have Experience
Operating SystemsExpert-level proficiency in multiple distributions (RHEL/CentOS, Ubuntu/Debian), including kernel tuning, patching, and compilation.Virtualization/CloudExpert knowledge of hypervisors (e.g., VMware vSphere, KVM) and extensive experience with a major public cloud provider (AWS, Azure, or GCP).NetworkingDeep understanding of advanced networking concepts (e.g., TCP/IP stack, BGP, OSPF, VLANs, software-defined networking, load balancing).DatabasesStrong familiarity with common databases (e.g., PostgreSQL, MySQL, MongoDB) and deployment/HA best practices.
🎓 Minimum Experience/Education
- Education: Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent practical experience).
- Experience: 8+ years
- Work From office job
- Location- Delhi/NCR
Skills: networking,gcp,aws,,hypervisor,linux admin,infrastructure,application configuration