Operational Support: Provide day-to-day administration, monitoring, and support for all SQL Server editions (2014, 2016, and later). Manage database performance tuning, backup/recovery, patching, and upgrades. Ensure database availability, resiliency, and security compliance. Handle database incidents (INC), service requests, and change requests. Provide 24x7 production support, including weekend on-call rotation.
Database Administration: Install, configure, upgrade, and patch Microsoft SQL Server (all editions, 2014 and later). Monitor database performance, tune queries and indexes, and optimize execution plans. Troubleshoot database issues, errors, and performance bottlenecks. Work closely with application teams to support new server and database builds, maintenance job creation, and Change Data Capture (CDC) configuration.
Multi-cloud Database Modernization: Support and administer SQL Server migrations to multiple cloud platforms. Support both homogeneous and heterogeneous database migrations. Assist with database replication, DR solutions, and cross-cloud failover planning.
Automation of Operational Tasks: Develop and maintain scripts, pipelines, and Infrastructure-as-Code (IaC) templates using Bicep, ARM, Terraform, Ansible, Shell, and Python to automate routine database tasks. Implement AIOps practices for proactive issue detection, anomaly detection, and predictive alerting.
Collaboration & Continuous Improvement: Work closely with operations, application, and cloud engineering teams to deliver reliable database services. Drive innovation through automation, monitoring, and AI-driven solutions to reduce manual effort. Document best practices, runbooks, and operational procedures.
Capacity Planning & Scalability: Forecast database growth, manage resource utilization, and ensure scalability to handle increasing workloads while maintaining performance.
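As an illustration of the kind of growth forecasting this involves, the following minimal sketch fits a straight line to historical database size samples and estimates how many days remain before a capacity limit is reached. The function name `days_until_full`, the sample data, and the linear-growth assumption are hypothetical and chosen only for illustration; real workloads may call for a different model.

```python
def days_until_full(samples, capacity_gb):
    """Estimate days from the last sample until capacity_gb is reached.

    samples: list of (day_index, size_gb) pairs (hypothetical input format).
    Uses an ordinary least-squares line; returns None if size is not growing.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in samples)
    if sxx == 0:
        raise ValueError("samples must span more than one day")
    slope = sum((x - mean_x) * (y - mean_y) for x, y in samples) / sxx
    if slope <= 0:
        return None  # flat or shrinking: no projected exhaustion
    intercept = mean_y - slope * mean_x
    last_day = max(x for x, _ in samples)
    full_day = (capacity_gb - intercept) / slope
    return max(0.0, full_day - last_day)


# Example with made-up measurements: 10 GB/day growth toward a 200 GB volume.
history = [(0, 100.0), (1, 110.0), (2, 120.0), (3, 130.0)]
remaining = days_until_full(history, capacity_gb=200.0)
```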
Database Security & Compliance: Ensure compliance with enterprise security policies (e.g., encryption, access control, auditing, vulnerability scans). Adhere to Lilly security standards as required. Grant customer accounts only the minimum permissions they need and manage those permissions.
Backup & Recovery Management: Design, automate, and test robust backup and restore processes (full, differential, and transaction log backups), storing backup copies in PPDM (Dell storage solution). Regularly validate that recovery time objective (RTO) and recovery point objective (RPO) targets are met. Ensure high availability of PPDM storage and backups.
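One way to picture RPO validation is as a check on the gaps between consecutive backups: any gap longer than the RPO is a window in which data could be lost beyond the agreed limit. The sketch below is a minimal example under that assumption; the function name `rpo_violations` and the timestamps are hypothetical, and in practice the backup times would come from backup history rather than a hard-coded list.

```python
from datetime import datetime, timedelta


def rpo_violations(backup_times, rpo, now):
    """Return (start, end) gaps between consecutive backups that exceed rpo.

    The gap between the most recent backup and `now` is checked as well.
    """
    if not backup_times:
        raise ValueError("no backups recorded")
    points = sorted(backup_times) + [now]
    return [(a, b) for a, b in zip(points, points[1:]) if b - a > rpo]


# Example with made-up log backup times and a 30-minute RPO.
backups = [
    datetime(2024, 1, 1, 0, 0),
    datetime(2024, 1, 1, 0, 15),
    datetime(2024, 1, 1, 1, 0),
]
gaps = rpo_violations(backups, timedelta(minutes=30), datetime(2024, 1, 1, 1, 10))
```

Here the only gap reported is the 45-minute stretch between the second and third backups.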
Cost Optimization & Resource Efficiency: Optimize SQL resource allocation across both on-premises and cloud environments (Azure SQL DB, Managed Instance, IaaS VMs). Ensure cost efficiency by monitoring and right-sizing workloads.
SLA & SRE Principles Enforcement: Define and enforce SLIs (Service Level Indicators), SLOs (Service Level Objectives), and SLAs (Service Level Agreements) for SQL services, aligning with SRE principles.
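To make the SLO idea concrete, an availability target can be converted into an error budget, that is, the amount of downtime the service may accumulate in a window before the SLO is breached. The following is a minimal sketch; the function names, the 30-day window, and the 99.9% target are illustrative assumptions, not values taken from this role description.

```python
def error_budget_minutes(slo_target, window_days=30):
    """Allowed downtime in minutes for an availability SLO over a window."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be a fraction between 0 and 1")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target, downtime_minutes, window_days=30):
    """Fraction of the error budget still unspent (negative if overspent)."""
    budget = error_budget_minutes(slo_target, window_days)
    return (budget - downtime_minutes) / budget


# Example: a 99.9% availability SLO over 30 days allows about 43.2 minutes
# of downtime; 10.8 minutes of recorded downtime leaves about 75% of it.
budget = error_budget_minutes(0.999)
left = budget_remaining(0.999, downtime_minutes=10.8)
```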
Disaster Recovery (DR) Drills & Testing: Work with the DR team to perform regular DR drills that validate business continuity, ensuring systems can fail over and recover as expected.
Documentation & Knowledge Sharing: Maintain detailed documentation of database configurations, operational runbooks, and troubleshooting guides. Share knowledge across teams to reduce operational silos.