Whiteklay

1 Job openings at Whiteklay
Apache Druid Administrator – Distributed Systems Specialist pune,maharashtra,india 3 years None Not disclosed On-site Full Time

Experience: 3+ years Location: Pune Employment Type: Full-time Position Overview : We’re seeking a seasoned Apache Druid Administrator with deep experience in managing distributed systems. You’ll be responsible for deploying, maintaining, and optimizing Druid clusters to support mission-critical analytics workloads. This role demands hands-on expertise in infrastructure orchestration, performance tuning, and cross-functional collaboration with data engineering and DevOps teams. Responsibilities: Cluster Management: Deploy, configure, and maintain Apache Druid clusters across cloud and on-prem environments. Performance Optimization: Monitor query performance, ingestion pipelines, and system health; implement tuning strategies for sub-second latency. Distributed Systems Oversight: Manage fault-tolerant, horizontally scalable architectures; ensure high availability and disaster recovery. Security & Governance: Implement role-based access control, encryption, and audit logging in compliance with enterprise standards. Automation & CI/CD: Build and maintain automated deployment pipelines using tools like Terraform, Ansible, or Helm. Monitoring & Alerting: Integrate with observability stacks (Prometheus, Grafana, ELK) to proactively detect and resolve issues. Collaboration: Work closely with data engineers, architects, and product teams to align Druid capabilities with business needs. Documentation & SOPs: Create and maintain operational playbooks, SOPs, and knowledge base articles for internal teams. Required Qualifications: 3+ years of experience in administering distributed systems or real-time analytics platforms. 1+ years of hands-on experience with Apache Druid in production environments. Strong understanding of Druid internals: segment management, indexing services, query nodes, and deep storage. Proficiency in Linux, shell scripting, and container orchestration (Kubernetes preferred). Experience with cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code. Familiarity with Kafka, Zookeeper, and other components in the Druid ecosystem. Solid grasp of networking, load balancing, and system security principles. Preferred Skills: Experience integrating Druid with BI tools (e.g., Superset, Tableau, Looker). Knowledge of data modeling for OLAP-style queries. Exposure to hybrid cloud architectures and multi-region deployments. Contributions to open-source Druid or related projects.