We are seeking a Senior NoSQL Database Engineer with deep expertise in Apache Cassandra and working knowledge of MongoDB
The ideal candidate is passionate about designing and maintaining resilient, highavailability, distributed data systems, and ensuring zero-downtime performance in production environments
Role & responsibilities
1. Installation & Provisioning
- Plan, install, and configure new
Cassandra clusters
from scratch across development, staging, and production environments. - Automate provisioning using
Infrastructure-as-Code (IaC)
tools such as Terraform
or Ansible
. - Validate environment prerequisites, including
OS tuning, JVM configuration, port management, and security settings
. - Design for
high availability
and resilience using proper replication
and data partitioning
strategies.
2. Cluster Management & Scaling
- Create and manage
multi-node Cassandra clusters
with optimal replication and consistency settings. - Scale clusters by
adding or removing nodes
seamlessly without downtime; manage token assignments
and ring balancing
. - Oversee
data distribution, repair operations, compactions
, and load balancing
across nodes. - Plan and execute
cluster upgrades and patches
with minimal disruption.
3. Performance Tuning & Optimization
- Monitor cluster performance metrics
latency, throughput, GC behavior, disk I/O, and heap utilization
. - Collaborate with development teams to optimize
schema design
and query performance
. - Fine-tune
JVM parameters, caching
, and compaction strategies
for optimal throughput. - Identify and mitigate performance bottlenecks using
nodetool
, JMX
, and Grafana
dashboards.
4. Backup, Restore & Disaster Recovery
- Implement
automated backup and restore
strategies using tools like nodetool snapshot
or Medusa
. - Validate
backup integrity
through periodic recovery drills. - Define and maintain
disaster recovery (DR) plans
with tested RTO/RPO
objectives.
5. Monitoring & Maintenance
- Set up continuous monitoring using
Prometheus
, Grafana
, or OpsCenter
. - Configure alerts for key cluster health indicators
node status, compaction, heap usage, disk utilization, dropped mutations
, etc. - Participate in
on-call rotations
for production support and incident response. - Conduct
root cause analysis (RCA)
and drive preventive maintenance
for recurring issues.
6. Security & Compliance
- Implement and maintain
role-based access control (RBAC)
and secure authentication mechanisms
. - Enable
TLS/SSL encryption
for data-in-transit and data-at-rest. - Perform regular
audits
, credential rotations
, and ensure compliance with organizational and regulatory standards.
Required Skills :
- At least 5 years of hands-on experience managing Cassandra in production environments.
- Strong knowledge of Cassandra internals gossip, token distribution, consistency, repair, and compaction
- Proven experience with database provisioning, scaling, and upgrades
- Proficient in Linux systems and scripting (Bash or Python)
- Experience with data replication, backup, and recovery automation
- Familiarity with cloud platforms (AWS/GCP/Azure) for distributed Cassandra deployments
- Good understanding of database security, encryption, and auditing.
- Effective communication and problem-solving skills.
Preferred (Nice to Have) :
- Experience with MongoDB as a complementary NoSQL database
- Familiarity with Terraform, Ansible, or Kubernetes for automation and containerized deployments
- Exposure to Elasticsearch, Kafka, or other distributed systems
- Hands-on with CI/CD pipelines for database release management
- Knowledge of Prometheus, Grafana, or OpsCenter for performance visualization.