Data Platform Engineer

0 years

0 Lacs

Posted:1 month ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Contractual

Job Description

Spark Cluster Deployment: Deploy, configure, and maintain Apache Spark clusters on Kubernetes, ensuring scalability, reliability, and performance. Application Deployment: Collaborate with data engineers and data scientists to deploy Spark applications and workloads, ensuring they run efficiently. Monitoring and Optimization: Implement monitoring solutions to track cluster performance, resource utilization, and application health. Proactively identify and resolve performance bottlenecks. Resource Management: Manage cluster resources, including CPU, memory, and storage allocation, to ensure optimal utilization and cost efficiency. Security: Implement and maintain security measures, including authentication, authorization, and encryption, to protect sensitive data and Spark clusters. Backup and Recovery: Develop and maintain backup and recovery strategies to ensure data integrity and availability in case of failures. Documentation: Maintain clear and comprehensive documentation of Spark cluster configurations, deployment procedures, and best practices. Troubleshooting: Quickly diagnose and resolve issues related to Spark clusters, applications, and Kubernetes infrastructure. Collaboration: Work closely with cross-functional teams, including data engineers, data scientists, and DevOps, to understand application requirements and optimize Spark clusters accordingly. Requirements Proven experience deploying and managing Apache Spark on Kubernetes in a production environment. Proficiency in containerization technologies, particularly Docker and Kubernetes. Strong knowledge of Spark architecture, including cluster, driver, and worker nodes. Familiarity with Spark tuning, optimization, and performance monitoring. Experience with resource management tools like Kubernetes Resource Quotas and LimitRanges. Understanding of data processing and analytics workflows. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration skills. Experience with Spark cluster orchestration tools like Helm. Knowledge of Spark ecosystem components such as Spark SQL, Spark Streaming, and MLlib. Familiarity with cloud-based solution (Azure). Certification in Kubernetes (e.g., Certified Kubernetes Administrator - CKA). Knowledge of CI/CD pipelines and infrastructure as code (IaC) tools (e.g., Terraform). Scripting skills in languages like Python, Bash, or Shell. Understanding of DevOps practices and automation. Show more Show less

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You