AI Data Engineer

5 - 9 years

0 Lacs

Posted:20 hours ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview: Capgemini Invent is looking for a Data Engineer with 5+ years of experience in data engineering to join their team. As a Data Engineer, you will be responsible for designing, developing, and maintaining scalable data pipelines to support ML models. You will work closely with data scientists and ML engineers to ensure high-quality data inputs for ML applications. Additionally, you will be involved in data preprocessing, cleansing, exploratory data analysis, and managing datasets comprising text, image, audio, and video data. Key Responsibilities: - Design, develop, and maintain scalable data pipelines for ML models - Perform data preprocessing, cleansing, and labeling - Conduct exploratory data analysis to gather insights and identify data patterns - Collaborate with data scientists and ML engineers to align data pipeline requirements with model development needs - Create and manage datasets comprising text, image, audio, and video data - Implement best practices for data management, ensuring data integrity, consistency, and security - Optimize data workflows and processing pipelines for efficiency and performance - Utilize cloud-based data storage and processing solutions as needed - Stay current with industry trends and technologies to continuously improve data engineering processes - Provide technical support and guidance to junior data engineers and other team members Qualifications Required: - Bachelors or Masters degree in Computer Science, Engineering, or a related field - 5+ years of experience in data engineering, with a focus on building data pipelines and preprocessing data - Strong proficiency in programming languages such as Python, Java, or Scala - Hands-on experience with data processing frameworks and tools like Apache Spark, Hadoop, or similar - Proficiency in SQL and experience with relational and NoSQL databases - Experience with data visualization and EDA tools such as Pandas, Matplotlib, or Tableau - Familiarity with ML and AI concepts, particularly in relation to data preparation and pipelines - Experience with text, image, audio, and video data management, including labeling and cleansing - Exposure to EdgeAI applications and their unique data processing requirements (preferred) - Strong problem-solving skills and the ability to work independently and collaboratively About Capgemini: Capgemini is a global business and technology transformation partner with a diverse team of 340,000 members in more than 50 countries. The company is trusted by clients for its expertise in AI, cloud, and data, delivering end-to-end services and solutions to address various business needs. Capgemini focuses on accelerating the transition to a digital and sustainable world while creating tangible impact for enterprises and society. Role Overview: Capgemini Invent is looking for a Data Engineer with 5+ years of experience in data engineering to join their team. As a Data Engineer, you will be responsible for designing, developing, and maintaining scalable data pipelines to support ML models. You will work closely with data scientists and ML engineers to ensure high-quality data inputs for ML applications. Additionally, you will be involved in data preprocessing, cleansing, exploratory data analysis, and managing datasets comprising text, image, audio, and video data. Key Responsibilities: - Design, develop, and maintain scalable data pipelines for ML models - Perform data preprocessing, cleansing, and labeling - Conduct exploratory data analysis to gather insights and identify data patterns - Collaborate with data scientists and ML engineers to align data pipeline requirements with model development needs - Create and manage datasets comprising text, image, audio, and video data - Implement best practices for data management, ensuring data integrity, consistency, and security - Optimize data workflows and processing pipelines for efficiency and performance - Utilize cloud-based data storage and processing solutions as needed - Stay current with industry trends and technologies to continuously improve data engineering processes - Provide technical support and guidance to junior data engineers and other team members Qualifications Required: - Bachelors or Masters degree in Computer Science, Engineering, or a related field - 5+ years of experience in data engineering, with a focus on building data pipelines and preprocessing data - Strong proficiency in programming languages such as Python, Java, or Scala - Hands-on experience with data processing frameworks and tools like Apache Spark, Hadoop, or similar - Proficiency in SQL and experience with relational and NoSQL databases - Experience with data visualization and EDA tools such as Pandas, Matplotlib, or Tableau - Familiarity with ML and AI concepts, particularly in relation to data preparation and pipelines - Experience with text, image, audio, and video data management, including labeling and cleansing - Exposure to EdgeAI applications a

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

gurugram, haryana, india

hyderabad, telangana, india