AI/ML Engineer

Experience: 6 years

Salary: 0 Lacs

Posted: 1 week ago | Platform: LinkedIn


Work Mode: On-site

Job Type: Full Time

Job Description


Location

Salary Budget

Key Responsibilities

  • Design, build, and maintain scalable data pipelines (batch + streaming) using Spark, Hadoop, and other Apache ecosystem tools (see the PySpark sketch after this list).
  • Develop robust ETL workflows for large-scale data ingestion, transformation, and validation.
  • Work with Cassandra, data lakes, and distributed storage systems to handle large-volume datasets.
  • Write clean, optimized, and modular Python code for data processing, automation, and machine learning tasks.
  • Utilize Linux environments for scripting, performance tuning, and data workflow orchestration.
  • Build and manage web scraping pipelines to extract structured and unstructured data from diverse sources.
  • Collaborate with ML/AI teams to prepare training datasets, manage feature stores, and support the model lifecycle.
  • Implement and experiment with LLMs, LangChain, RAG pipelines, and vector database integrations.
  • Assist in fine-tuning models, evaluating model performance, and deploying ML models into production.
  • Optimize data workflows for performance, scalability, and fault tolerance.
  • Document data flows, transformation logic, and machine learning processes.
  • Work cross-functionally with engineering, product, and DevOps teams to ensure reliable, production-grade data systems.
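As a rough illustration of the batch side of such a pipeline, the sketch below reads raw CSV events with PySpark, applies a simple validation/transformation step, and writes partitioned Parquet output. The file paths, column names, and partition key are hypothetical placeholders, not details from this posting.

```python
# Minimal PySpark batch ETL sketch (illustrative only).
# Paths, column names, and schema below are assumed placeholders.
from pyspark.sql import SparkSession, functions as F

spark = (
    SparkSession.builder
    .appName("events-batch-etl")  # hypothetical job name
    .getOrCreate()
)

# Ingest: read raw CSV events from a (hypothetical) landing zone.
raw = (
    spark.read
    .option("header", True)
    .option("inferSchema", True)
    .csv("hdfs:///data/landing/events/")  # assumed input path
)

# Validate + transform: drop rows without a user id, normalise the
# timestamp, and derive a date column for partitioning.
clean = (
    raw.filter(F.col("user_id").isNotNull())
       .withColumn("event_ts", F.to_timestamp("event_ts"))
       .withColumn("event_date", F.to_date("event_ts"))
)

# Load: write partitioned Parquet into a (hypothetical) curated zone.
(
    clean.write
         .mode("overwrite")
         .partitionBy("event_date")
         .parquet("hdfs:///data/curated/events/")  # assumed output path
)

spark.stop()
```

A streaming variant of the same flow would swap spark.read for spark.readStream and write with a checkpointed writeStream sink; the transformation logic itself stays largely unchanged.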

Requirements

  • 3–6 years of experience as a Data Engineer, ML Engineer, or in a similar role.
  • Strong expertise in Advanced Python (data structures, multiprocessing, async, clean architecture).
  • Solid experience with:
    • Apache Spark / PySpark
    • Hadoop ecosystem (HDFS, Hive, YARN, HBase, etc.)
    • Cassandra or similar distributed databases
    • Linux (CLI tools, shell scripting, environment management)
  • Proven ability to design and implement ETL pipelines and scalable data processing systems.
  • Hands-on experience with data lakes, large-scale storage, and distributed systems.
  • Experience with web scraping frameworks (BeautifulSoup, Scrapy, Playwright, etc.).
  • Familiarity with LangChain, LLMs, RAG, vector stores (FAISS, Pinecone, Milvus), and ML workflow tools (see the retrieval sketch after this list).
  • Understanding of model training, fine-tuning, and evaluation workflows.
  • Strong problem-solving skills, with the ability to dig deep into complex data issues and write production-ready code.
  • Experience with cloud environments (AWS/GCP/Azure) is a plus.
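For context on the RAG and vector-store familiarity listed above, here is a minimal retrieval sketch using FAISS. The embed() function, the embedding dimension, and the sample documents are hypothetical stand-ins for a real embedding model (e.g. a sentence-transformer or an LLM embedding endpoint) and a real document corpus.

```python
# Minimal RAG-style retrieval sketch with FAISS (illustrative only).
# embed() is a hypothetical placeholder for a real embedding model.
import numpy as np
import faiss

DIM = 384  # assumed embedding dimension

def embed(texts: list[str]) -> np.ndarray:
    """Placeholder embedding: random vectors standing in for a real model."""
    rng = np.random.default_rng(0)
    return rng.standard_normal((len(texts), DIM)).astype("float32")

# Hypothetical document chunks that would normally come from the data lake.
docs = [
    "Spark structured streaming handles micro-batch ingestion.",
    "Cassandra stores wide rows partitioned by key.",
    "FAISS provides fast nearest-neighbour search over embeddings.",
]

# Build a flat L2 index over the document embeddings.
index = faiss.IndexFlatL2(DIM)
index.add(embed(docs))

# Retrieve the top-k chunks for a query; in a full RAG pipeline these
# would be passed to an LLM as grounding context.
query_vec = embed(["How do I search embeddings quickly?"])
distances, ids = index.search(query_vec, 2)
for rank, doc_id in enumerate(ids[0]):
    print(f"{rank + 1}. {docs[doc_id]} (distance={distances[0][rank]:.2f})")
```

In production, the flat index would typically be replaced by an approximate index (or a managed store such as Pinecone or Milvus) once the corpus grows beyond a few hundred thousand chunks.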
