Posted: 1 week ago
On-site
Full Time
Job description:
We’re looking for a hands-on Data Engineer to manage and scale our data scraping pipelines across 60+ websites. The job involves handling OCR-processed PDFs, ensuring data quality, and building robust, self-healing workflows that fuel AI-driven insights.
You’ll Work On:
Managing and optimizing Airflow scraping DAGs
Implementing validation checks, retry logic & error alerts
Cleaning and normalizing OCR text (Tesseract / AWS Textract)
Handling deduplication, formatting, and missing data
Maintaining MySQL/PostgreSQL data integrity
Collaborating with ML engineers on downstream pipelines
What You Bring:
2–5 years of hands-on experience in Python data engineering
Experience with Airflow, Pandas, and OCR tools
Solid SQL skills and schema design (MySQL/PostgreSQL)
Comfort with CSVs and building ETL pipelines
Required:
1. Scrapy or Selenium experience
2. CAPTCHA handling
3. Experience with PyMuPDF and regex
4. AWS S3
5. LangChain, LLMs, FastAPI
6. Streamlit
7. Matplotlib
Job Type: Full-time
Day shift
Work Location: In person
Pay: ₹70,000.00 - ₹150,000.00 per month
GEMTECH PARAS SOLUTIONS PVT LTD