Backend Data & Integration Engineer

4 - 8 years

0 Lacs

Posted:8 hours ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Data Engineer at our company located in Kochi, you will be responsible for building data pipelines and connecting external systems and interfaces. Your role objective includes developing crawling/fetch pipelines, parsing/normalization of job postings & CVs, and embeddings/similarity search. Some key responsibilities include: - Development of crawling/fetch pipelines using an API-first approach; utilizing playwright/requests where permitted - Parsing/normalization of job postings & CVs, implementing deduplication/delta logic, and managing repost heuristics - Implementing embeddings/similarity search, controlling Azure OpenAI, and maintaining vector persistence in pgvector - Integrating with systems such as HR4YOU, SerpAPI, BA job board, and email/SMTP using APIs, webhooks, and CSV import - Handling batch/stream processing with Azure Functions/container jobs, managing retry/off, and dead-letter queues - Implementing telemetry for data quality, monitoring freshness, duplicate rate, coverage, and cost per 1,000 items - Collaborating with the Frontend team for exports, generating CSV/Excel files, presigned URLs, and admin configuration Qualifications required for this role include: - 4+ years of end/data engineering experience - Proficiency in Python (FastAPI, pydantic, httpx/requests, Playwright/Selenium) and solid TypeScript for smaller services/SDKs - Experience with Azure services such as Functions/Container Apps or AKS jobs, Storage/Blob, Key Vault, and Monitor/Log Analytics - Familiarity with messaging systems like Service Bus/Queues, idempotence & exactly-once semantics, and clean ETL/ELT patterns - Strong database skills in PostgreSQL, pgvector, query design, and performance tuning - Knowledge of clean ETL/ELT patterns, testability with pytest, and observability with OpenTelemetry Nice-to-have qualifications include: - Experience in NLP/IE using spaCy/regex/rapidfuzz, and document parsing with pdfminer/textract - Knowledge of license/ToS-compliant data retrieval, captcha/anti-bot strategies, and working method emphasizing API-first and clean code practices - Familiarity with tools/stack such as GitHub, GitHub Actions/Azure DevOps, Docker, pnpm/Turborepo (Monorepo), Jira/Linear, and Notion/Confluence - Willingness to participate in on-call/support duties on a rotating basis, following the "you build it, you run it" approach Join us in this exciting opportunity to contribute to our data engineering team and make a significant impact in the field of data processing and integration.,

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You