19 Pdfminer Jobs

JobPe aggregates listings for easy access; you submit each application directly on the original job portal.

4.0 years

0 Lacs

Kochi, Kerala, India

On-site

Responsibilities Role objective: Build data pipelines (crawling/parsing, deduplication/delta, embeddings) and connect external systems and interfaces. Tasks Development of crawling/fetch pipelines (API-first; playwright/requests only where permitted) Parsing/normalization of job postings & CVs, deduplication/delta logic (seen hash, repost heuristics) Embeddings/similarity search (controlling Azure OpenAI, vector persistence in pgvector) Integrations: HR4YOU (API/webhooks/CSV import), SerpAPI, BA job board, email/SMTP Batch/stream processing (Azure Functions/container jobs), retry/backoff, dead-letter queues Telemetry for data quality (freshness, duplicate rate, coverage, cost per 1,000 items...

Posted 20 hours ago

4.0 - 8.0 years

0 Lacs

Kochi, Kerala

On-site

As a Data Engineer at our company located in Kochi, you will be responsible for building data pipelines and connecting external systems and interfaces. Your role objective includes developing crawling/fetch pipelines, parsing/normalization of job postings & CVs, and embeddings/similarity search. Some key responsibilities include: - Development of crawling/fetch pipelines using an API-first approach; utilizing playwright/requests where permitted - Parsing/normalization of job postings & CVs, implementing deduplication/delta logic, and managing repost heuristics - Implementing embeddings/similarity search, controlling Azure OpenAI, and maintaining vector persistence in pgvector - Integrating w...

Posted 4 days ago

4.0 - 8.0 years

0 Lacs

Kochi, Kerala

On-site

Role Overview: As a Senior Backend Data & Integration Engineer at our company, your primary responsibility will be to build data pipelines and connect external systems and interfaces. You will be working with a variety of technologies such as Python, TypeScript, Azure, and integrations to ensure smooth data flow and seamless integrations. Key Responsibilities: - Develop crawling/fetch pipelines using an API-first approach, utilizing playwright/requests where permitted. - Parse and normalize job postings & CVs, implement deduplication/delta logic utilizing seen hash and repost heuristics. - Work on embeddings/similarity search, controlling Azure OpenAI and ensuring vector persistence in pgvec...

Posted 4 days ago

5.0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

About Our Company: Aerocraft Engineering India Pvt Ltd based in Ahmedabad, provides services to US-based Architecture, Engineering, and Construction groups of companies: Russell and Dawson – An Architecture/Engineering/Construction firm (www.rdaep.com) United-BIM – BIM Modeling Services Firm (www.united-bim.com) AORBIS – Procurement as a Service Provider (www.aorbis.com) We are a nimble and growing organization where everyone’s role is very important for the company’s business success. All team members’ contributions have a direct correlation with the company’s performance in meeting its business and financial objectives. We're looking for an AI/ML Engineer to join our innovative team workin...

Posted 6 days ago

0 years

0 Lacs

Kochi, Kerala, India

On-site

Location: Kochi, Infopark (Onsite) Type: Full-time How to apply: Send a 5-minute Loom recording of yourself explaining why you think this would be a good fit to careers@datadivr.com with the subject line: I'm the one. About the Role Datadivr is looking for a Backend Developer to own the design and implementation of our document processing and data pipelines. Your work will ensure our AI solutions have clean, structured, and reliable data for retrieval and task execution. This is a hands-on role where you’ll build systems that transform messy PDFs, images, and documents into fast, production-ready data services that power real-world AI applications. What You’ll Do Build and maintain document ingest...

Posted 1 week ago

4.0 years

0 Lacs

Kochi, Kerala, India

On-site

Designation: Senior Backend Data & Integration Engineer (Python/TypeScript, Azure, Integrations) Experience: 4-8 Years Location: Cochin (Onsite) Job Summary Build data pipelines (crawling/parsing, deduplication/delta, embeddings) and connect external systems and interfaces. Key Responsibilities: • Development of crawling/fetch pipelines (API-first; playwright/requests only where permitted) • Parsing/normalization of job postings & CVs, deduplication/delta logic (seen hash, repost heuristics) • Embeddings/similarity search (controlling Azure OpenAI, vector persistence in pgvector) • Integrations: HR4YOU (API/webhooks/CSV import), SerpAPI, BA job board, email/SMTP • Batch/stream processing (Az...

Posted 1 week ago

4.0 years

0 Lacs

Kochi, Kerala, India

On-site

Designation: Senior Backend Data & Integration Engineer (Python/TypeScript, Azure, Integrations) Experience: 4-8 Years Location: Cochin Job Summary Build data pipelines (crawling/parsing, deduplication/delta, embeddings) and connect external systems and interfaces. Key Responsibilities Development of crawling/fetch pipelines (API-first; playwright/requests only where permitted) Parsing/normalization of job postings & CVs, deduplication/delta logic (seen hash, repost heuristics) Embeddings/similarity search (controlling Azure OpenAI, vector persistence in pgvector) Integrations: HR4YOU (API/webhooks/CSV import), SerpAPI, BA job board, email/SMTP Batch/stream processing (Azure Functions/conta...

Posted 1 week ago

4.0 years

0 Lacs

Kochi, Kerala, India

On-site

Job Description Designation: Senior Backend Data & Integration Engineer (Python/TypeScript, Azure, Integrations) Experience: 4-8 Years Location: Cochin Job Summary Build data pipelines (crawling/parsing, deduplication/delta, embeddings) and connect external systems and interfaces. Key Responsibilities: • Development of crawling/fetch pipelines (API-first; playwright/requests only where permitted) • Parsing/normalization of job postings & CVs, deduplication/delta logic (seen hash, repost heuristics) • Embeddings/similarity search (controlling Azure OpenAI, vector persistence in pgvector) • Integrations: HR4YOU (API/webhooks/CSV import), SerpAPI, BA job board, email/SMTP • Batch/stream proces...

Posted 1 week ago

4.0 years

0 Lacs

Kochi, Kerala, India

On-site

Greetings from Beo Software Pvt Ltd. We are a German-headquartered (BEO GmbH) IT solutions and services company based in Kochi. To improve our clients' operational efficiencies, we bring time-tested methodologies, proven processes, and deep expertise in software development, as well as a legacy of best practices. We provide full stack development services and serve as an extended office for various clients across Europe. Please find the details below: Designation: Senior Backend Data & Integration Engineer (Python/TypeScript, Azure, Integrations) Experience: 4+ Years Job Location: Kochi, Kerala • Development of crawling/fetch pipelines (API-first; playwright/requests only where permitte...

Posted 1 week ago

5.0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

About Our Company: Aerocraft Engineering India Pvt Ltd based in Ahmedabad, provides services to US-based Architecture, Engineering, and Construction groups of companies: Russell and Dawson – An Architecture/Engineering/Construction firm (www.rdaep.com) United-BIM – BIM Modeling Services Firm (www.united-bim.com) AORBIS – Procurement as a Service Provider (www.aorbis.com) We are a nimble and growing organization where everyone’s role is very important for the company’s business success. All team members’ contributions have a direct correlation with the company’s performance in meeting its business and financial objectives. We're looking for an AI/ML Engineer to join our innovative team workin...

Posted 2 weeks ago

4.0 - 8.0 years

0 Lacs

Kochi, Kerala

On-site

As a Data Engineer, your main objective will be to build data pipelines for crawling, parsing, and connecting external systems and interfaces. This includes developing crawling and fetching pipelines using an API-first approach and tools like playwright and requests. You will also be responsible for parsing and normalizing job postings and CVs, implementing deduplication and delta logic, and working on embeddings and similarity search. Additionally, you will be involved in integrating with various systems such as HR4YOU, SerpAPI, BA job board, and email/SMTP. Your role will also require you to work on batch and stream processing using Azure Functions or container jobs, implementing retry/bac...

Posted 2 weeks ago

4.0 years

2 - 3 Lacs

Cochin

On-site

Location: Kochi Employment Type: Full Time Work Mode: Hybrid Experience: 4-8 yrs Job Code: BEO-5090 Posted Date: 02/09/2025 Job Description Responsibilities Role objective: Build data pipelines (crawling/parsing, deduplication/delta, embeddings) and connect external systems and interfaces. Tasks Development of crawling/fetch pipelines (API-first; playwright/requests only where permitted) Parsing/normalization of job postings & CVs, deduplication/delta logic (seen hash, repost heuristics) Embeddings/similarity search (controlling Azure OpenAI, vector persistence in pgvector) Integrations: HR4YOU (API/webhooks/CSV import), SerpAPI, BA job board, email/SMTP Batch/stream processing (Azure ...

Posted 2 weeks ago

4.0 years

0 Lacs

Kochi, Kerala, India

On-site

Responsibilities Role objective: Build data pipelines (crawling/parsing, deduplication/delta, embeddings) and connect external systems and interfaces. Tasks Development of crawling/fetch pipelines (API-first; playwright/requests only where permitted) Parsing/normalization of job postings & CVs, deduplication/delta logic (seen hash, repost heuristics) Embeddings/similarity search (controlling Azure OpenAI, vector persistence in pgvector) Integrations: HR4YOU (API/webhooks/CSV import), SerpAPI, BA job board, email/SMTP Batch/stream processing (Azure Functions/container jobs), retry/backoff, dead-letter queues Telemetry for data quality (freshness, duplicate rate, coverage, cost per 1,000 items...

Posted 2 weeks ago

4.0 years

0 Lacs

Navi Mumbai, Maharashtra, India

On-site

About IRIS IRIS Business Services Limited (IRIS) is a leading regtech SaaS provider listed on both the BSE and NSE. Established in 2000, IRIS empowers over 30 regulators and 6,000 enterprises across 54+ countries, positively impacting more than 2 billion lives. Our innovative solutions transform regulatory compliance into a competitive business advantage. Headquartered in Mumbai, IRIS operates subsidiaries in the USA, Singapore, Malaysia, and Italy, with an affiliate in the UAE. IRIS is also a proud member of XBRL jurisdictions worldwide, including XBRL International, India, Europe, South Africa, and the USA. In India, IRIS is an authorized GST Suvidha Provider and a Private Invoice Registra...

Posted 3 weeks ago

4.0 years

0 Lacs

Navi Mumbai, Maharashtra, India

On-site

About IRIS IRIS Business Services Limited (IRIS) is a leading regtech SaaS provider listed on both the BSE and NSE. Established in 2000, IRIS empowers over 30 regulators and 6,000 enterprises across 54+ countries, positively impacting more than 2 billion lives. Our innovative solutions transform regulatory compliance into a competitive business advantage. Headquartered in Mumbai, IRIS operates subsidiaries in the USA, Singapore, Malaysia, and Italy, with an affiliate in the UAE. IRIS is also a proud member of XBRL jurisdictions worldwide, including XBRL International, India, Europe, South Africa, and the USA. In India, IRIS is an authorized GST Suvidha Provider and a Private Invoice Registra...

Posted 1 month ago

5.0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

About Our Company: Aerocraft Engineering India Pvt Ltd based in Ahmedabad, provides services to US-based Architecture, Engineering, and Construction groups of companies: Russell and Dawson – An Architecture/Engineering/Construction firm (www.rdaep.com) United-BIM – BIM Modeling Services Firm (www.united-bim.com) AORBIS – Procurement as a Service Provider (www.aorbis.com) We are a nimble and growing organization where everyone’s role is very important for the company’s business success. All team members’ contributions have a direct correlation with the company’s performance in meeting its business and financial objectives. We're looking for an AI/ML Engineer to join our innovative team workin...

Posted 1 month ago

3.0 - 5.0 years

4 - 9 Lacs

Pune

Work from Office

Role & responsibilities Design, prototype, and deploy AI-driven applications leveraging LLMs (GPT-4, Perplexity, Claude, Gemini, etc.) and open-source transformer models. Lead or co-lead end-to-end AI/GenAI solutions: from data ingestion, entity extraction, and semantic search to user-facing interfaces. Implement RAG (Retrieval-Augmented Generation) architectures, knowledge grounding pipelines, and prompt orchestration logic. Fine-tune transformer models (BERT, RoBERTa, T5, LLaMA) on custom datasets for use cases like: Document understanding Conversational AI Question answering Summarization & Topic Modeling Integrate LLM workflows into scalable backend architectures with APIs and frontends...

Posted 2 months ago

2.0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

About Our Company: Aerocraft Engineering India Pvt Ltd based in Ahmedabad, provides services to US-based Architecture, Engineering, and Construction groups of companies: Russell and Dawson – An Architecture/Engineering/Construction firm (www.rdaep.com) United-BIM – BIM Modeling Services Firm (www.united-bim.com) AORBIS – Procurement as a Service Provider (www.aorbis.com) For the AORBIS business, we are seeking a passionate and skilled AI/ML Engineer with hands-on experience in Computer Vision, Document Processing Automation (PDFs), and LLMs. The ideal candidate will contribute to designing and deploying scalable AI solutions that extract, interpret, and act on unstructured and semi-structured dat...

Posted 3 months ago

8 - 12 years

0 Lacs

Mumbai, Maharashtra, India

Remote

We are seeking a talented individual to join our Data Science team at Marsh. This role will be based in Mumbai. This is a hybrid role that has a requirement of working at least three days a week in the office. Senior Manager - Data Science and Automation We will count on you to: Identify opportunities that add value to the business and make the process more efficient. Invest in understanding the core business, including products, processes, documents, and data points, with the objective of identifying efficiency and value addition opportunities. Design and develop end-to-end NLP/LLM solutions for document parsing, information extraction, and summarization from PDFs and scanned text. Develop AI app...

Posted 4 months ago
