Part Time Python Developer

0 years

0 Lacs

Posted: 1 week ago | Platform: Glassdoor

Work Mode

Remote

Job Type

Part Time

Job Description

Job Title: Part-Time Python OCR Developer (S3 Integration + Database Automation)

Project Overview:
Photon Software is implementing an automated OCR engine within the Vendor Management System (VMS) to extract key data from Purchase Order and Invoice PDFs stored in AWS S3. The process will run 24×7 as a scheduled job, automatically downloading new files, extracting text and data, persisting the results into a database, and archiving processed PDFs.

Use Case Summary:

  • Monitor a specific S3 folder every 30 minutes for new PDF uploads.
  • Automatically download new PDFs and perform OCR extraction (e.g., PO number, vendor details, invoice amount, date).
  • Persist extracted data into a MySQL database using Python scripts.
  • After successful extraction, move PDFs to an “Archive” folder in S3.
  • Repeat this cycle around the clock (24×7) using a scheduler (e.g., cron job, APScheduler, AWS Lambda, or EC2 background service).
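
As an illustration of the extraction step listed above, here is a minimal sketch of pulling the named fields (PO number, invoice amount, date) out of text that an OCR engine has already produced. The label patterns, the DD/MM/YYYY date format, and the `extract_fields` helper are assumptions made for the example; real PO and invoice layouts will need their own patterns.

```python
import re
from datetime import datetime
from decimal import Decimal

# Hypothetical label patterns; adjust to the actual document layouts.
FIELD_PATTERNS = {
    "po_number": re.compile(
        r"PO\s*(?:Number|No\.?|#)\s*[:\-]?\s*([A-Z0-9\-]+)", re.IGNORECASE
    ),
    "invoice_amount": re.compile(
        r"(?:Invoice\s+)?(?:Total|Amount)\s*[:\-]?\s*(?:Rs\.?|INR|₹|\$)?\s*"
        r"(\d[\d,]*(?:\.\d{1,2})?)",
        re.IGNORECASE,
    ),
    "invoice_date": re.compile(
        r"(?:Invoice\s+)?Date\s*[:\-]?\s*(\d{2}[/-]\d{2}[/-]\d{4})", re.IGNORECASE
    ),
}


def extract_fields(ocr_text):
    """Return a dict of parsed fields; a field that is not found is None."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(ocr_text)
        fields[name] = match.group(1) if match else None

    # Normalise the amount to Decimal so it can be stored without float error.
    if fields["invoice_amount"] is not None:
        fields["invoice_amount"] = Decimal(fields["invoice_amount"].replace(",", ""))

    # Assumes day-first dates (DD/MM/YYYY or DD-MM-YYYY).
    if fields["invoice_date"] is not None:
        normalised = fields["invoice_date"].replace("-", "/")
        fields["invoice_date"] = datetime.strptime(normalised, "%d/%m/%Y").date()

    return fields


sample_text = (
    "Purchase Order\n"
    "PO Number: PO-2024-0017\n"
    "Vendor: Acme Supplies\n"
    "Invoice Date: 05/03/2024\n"
    "Invoice Amount: INR 12,450.50\n"
)
parsed = extract_fields(sample_text)
```

Returning None for a missing field, rather than raising, lets the caller decide whether a partially read document should be persisted, retried, or flagged for manual review.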

Responsibilities:

  • Develop and deploy a Python-based OCR pipeline (Tesseract / EasyOCR).
  • Integrate with AWS S3 for file read/write operations.
  • Implement data persistence logic (MySQL insert/update).
  • Design a scheduling mechanism for 30-minute job execution.
  • Maintain error logs, exception handling, and retry mechanisms.
  • Optimize performance for continuous 24×7 operations.
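
The archiving and retry responsibilities above might be sketched as follows. S3 has no rename operation, so a "move" is a copy followed by a delete. The `with_retries` and `move_to_archive` helpers and the `incoming/` prefix are hypothetical names chosen for the example; `s3_client` is expected to be a boto3 S3 client (for example from `boto3.client("s3")`), whose `copy_object` and `delete_object` calls are used here.

```python
import logging
import time

logger = logging.getLogger("ocr_pipeline")


def with_retries(func, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call func(); after a failure wait base_delay * 2**n, then try again."""
    for attempt in range(1, attempts + 1):
        try:
            return func()
        except Exception as exc:  # a real job should catch narrower error types
            if attempt == attempts:
                logger.error("giving up after %d attempts: %s", attempt, exc)
                raise
            delay = base_delay * 2 ** (attempt - 1)
            logger.warning(
                "attempt %d failed (%s); retrying in %.1fs", attempt, exc, delay
            )
            sleep(delay)


def move_to_archive(s3_client, bucket, key,
                    incoming_prefix="incoming/", archive_prefix="Archive/"):
    """Copy the object under the archive prefix, then delete the original."""
    if not key.startswith(incoming_prefix):
        raise ValueError(f"{key!r} is not under {incoming_prefix!r}")
    new_key = archive_prefix + key[len(incoming_prefix):]
    s3_client.copy_object(
        Bucket=bucket,
        Key=new_key,
        CopySource={"Bucket": bucket, "Key": key},
    )
    s3_client.delete_object(Bucket=bucket, Key=key)
    return new_key
```

A call such as `with_retries(lambda: move_to_archive(s3_client, bucket, key))` is safe to repeat: if the copy succeeds but the delete fails, the next attempt simply overwrites the archived copy before deleting the original.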

Required Skills:

  • Strong Python (3.x) programming experience.
  • Hands-on experience with the AWS SDK for Python (boto3) for S3 and with MySQL (PyMySQL / SQLAlchemy).
  • Experience with OCR tools such as Tesseract, EasyOCR, or PaddleOCR.
  • Experience in PDF processing (pdfplumber, PyMuPDF/fitz).
  • Knowledge of job schedulers (cron, APScheduler, Celery, or AWS Lambda scheduling).
  • Basic understanding of logging, error handling, and multithreading.

Nice to Have:

  • Experience handling financial documents (POs, invoices).
  • Familiarity with OCR accuracy improvement and confidence scoring.
  • AWS Lambda or Docker deployment knowledge.

Engagement Details:

  • Part-time role (approx. 1 hour/day) for ongoing OCR automation maintenance and enhancement.
  • Remote / flexible timing with weekly progress updates.
  • Duration: Long-term (initial 3-month contract, extendable).

How to Apply:
Send your profile or GitHub samples demonstrating similar OCR or AWS automation work to shivbratmishra@photonsoftwares.com. Include “Python OCR – S3 Automation” in the subject line.

Job Type: Part-time

Pay: ₹5,000.00 - ₹6,000.00 per month

Expected hours: 10 per week

Work Location: In person
