Data Engineer - 3+ Years

3 years

0 Lacs

Posted: 1 day ago | Platform: LinkedIn


Work Mode: On-site

Job Type: Full Time

Job Description

Company Description

NovaIA offers an AI-powered voice assistant designed to support human agents in real time. Tailored to real estate agencies, the assistant can make calls, follow up with leads, filter prospects, and schedule appointments. Key features include real-time agent support and automated appointment management. The assistant listens in on conversations, provides live guidance, data, or suggestions, and handles follow-ups and meeting setups through voice interactions.


We're hiring a Data Engineer

Job Title: Data Engineer – Real-Time & ML Pipelines

Location:

Experience:

Working Hours:


Key Responsibilities

  • Design and implement data pipelines for real-time STT input, NLP processing, and TTS output
  • Build scalable ingestion systems for audio logs, model artifacts, and interaction metadata
  • Manage message queues and streaming data for efficient voice call routing and response
  • Optimize caching layers and prefetching logic for pre-recorded response fragments
  • Create ETL/ELT workflows for downstream analytics, monitoring, and feedback loops
  • Develop and manage session memory stores for dynamic context handling
  • Ensure data versioning, schema consistency, and lineage tracking
  • Collaborate on token usage optimization and infrastructure cost reporting

Core Skills

  • Data pipeline orchestration: Kubernetes
  • Stream processing: Kafka, Apache Flink, Redis Streams, RabbitMQ
  • Programming: Python, SQL; familiarity with Java/Scala is a plus
  • Cloud-native architecture: AWS (Kinesis, S3, Lambda), GCP (Pub/Sub, BigQuery), or Azure equivalents
  • Storage systems: PostgreSQL, DynamoDB, Parquet, Snowflake, Delta Lake
  • Data quality, schema validation, and observability tools
  • Experience working with audio data (transcription logs, metadata tagging, media storage)
  • Version control & CI/CD for data (DVC, Great Expectations, Git)


Preferred / Bonus Skills

  • Familiarity with ML model pipelines and experiment tracking
  • Real-time ETL optimization and low-latency microservices
  • Knowledge of vector databases (e.g., FAISS, Chroma, Pinecone)
  • Experience with WebRTC, SIP, or real-time audio systems
  • Data governance and compliance (PII masking, audit trails)

General Qualities We Value

  • Comfort working in fast-paced, ambiguous environments
  • Startup or zero-to-one product experience
  • A strong portfolio, GitHub contributions, or project demos
  • Willingness to collaborate closely with founders and cross-functional teams
  • Curiosity, creativity, and ability to learn quickly

Note: If a question is not applicable, write NA.

