Artificial Intelligence Engineer

3 years

0 Lacs

Posted:8 hours ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

About Us


Role Overview


Key Responsibilities:

  • Ship Flask-based AI services: design, implement, and maintain Flask APIs for LLM/agent features (auth, rate‑limits, request/response schemas) with logging, tracing, and error handling.
  • Integrate with React experiences: define clear API contracts and collaborate with React devs; build/own a lightweight internal demo UI when needed to validate UX.
  • Build agentic workflows end-to-end: tool/function calling, planning/memory, guardrails; wire agents to recruiting use cases (sourcing, outreach personalization, scheduling, knowledge retrieval).
  • Operate RAG as a product surface: own chunking policies, embeddings, vector/hybrid search, and freshness pipelines; monitor retrieval quality and data drift.
  • Deliver on cost & latency SLOs: set budgets/targets; optimize via model routing (SLM vs FM), quantization, caching, batching; track p50/p95 latency and cost per task.
  • Evaluation & safety by default: curate prompt + dataset repos, offline/online evals (task success, hallucination, bias), red‑team critical paths; maintain dashboards and alerts.
  • Lightweight MLOps & releases: manage model/artifact versions (simple registry), canary/A‑B rollouts, and rollbacks; automate with CI/CD.
  • Security & PII ownership: implement least‑privilege access, masking, and audit trails; ensure tenancy isolation and SOC2‑oriented readiness.
  • Lead a small squad: set technical direction, break down work, review PRs, mentor ICs, and collaborate with Product/Design/Customer teams to hit outcomes.


Tech Stack You’ll Work With:

  • Languages: Python and Flask and React
  • AI/ML: PyTorch/JAX, foundation models (OpenAI/Anthropic/Google)
  • Agents & RAG: tool/function calling, vector databases (Pinecone/Weaviate/PGVector/FAISS).


Required Skills & Qualifications:

  • 3+ years in software engineering with 6 months to 1 year experience in building ML/LLM‑powered products in production.
  • Strong Python and Flask
  • Hands‑on experience with agentic workflows (tool use, planning, memory) and Retrieval‑Augmented Generation (RAG) at scale.
  • Demonstrated work with small/compact LLMs (fine‑tuning, quantization, distillation) and routing between foundation models and SLMs.
  • Ability to design evaluation frameworks and ship safe, reliable AI systems with measurable outcomes.
  • Solid systems and infra fundamentals: APIs, queues, data modeling, profiling, CI/CD, and cloud basics.
  • Experience leading projects or a small team (tech lead, mentor, or manager‑of‑ICs).


Nice-to-Have:

  • HRTech/ATS/CRM experience or handling sensitive PII datasets.
  • GPU inference optimization (KV‑cache, paged attention, tensor parallel), Triton kernels, or custom CUDA.
  • Search/retrieval systems (Elasticsearch/OpenSearch, Vespa, Milvus) and hybrid retrieval expertise.
  • Frontend familiarity (React/Next.js) for rapid prototyping with Design.
  • Security, privacy, and compliance exposure (SOC2, ISO 27001).


Education:

Bachelor’s degree in computer science, Engineering, or a related field (or equivalent practical experience).


What Success Looks Like (30/60/90):

  • 30 days: Ship a scoped AI feature improvement (e.g., retrieval quality or latency win); map the AI surface area and propose a technical roadmap.
  • 60 days: Lead a squad to productionize an agentic workflow with measurable wins (conversion, time‑to‑fill, or support deflection).
  • 90 days: Establish an AI evaluation & observability stack (dashboards, alerts, experiment tracking) and a repeatable release process.


What We Offer:

  • Be part of a dynamic and forward-thinking executive search firm that fosters collaboration, growth, and continuous learning.
  • Work closely with top-tier organizations, providing strategic talent solutions and gaining deep insights into leadership hiring and talent management.
  • Enjoy a rewarding package that reflects your expertise and contribution, along with comprehensive benefits to support your well-being.
  • Access best-in-class tools, technologies, and methodologies that are shaping the future of executive search, empowering you to deliver exceptional results for clients.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You