Data Scientist-Data Science-Gen AI Engineer

5 - 9 years

0 Lacs

Posted:1 week ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Machine Learning Systems Architect, your primary responsibility will be to lead the architecture, development, and deployment of scalable machine learning systems with a focus on real-time inference for Large Language Models (LLMs) to serve multiple concurrent users. To achieve this, you will: - Optimize inference pipelines using high-performance frameworks such as vLLM, Groq, ONNX Runtime, Triton Inference Server, and TensorRT to minimize latency and cost. - Design and implement agentic AI systems using frameworks like LangChain, AutoGPT, and ReAct for autonomous task orchestration. - Fine-tune, integrate, and deploy foundation models like GPT, LLaMA, Claude, Mistral, Falcon, and others into intelligent applications. - Develop and maintain robust MLOps workflows to manage the entire model lifecycle including training, deployment, monitoring, and versioning. - Collaborate closely with DevOps teams to implement scalable serving infrastructure by leveraging containerization (Docker), orchestration (Kubernetes), and cloud platforms (AWS, GCP, Azure). - Implement retrieval-augmented generation (RAG) pipelines that integrate vector databases such as FAISS, Pinecone, or Weaviate. - Build observability systems for LLMs to monitor prompt performance, latency, and user feedback. - Work collaboratively with research, product, and operations teams to deliver production-grade AI systems that can handle real-world traffic patterns effectively. - Stay informed about emerging AI trends, hardware acceleration techniques, and actively contribute to open-source or research initiatives whenever possible. Additionally, it is important to note that this job requires you to have a strong background in machine learning and experience with various frameworks and tools mentioned in the job description.,

Mock Interview

Practice Video Interview with JobPe AI

Start Machine Learning Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You