Jobs
Interviews

12 Llm Evaluation Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

1.0 years

6 - 18 Lacs

in

Remote

About the job: Humanity Founders is hiring a Senior AI Engineer to build and ship AI systems that create real, positive social impact. In this full-time, remote role, you'll own the end-to-end lifecycle from problem framing and data pipelines to model training/finetuning, LLM ops, evaluation, and production deployment. You'll work closely with product, design, and domain experts, leading technical decisions and mentoring teammates while holding a high bar for quality, safety, and velocity. You'll architect reliable ML/LLM services, design robust evaluation and observability, and turn research into user-facing features. This is a hands-on role for an engineer who thrives in a fast, product-ce...

Posted 3 weeks ago

Apply

1.0 years

6 - 18 Lacs

in

On-site

About the job: As an AI Engineer at Humanity Founders, you will play a crucial role in developing cutting-edge AI solutions that have a positive impact on society. You will work closely with a talented team of engineers and data scientists to build and deploy innovative AI applications. Key responsibilities: 1. Develop and implement Machine Learning Operations (MLOps) strategies to streamline the development and deployment of AI models. 2. Utilize your expertise in Machine Learning, Artificial Intelligence, Neural Networks, LLM Ops, LLM evaluation, Training, and Development to enhance our AI capabilities. 3. Lead and participate in the prompt engineering of AI solutions using technologies su...

Posted 3 weeks ago

Apply

8.0 - 12.0 years

0 Lacs

karnataka

On-site

As a Senior AI/ML Engineer with 8 years of experience based in Bangalore, you will be responsible for leveraging your expertise in Prompt Engineering, Large Language Models (LLMs), and Agentic AI frameworks. Your primary focus will be on designing, implementing, and optimizing AI-driven solutions that capitalize on the latest advancements in generative AI. Your key responsibilities will include developing prompt engineering strategies to enhance AI model performance, fine-tuning LLM hyperparameters like temperature and top_p, building and deploying solutions using Agentic AI frameworks such as LangChain, LangGraph, and Crew AI. You will also be involved in implementing Retrieval-Augmented Ge...

Posted 4 weeks ago

Apply

4.0 - 8.0 years

0 Lacs

pune, maharashtra

On-site

You have a great opportunity with Infosys for the position of Python AI/ML with GEN AI. This is a contract role based in either Chennai or Pune (Onsite). You should have a minimum of 4 years of experience in AI/ML, specifically in GenAI, along with a total of 5+ years of industry experience. Your responsibilities will include demonstrating proficiency in Python, ML, NLP, and text data analysis. You should have hands-on experience in API creation, RAG, Vector databases, LLM prompt engineering, and LLM evaluation. It is important to be familiar with MLOps tools and practices, such as monitoring and evaluation tools like LangSmith, LLM deployment libraries like LiteLLM and LLM Guardrails, CI/CD...

Posted 1 month ago

Apply

1.0 years

2 - 6 Lacs

noida, uttar pradesh, in

On-site

About the job: Are you passionate about the future of AI and ML? We're looking for a developer who doesn't just write code, but co-creates with it. If you see tools like Chat GPT, GitHub Copilot, and Claude as your coding partners and not just a means to an end, then you've found the right place. We believe that great software is built on creative problem-solving and solid logic, not just language-specific syntax. You should be comfortable with any coding language and have a natural curiosity to find smarter, faster ways to solve problems. Have you used AI to debug tricky code or convert a project between languages? If so, you're the kind of person we want in our team. Key Responsibilities: ...

Posted 1 month ago

Apply

7.0 - 9.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

We are urgently looking for a Senior AI/ML Engineer (C2H) based in Bangalore for on-site work. 7+years Key Requirements: - Proficiency in Prompt Engineering and its strategies - Understanding of LLM functionality and its hyperparameters (temperature, top_p) - Experience with Agentic AI frameworks such as Langchain, Lang graph, and Crew AI - Familiarity with RAG - Skilled in LLM evaluation and observability - Strong command of Python - Ability to deploy on any one of the cloud services If you meet these requirements, we would like to hear from you. [HIDDEN TEXT] hashtag#immediatejoiners hashtag#AI hashtag#PYTHON hashtag#Machinelearning hashtag#banglore Show more Show less

Posted 1 month ago

Apply

1.0 years

6 Lacs

IN

Remote

About the job: As a Full Stack GenAI Developer at MeetMinutes, you will be responsible for creating cutting-edge AI solutions using Python, Generative AI Development, LangChain, LLM evaluation, LLMOps, JavaScript, React, Amazon Web Services (AWS), Google Cloud Platforms (GCP), Docker, Machine Learning, Natural Language Processing (NLP), PostgreSQL, REST API, FastAPI, GitHub, System Design, and Prompt Engineering. Key responsibilities: 1. Developing and implementing AI algorithms and models to enhance the functionality of our platform. 2. Integrating AI technologies and features into our existing systems to improve user experience. 3. Collaborating with the engineering team to optimize system...

Posted 3 months ago

Apply

1.0 years

2 - 8 Lacs

IN

Remote

About the job: Key responsibilities: 1. Research and experiment with state-of-the-art AI models including GPT, BERT, Stable Diffusion, and other generative architectures 2. Fine-tune, evaluate, and deploy large language models (LLMs) for various AI-driven applications 3. Design and build AI agents capable of autonomous decision-making and task execution 4. Develop and optimize machine learning pipelines and datasets for training AI models 5. Collaborate with software engineers and data scientists to integrate AI models into products 6. Stay updated with the latest AI research, papers, and trends in generative AI and reinforcement learning 7. Participate in code reviews, design discussions, a...

Posted 3 months ago

Apply

1.0 years

2 Lacs

Noida, Uttar Pradesh, IN

On-site

About the job: Key responsibilities: 1. Collaborate with founder and team leads on hiring needs, prepare job descriptions, post openings, screen candidates, conduct interviews, and handle offer rollouts 2. Oversee onboarding processes including offer letters, background checks, and orientation to ensure smooth integration of new hires 3. Build partnerships with academic institutions to source interns 4. Administer the end-to-end intern program including recruitment, training, performance tracking, and offboarding 5. Ensure accurate and timely tracking of timesheets and attendance, resolve discrepancies, and coordinate payroll processing with the finance team 6. Streamline cross-functional pr...

Posted 3 months ago

Apply

1.0 years

3 - 4 Lacs

Delhi, Delhi, IN

On-site

About the job: Voizer.ai is a cutting-edge AI-driven platform specializing in automation, voice technology, and intelligent workflows. We empower businesses with seamless AI integrations to enhance productivity and efficiency. Join our dynamic team to work on innovative automation solutions that redefine how businesses operate. Key responsibilities: 1. Design, develop, and deploy AI-powered automation workflows using n8n, Zapier, Make (Integrate), or similar tools. 2. Integrate AI models (NLP, LLMs, RPA) into business processes for enhanced efficiency. 3. Troubleshoot and optimize existing automations for performance and scalability. 4. Collaborate with developers, product managers, and busi...

Posted 4 months ago

Apply

4 - 9 years

7 - 17 Lacs

Mumbai

Work from Office

About the Role We're seeking a skilled Python QA Analyst to join our team focused on AI and agentic systems. In this role, you'll ensure the quality and reliability of our AI products while occasionally supporting agentic development during periods of lower QA workload. Key Responsibilities Design, implement and maintain comprehensive test suites for AI and agentic systems Create automated testing frameworks using Python to validate AI model outputs and behaviors Develop test cases that cover edge scenarios and potential failure modes in AI systems Perform thorough regression testing on new releases and feature updates Validate that AI systems meet functional requirements and performance ben...

Posted 4 months ago

Apply

3.0 - 8.0 years

6 - 12 Lacs

hyderabad, pune, bengaluru

Work from Office

Analyze GitHub issues, configure codebases with Docker, evaluate test coverage, and assess LLM bug-fixing performance. Collaborate on open-source research and lead junior engineers in AI-driven code evaluation projects.

Posted Date not available

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies