Get alerts for new jobs matching your selected skills, preferred locations, and experience range.
4.0 years
0 Lacs
Bengaluru, Karnataka, India
Remote
Company Description O-Health is digital healthcare company dedicated to AI-driven digital solutions. Our platform connects patients in remote areas to doctors and utilizes NLP and AI for diagnostics. Role Description This is a full-time on-site role for a NLP + ML Engineer at O-Health located in Bengaluru. The NLP + ML Engineer will be responsible for pattern recognition in text, working with neural networks, implementing algorithms, and analyzing statistics on a daily basis in a healthtech ecosystem. Qualifications Experience in Neural Networks, Data Science and Pattern Recognition Strong background in Computer Science and Statistics Proficiency in machine learning frameworks and tools Excellent problem-solving and analytical skills Ability to work collaboratively in a team environment Master's/Bachelor's in Computer Science, Engineering, Mathematics, or related field with atleast 4 years of experience Experience in development of multi-lingual ASR systems Responsibilities: Design and develop robust backend systems to handle real-time patient data and ML outputs. Develop and integrate machine learning models with APIs to the main O-Health application. Optimize model serving pipelines (e.g. using TorchServe, FastAPI, or ONNX). Manage data pipelines for de-identified OPD datasets used in model training and inference. Implement data encryption, anonymization, and consent-based data access. Development of multilingual voice and text processing. Support versioning and A/B testing of health algorithms. Required Skills:Backend Engineering Strong in Python with frameworks like FastAPI with experience in DBMS. Experience with RESTful APIs, WebSockets, and asynchronous data flows. Familiar with PostgreSQL databases. Working knowledge of Docker, Git, and CI/CD pipelines. Machine Learning Ops Hands-on with PyTorch, Scikit-learn, or TensorFlow for inference integration. Comfortable with model optimization, quantization, and edge deployment formats (e.g. ONNX, TFLite). Familiarity with language models (LLMs) and multilingual NLP. Knowledge of data preprocessing, tokenization, and feature engineering for clinical/NLP tasks. Other Required Skills Understanding of HIPAA/GDPR compliance. Experience working on healthcare, social impact, or AI-for-good projects. What You'll Impact: You’ll play a pivotal role in connecting machine learning research with field-ready healthcare tools. Your work will help scale diagnosis support systems to thousands of underserved patients and power multilingual health consultations in real-time. Show more Show less
Posted 2 weeks ago
3.0 years
0 Lacs
Mohali, Punjab
On-site
Company: Chicmic Studios Job Role: Python Machine Learning & AI Developer Experience Required: 3+ Years We are looking for a highly skilled and experienced Python Developer to join our dynamic team. The ideal candidate will have a robust background in developing web applications using Django and Flask, with expertise in deploying and managing applications on AWS. Proficiency in Django Rest Framework (DRF), a solid understanding of machine learning concepts, and hands-on experience with tools like PyTorch, TensorFlow, and transformer architectures are essential. Key Responsibilities Develop and maintain web applications using Django and Flask frameworks. Design and implement RESTful APIs using Django Rest Framework (DRF) Deploy, manage, and optimize applications on AWS services, including EC2, S3, RDS, Lambda, and CloudFormation. Build and integrate APIs for AI/ML models into existing systems. Create scalable machine learning models using frameworks like PyTorch , TensorFlow , and scikit-learn . Implement transformer architectures (e.g., BERT, GPT) for NLP and other advanced AI use cases. Optimize machine learning models through advanced techniques such as hyperparameter tuning, pruning, and quantization. Deploy and manage machine learning models in production environments using tools like TensorFlow Serving , TorchServe , and AWS SageMaker . Ensure the scalability, performance, and reliability of applications and deployed models. Collaborate with cross-functional teams to analyze requirements and deliver effective technical solutions. Write clean, maintainable, and efficient code following best practices. Conduct code reviews and provide constructive feedback to peers. Stay up-to-date with the latest industry trends and technologies, particularly in AI/ML. Required Skills and Qualifications Bachelor’s degree in Computer Science, Engineering, or a related field. 3+ years of professional experience as a Python Developer. Proficient in Python with a strong understanding of its ecosystem. Extensive experience with Django and Flask frameworks. Hands-on experience with AWS services for application deployment and management. Strong knowledge of Django Rest Framework (DRF) for building APIs. Expertise in machine learning frameworks such as PyTorch , TensorFlow , and scikit-learn . Experience with transformer architectures for NLP and advanced AI solutions. Solid understanding of SQL and NoSQL databases (e.g., PostgreSQL, MongoDB). Familiarity with MLOps practices for managing the machine learning lifecycle. Basic knowledge of front-end technologies (e.g., JavaScript, HTML, CSS) is a plus. Excellent problem-solving skills and the ability to work independently and as part of a team. Strong communication skills and the ability to articulate complex technical concepts to non-technical stakeholders. Contact : 9875952836 Office Location: F273, Phase 8b Industrial Area Mohali, Punjab. Job Type: Full-time Schedule: Day shift Monday to Friday Work Location: In person
Posted 2 weeks ago
3.0 years
0 Lacs
Punjab, India
On-site
Key Responsibilities Develop and maintain web applications using Django and Flask frameworks. Design and implement RESTful APIs using Django Rest Framework (DRF). Deploy, manage, and optimize applications on AWS services, including EC2, S3, RDS, Lambda, and CloudFormation. Build and integrate APIs for AI/ML models into existing systems. Create scalable machine learning models using frameworks like PyTorch, TensorFlow, and scikit-learn. Implement transformer architectures (e.g., BERT, GPT) for NLP and other advanced AI use cases. Optimize machine learning models through advanced techniques such as hyperparameter tuning, pruning, and quantization. Deploy and manage machine learning models in production environments using tools like TensorFlow Serving, TorchServe, and AWS SageMaker. Ensure the scalability, performance, and reliability of applications and deployed models. Collaborate with cross-functional teams to analyze requirements and deliver effective technical solutions. Write clean, maintainable, and efficient code following best practices. Conduct code reviews and provide constructive feedback to peers. Stay up-to-date with the latest industry trends and technologies, particularly in AI/ML. Required Skills And Qualifications Bachelors degree in Computer Science, Engineering. 3+ years of professional experience as a Python Developer. Proficient in Python with a strong understanding of its ecosystem. Extensive experience with Django and Flask frameworks. Hands-on experience with AWS services for application deployment and management. Strong knowledge of Django Rest Framework (DRF) for building APIs. Expertise in machine learning frameworks such as PyTorch, TensorFlow, and scikit-learn. Experience with transformer architectures for NLP and advanced AI solutions. Solid understanding of SQL and NoSQL databases (e.g., PostgreSQL, MongoDB). Familiarity with MLOps practices for managing the machine learning lifecycle. Basic knowledge of front-end technologies (e.g., JavaScript, HTML, CSS) is a plus. (ref:hirist.tech) Show more Show less
Posted 2 weeks ago
2.0 years
0 Lacs
Hyderābād
On-site
We are seeking a highly skilled Data Scientist (LLM/Gen AI Engineer) to join our AI and Machine Learning team. In this role, you will focus on the research, development, and deployment of Large Language Models (LLMs) and Generative AI solutions to solve real-world problems and enhance intelligent applications. You will work closely with cross-functional teams including data scientists, machine learning engineers, and product managers to build scalable, production-ready Gen AI systems. Key Responsibilities: Research, fine-tune, and deploy Large Language Models (LLMs) such as GPT, LLaMA, Mistral, or similar. Design and implement end-to-end Gen AI pipelines for tasks such as summarization, question answering, code generation, and retrieval-augmented generation (RAG). Preprocess and curate large datasets for training and evaluation of language models. Optimize models for efficiency, accuracy, and scalability using techniques like quantization, distillation, and model pruning. Integrate LLM-based solutions with existing products, APIs, and user-facing applications. Evaluate model performance using metrics like BLEU, ROUGE, perplexity, and human evaluation. Stay up-to-date with the latest trends in AI research, LLM architecture, and Gen AI tools. Collaborate with engineers to scale models in production and monitor model drift or degradation. Document experiments, model behaviors, and results to guide reproducibility and future development. Required Qualifications: Bachelor’s or Master’s degree in Computer Science, Data Science, Artificial Intelligence, Machine Learning, or related field. 2–5+ years of experience working as a Data Scientist, ML Engineer, or AI Researcher. Hands-on experience with Large Language Models (OpenAI GPT, PaLM, Claude, LLaMA, Mistral, etc.). Proficient in Python and popular ML/NLP libraries (e.g., PyTorch, TensorFlow, Hugging Face Transformers, LangChain). Deep understanding of NLP, tokenization, embeddings, transformers, and attention mechanisms. Familiarity with prompt engineering and fine-tuning techniques (LoRA, PEFT, etc.). Experience deploying models using cloud platforms (AWS, GCP, Azure) and container tools (Docker, Kubernetes). Strong analytical, problem-solving, and communication skills. Job Types: Full-time, Permanent Pay: Up to ₹30,000.00 per month Benefits: Flexible schedule Schedule: Day shift Ability to commute/relocate: Hyderabad, Telangana: Reliably commute or planning to relocate before starting work (Preferred) Application Question(s): Mention your current location Work Location: In person
Posted 2 weeks ago
5.0 years
12 - 18 Lacs
Hyderābād
On-site
Job Title : Senior Machine Learning Engineer – Generative AI Specialist Location : Hyderabad Experience : 5–7 Years (Minimum 2 Years in Generative AI) Compensation : ₹12–18 LPA Job Type : Full-Time, Permanent About Us We’re an innovation-driven AI company solving complex business challenges using the power of Generative AI . Our clients span healthcare, telecom, BFSI, and government sectors – and we’re scaling fast. We’re looking for a hands-on, go-getter ML Engineer who’s been in the trenches building and deploying production-grade LLM applications. Role Summary As a Senior ML Engineer (Generative AI) , you'll lead the development of end-to-end AI solutions, with a strong emphasis on LLM fine-tuning, vector retrieval (RAG), multimodal AI, and private deployments . This is not a research-only role — we want someone who ships code, owns systems, and delivers measurable impact. Key Responsibilities Design and deploy LLM-powered applications (chatbots, summarizers, content generators, etc.) using OpenAI, LLaMA, Mixtral, Claude, or proprietary models Fine-tune foundation models using PEFT, LoRA, QLoRA , and domain-specific datasets Build RAG pipelines using LangChain, LlamaIndex , or custom frameworks Integrate vector databases (FAISS, Pinecone, Qdrant) for semantic search and context retrieval Develop prompt orchestration flows, agents, and tools using frameworks like Autogen, Haystack, or Semantic Kernel Work with unstructured data (PDFs, emails, voice, images) and convert it into usable AI-ready formats Collaborate with product and engineering teams to build scalable GenAI-powered apps Own deployment via APIs, containers, and MLOps workflows (Hugging Face Hub, Azure/AWS/GCP) Must-Have Skills 5–7 years in ML/AI with at least 2+ years hands-on Gen AI (not just R&D) Deep knowledge of NLP, Transformer architectures , and fine-tuning LLMs Strong command of Python, PyTorch/TensorFlow , and LLM libraries (Transformers, SentenceTransformers, LangChain, etc.) Experience in RAG design patterns and private LLM deployments Worked with Whisper, Stable Diffusion, Gemini, or other multimodal models Clear track record of building and deploying Gen AI apps to production Bonus Skills Experience building Gen AI SaaS platforms or tooling Knowledge of RLHF , prompt engineering, instruction tuning Understanding of data governance and compliance in Gen AI applications Familiarity with embedding optimization , latency reduction, quantization strategies Job Type: Full-time Pay: ₹1,200,000.00 - ₹1,800,000.00 per year Benefits: Health insurance Life insurance Provident Fund Schedule: Day shift Work Location: In person
Posted 2 weeks ago
2.0 - 5.0 years
0 Lacs
Kolkata, West Bengal, India
On-site
About the Role We are seeking an AI/ML Engineer to design, develop, and deploy machine learning models for real-world applications. This role emphasizes Retrieval-Augmented Generation (RAG), fine-tuning Large Language Models (LLMs), and speech-to-speech model training and evaluation. If you're passionate about advancing AI technologies and enjoy collaborative problem-solving, we'd love to hear from you. Key Responsibilities Model Development: Design and implement machine learning models, focusing on RAG techniques to enhance LLM performance by integrating external data sources. Fine-Tuning: Fine-tune pre-trained LLMs to improve reasoning and logic capabilities, ensuring models are current with the latest information. Speech-to-Speech Modeling: Develop and optimize models for speech-to-speech conversion, emphasizing real-time processing and accuracy. System Integration: Collaborate with cross-functional teams to integrate models into production systems, ensuring seamless deployment and scalability. Continuous Learning: Stay updated on AI/ML advancements, particularly in RAG, LLM fine-tuning, and frameworks like LlamaIndex and LangChain, applying innovative techniques to enhance model performance. Performance Evaluation: Assess and refine models using relevant performance metrics to ensure they meet application requirements. Code Quality: Write efficient, robust, and maintainable production-level code. Required Skills & Qualifications Educational Background: Bachelor's or Master's degree in Computer Science, AI/ML, Data Science, or a related field. Experience: 2-5 years in developing and deploying ML models, with a focus on RAG and LLM fine-tuning. Programming Skills: Proficiency in Python and experience with ML frameworks such as TensorFlow or PyTorch. Specialized Knowledge: Hands-on experience in deep learning, NLP, and speech/audio processing. Framework Proficiency: Experience with LlamaIndex and LangChain for efficient data retrieval, indexing, and building complex AI workflows. Cloud Platforms: Experience with cloud platforms (AWS, GCP) for model deployment and scaling. Optimization Techniques: Knowledge of model optimization methods, including quantization and pruning. MLOps Practices: Familiarity with MLOps, CI/CD pipelines, and containerization tools like Docker and Kubernetes. Problem-Solving: Strong analytical skills and the ability to work in a fast-paced environment. Preferred Qualifications Real-Time Processing: Experience in real-time audio processing and speech AI. Advanced Models: Understanding of transformer-based models and generative AI. Distributed Computing: Knowledge of distributed computing frameworks like Spark. Fill the form - https://forms.gle/gcg6jUoeZMvUxRSX8 Show more Show less
Posted 2 weeks ago
3.0 years
0 Lacs
Hyderabad, Telangana, India
On-site
Company Qualcomm India Private Limited Job Area Engineering Group, Engineering Group > Systems Engineering General Summary Qualcomm is a company of inventors that unlocked AI on Edge - ushering in an age of rapid acceleration in connectivity and new possibilities that will transform industries, create jobs, and enrich lives. But this is just the beginning. It takes inventive minds with diverse skills, backgrounds, and cultures to transform 5Gs potential into world-changing technologies and products. This is the Invention Age - and this is where you come in. QUALCOMM is the world's leading developer of next generation wireless and multimedia technology. Immediate opportunities exist in QUALCOMM's Multimedia Systems Group to work in the area of Low Power ML Accelerator, developing embedded software for next generation low power NPU. You will be part of the Multimedia Systems and R&D team and develop low power AI accelerator runtime software for Qualcomm Snapdragon platforms. Responsibilities The candidate will be expected to work with a team of engineers to design, implement, integrate, and test kernels for ML operators for Qualcomm's Low power ML accelerator. Design, implement, integrate, and test kernels for ML operators for HW accelerator. Create test framework for tracking performance and power metrics. Work closely with other teams for ML model offload system integration, use case testing and commercialization support. Requirements Strong programming skills in C/C++, Python Expertise in developing and debugging software on embedded platforms. Knowledge of ML operators such as Transformers, LSTM, GRUs.. Knowledge of software design patterns and multi-threaded programming, Eg POSIX Knowledge of computer architecture, operating systems, data structures, and basic algorithms knowledge of fixed-point coding Knowledge of any ML frameworks pytorch, tensorflow.. Knowledge of Model quantization and compression techniques is a plus. Working on ML inference optimizations is a plus. Experience working on any AI HW accelerator (NPU) is a plus. Proven ability to work in a dynamic, multi-tasked environment. Self-starter who likes to be challenged and solve tough complex issues. Educational Qualification Bachelor's/Master’s/PhD degree in Engineering, Electronics and communication, Computer Science or related filed. Minimum Qualifications Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Systems Engineering or related work experience. OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Systems Engineering or related work experience. OR PhD in Engineering, Information Systems, Computer Science, or related field and 1+ year of Systems Engineering or related work experience. Applicants : Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries). Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law. To all Staffing and Recruiting Agencies : Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications. If you would like more information about this role, please contact Qualcomm Careers. 3074352 Show more Show less
Posted 2 weeks ago
0.0 years
0 Lacs
Hyderabad, Telangana
On-site
We are seeking a highly skilled Data Scientist (LLM/Gen AI Engineer) to join our AI and Machine Learning team. In this role, you will focus on the research, development, and deployment of Large Language Models (LLMs) and Generative AI solutions to solve real-world problems and enhance intelligent applications. You will work closely with cross-functional teams including data scientists, machine learning engineers, and product managers to build scalable, production-ready Gen AI systems. Key Responsibilities: Research, fine-tune, and deploy Large Language Models (LLMs) such as GPT, LLaMA, Mistral, or similar. Design and implement end-to-end Gen AI pipelines for tasks such as summarization, question answering, code generation, and retrieval-augmented generation (RAG). Preprocess and curate large datasets for training and evaluation of language models. Optimize models for efficiency, accuracy, and scalability using techniques like quantization, distillation, and model pruning. Integrate LLM-based solutions with existing products, APIs, and user-facing applications. Evaluate model performance using metrics like BLEU, ROUGE, perplexity, and human evaluation. Stay up-to-date with the latest trends in AI research, LLM architecture, and Gen AI tools. Collaborate with engineers to scale models in production and monitor model drift or degradation. Document experiments, model behaviors, and results to guide reproducibility and future development. Required Qualifications: Bachelor’s or Master’s degree in Computer Science, Data Science, Artificial Intelligence, Machine Learning, or related field. 2–5+ years of experience working as a Data Scientist, ML Engineer, or AI Researcher. Hands-on experience with Large Language Models (OpenAI GPT, PaLM, Claude, LLaMA, Mistral, etc.). Proficient in Python and popular ML/NLP libraries (e.g., PyTorch, TensorFlow, Hugging Face Transformers, LangChain). Deep understanding of NLP, tokenization, embeddings, transformers, and attention mechanisms. Familiarity with prompt engineering and fine-tuning techniques (LoRA, PEFT, etc.). Experience deploying models using cloud platforms (AWS, GCP, Azure) and container tools (Docker, Kubernetes). Strong analytical, problem-solving, and communication skills. Job Types: Full-time, Permanent Pay: Up to ₹30,000.00 per month Benefits: Flexible schedule Schedule: Day shift Ability to commute/relocate: Hyderabad, Telangana: Reliably commute or planning to relocate before starting work (Preferred) Application Question(s): Mention your current location Work Location: In person
Posted 2 weeks ago
2.0 years
0 Lacs
India
Remote
ob Title: AI Full stack Developer – GenAI & NLP Location: Pune, India (Hybrid) Work Mode: Remote Experience Required: 2+ Years (Relevant AI/ML with GenAI & NLP) Salary: Up to ₹15 LPA (CTC) Employment Type: Full-time Department: AI Research & Development Role Overview We are looking for a passionate AI Developer with strong hands-on experience in Generative AI and Natural Language Processing (NLP) to help build intelligent and scalable solutions. In this role, you will design and deploy advanced AI models for tasks such as language generation, summarization, chatbot development, document analysis, and more. You’ll work with cutting-edge LLMs (Large Language Models) and contribute to impactful AI initiatives. Key Responsibilities Design, fine-tune, and deploy NLP and GenAI models using LLMs like GPT, BERT, LLaMA, or similar. Build applications for tasks like text generation, question-answering, summarization, sentiment analysis, and semantic search. Integrate language models into production systems using RESTful APIs or cloud services. Evaluate and optimize models for accuracy, latency, and cost. Collaborate with product and engineering teams to implement intelligent user-facing features. Preprocess and annotate text data, create custom datasets, and manage model pipelines. Stay updated on the latest advancements in generative AI, transformer models, and NLP frameworks. Required Skills & Qualifications Bachelor’s or Master’s degree in Computer Science, AI/ML, or a related field. Minimum 2 years of experience in fullstack development and AI/ML development, with recent work in NLP or Generative AI. Hands-on experience with models such as GPT, T5, BERT, or similar transformer-based architectures. Proficient in Python and libraries such as Hugging Face Transformers, spaCy, NLTK, or OpenAI APIs. Hands-on experience in any frontend/ backend technologies for software development. Experience with deploying models using Flask, FastAPI, or similar frameworks. Strong understanding of NLP tasks, embeddings, vector databases (e.g., FAISS, Pinecone), and prompt engineering. Familiarity with MLOps tools and cloud platforms (AWS, Azure, or GCP). Preferred Qualifications Experience with LangChain, RAG (Retrieval-Augmented Generation), or custom LLM fine-tuning. Knowledge of model compression, quantization, or inference optimization. Exposure to ethical AI, model interpretability, and data privacy practices. What We Offer Competitive salary package up to ₹15 LPA. Remote work flexibility with hybrid team collaboration in Pune. Opportunity to work on real-world generative AI and NLP applications. Access to resources for continuous learning and certification support. Inclusive, fast-paced, and innovative work culture. Skills: ci/cd,model interpretability,openai,langchain,aws,rag architectures,javascript,azure,google cloud,nltk,next.js,kubernetes,large language models,hugging face,ai tools,python,gcp,quantization,deep learning,machine learning (ml),nlp,ai development,mlops tools,backend technologies,llama,pytorch,ethical ai,frontend technologies,r,react.js,cloud,flask,data privacy,spacy,hugging face transformers,computer vision,nlp tasks,retrieval-augmented generation (rag),tensorflow,ml,gpt,ai technologies,large language models (llms),generative ai,fastapi,openai apis,embeddings,natural language processing,artificial intelligence,bert,model compression,natural language processing (nlp),vector databases,django,mlops,node.js,docker,inference optimization,java,machine learning,typescript Show more Show less
Posted 2 weeks ago
0 years
0 Lacs
Noida, Uttar Pradesh, India
On-site
Generative AI Model Development: Design, train, and fine-tune LLMs (GPT, LLaMA, Claude), diffusion models, and multimodal AI systems for various applications. NLP & Deep Learning Expertise: Implement and optimize Transformer-based architectures (BERT, T5, GPT, etc.) for text generation, summarization, and embeddings. AI Pipeline Engineering: Develop end-to-end AI pipelines , including data preprocessing, model training, inference, and deployment using PyTorch, TensorFlow, or JAX. Agile Development: Work in an Agile/Scrum environment, participating in standups, sprint planning, and retrospectives to iteratively develop AI-driven products. Fine-tuning & Optimization: Optimize pre-trained models through LoRA, PEFT, quantization, and distillation for efficiency and cost-effectiveness. Cloud & MLOps Integration: Deploy AI models on AWS using Kubernetes, Docker, FastAPI, and serverless architectures . Vector Databases & Retrieval-Augmented Generation (RAG): Work with FAISS, ChromaDB, Weaviate, or Pinecone to enhance generative models with retrieval-augmented techniques. AI Security & Ethics: Implement safeguards against bias, hallucinations, adversarial attacks , and ensure compliance with AI governance frameworks (e.g., GDPR, HIPAA). Experimentation & Research: Stay up-to-date with the latest AI advancements, conduct A/B testing , and experiment with new zero-shot, few-shot, and fine-tuning techniques. Collaboration & Mentorship: Work with cross-functional teams (data scientists, software engineers, product teams) and mentor junior AI engineers. Show more Show less
Posted 2 weeks ago
3.0 - 8.0 years
5 - 10 Lacs
Noida
Work from Office
About The Role Were building an agentic AI platform that turns one line of text and a video feed into end-to-end, real-time computer-vision solutionsthink semantic video search, object / action recognition, and task-oriented visual agents deployable with a single click As a Gen AI ML Engineer, youll architect the core vision & multimodal-reasoning stack and pave the road from prototype to production. Roles And Responsibilities Semantic video search Ship a pipeline that allows users to type show every forklift near aisle 5 in the last 30 minutes and get keyed-off clips in Wire embeddings to a hybrid FAISS/HNSW index; surface results through a simple REST & React playground. Create agentic pipelines Chain vision language models and zero/few-shot vision models with LLM planners (Gemini, GPT-4o, AutoGen, etc.) so a single prompt becomes a multi-step perception workflow. Profile and accelerate inference (TensorRT, ONNX, quantization, batching) to meet latency / throughput targets on GPU and CPU fleets. Rapid prototyping loops Run weekly paper-to-prototype spikes: reproduce a fresh arXiv idea, benchmark, and decide go/no-go in Hand successful python scripts & checkpoints to MLOps for productionizationno plumbing marathons. Data & Evaluation Spin up scalable pipelines for video ingestion, labeling (active learning, weak supervision), experiment tracking, and continuous evaluation. Collaborate & Lead Partner with product and ML Ops engineers; set research direction, mentor future hires, and establish best practices. Must-have Skill Set 13 years deep-learning research experience (internships & grad work count). Fluency in Python + PyTorch; comfortable hacking large vision/LLM repos. Proof you ship ideasfirst-author paper, OSS repo, Kaggle medal, or faithful reproduction of a cutting-edge model. Hands-on with LLM prompting/fine-tuning and at least one agent framework. Able to turn fuzzy product asks into measurable experiments and explain results clearly. Bonus Cred Large-scale video retrieval or temporal grounding experience. Prior work building agentic-AI pipelines that combine perception models with LLM reasoning. Open-source contributions to GenAI/vision libs (OpenCLIP, Vid2Seq, ViperGPT, etc.). What can you expect? Ability to shape the future of manufacturing by leveraging best-in-class AI and software; we are a unique organization with niche skill set that you would also develop while working with us World class work culture, coaching and development Mentoring from highly experienced leadership from world class companies (refer to Ripik.AI website for details) International exposure Work Location NOIDA (Work from Office)
Posted 2 weeks ago
0.0 - 10.0 years
0 Lacs
Chennai, Tamil Nadu
On-site
Job Information Company Yubi Date Opened 05/28/2025 Job Type Full time Work Experience 6-10 years Industry Technology City Chennai State/Province Tamil Nadu Country India Zip/Postal Code 600001 About Us Yubi stands for ubiquitous. But Yubi will also stand for transparency, collaboration, and the power of possibility. From being a disruptor in India’s debt market to marching towards global corporate markets from one product to one holistic product suite with seven products Yubi is the place to unleash potential. Freedom, not fear. Avenues, not roadblocks. Opportunity, not obstacles. Job Description About Yubi Yubi, formerly known as CredAvenue, is re-defining global debt markets by freeing the flow of finance between borrowers, lenders, and investors. We are the world's possibility platform for the discovery, investment, fulfillment, and collection of any debt solution. At Yubi, opportunities are plenty and we equip you with tools to seize it. In March 2022, we became India's fastest fintech and most impactful startup to join the unicorn club with a Series B fundraising round of $137 million. In 2020, we began our journey with a vision of transforming and deepening the global institutional debt market through technology. Our two-sided debt marketplace helps institutional and HNI investors find the widest network of corporate borrowers and debt products on one side and helps corporates to discover investors and access debt capital efficiently on the other side. Switching between platforms is easy, which means investors can lend, invest and trade bonds - all in one place. All of our platforms shake up the traditional debt ecosystem and offer new ways of digital finance. Yubi Credit Marketplace - With the largest selection of lenders on one platform, our credit marketplace helps enterprises partner with lenders of their choice for any and all capital requirements. Yubi Invest - Fixed income securities platform for wealth managers & financial advisors to channel client investments in fixed income Financial Services Platform - Designed for financial institutions to manage co-lending partnerships & asset based securitization Spocto - Debt recovery & risk mitigation platform Corpository - Dedicated SaaS solutions platform powered by Decision-grade data, Analytics, Pattern Identifications, Early Warning Signals and Predictions to Lenders, Investors and Business Enterprises So far, we have on-boarded over 17000+ enterprises, 6200+ investors & lenders and have facilitated debt volumes of over INR 1,40,000 crore. Backed by marquee investors like Insight Partners, B Capital Group, Dragoneer, Sequoia Capital, LightSpeed and Lightrock, we are the only-of-its-kind debt platform globally, revolutionizing the segment. At Yubi, People are at the core of the business and our most valuable assets. Yubi is constantly growing, with 1000+ like-minded individuals today, who are changing the way people perceive debt. We are a fun bunch who are highly motivated and driven to create a purposeful impact. Come, join the club to be a part of our epic growth story. About the Role We're looking for a highly skilled, results-driven AI Developer who thrives in fast-paced, high-impact environments. If you are passionate about pushing the boundaries of Computer Vision, OCR, NLP and and Large Language Models (LLMs) and have a strong foundation in building and deploying AI solutions, this role is for you. As a Lead Data Scientist, you will take ownership of designing and implementing state-of-the-art AI products. This role demands deep technical expertise, the ability to work autonomously, and a mindset that embraces complex challenges head-on. Here, you won't just fine-tune pre-trained models—you'll be architecting, optimizing, and scaling AI solutions that power real-world applications. Key Responsibilities Architect, develop, and deploy high-performance AI Solutions for real-world applications. Implement and optimize state-of-the-art LLM , OCR models and frameworks. Fine-tune and integrate LLMs (GPT, LLaMA, Mistral, etc.) to enhance text understanding and automation. Build and optimize end-to-end AI pipelines, ensuring efficient data processing and model deployment. Work closely with engineers to operationalize AI models in production (Docker, FastAPI, TensorRT, ONNX). Enhance GPU performance and model inference efficiency, applying techniques such as quantization and pruning. Stay ahead of industry advancements, continuously experimenting with new AI architectures and training techniques. Work in a highly dynamic, startup-like environment, balancing rapid experimentation with production-grade robustness. What We're Looking For Requirements Required Skills & Qualifications: Proven technical expertise – Strong programming skills in Python, PyTorch, TensorFlow with deep experience in NLP and LLM Hands-on experience in developing, training, and deploying LLM and Agentic workflows Strong background in vector databases, RAG pipelines, and fine-tuning LLMs for document intelligence. Deep understanding of Transformer-based architectures for vision and text processing. Experience working with Hugging Face, OpenCV, TensorRT, and NVIDIA GPUs for model acceleration. Autonomous problem solver – You take initiative, work independently, and drive projects from research to production. Strong experience in scaling AI solutions, including model optimization and deployment on cloud platforms (AWS/GCP/Azure). Thrives in fast-paced environments – You embrace challenges, pivot quickly, and execute effectively. Familiarity with MLOps tools (Docker, FastAPI, Kubernetes) for seamless model deployment. Experience in multi-modal models (Vision + Text). Good to Have Financial background and understanding corporate finance . Contributions to open-source AI projects.
Posted 2 weeks ago
3.0 years
0 Lacs
Gurugram, Haryana, India
On-site
Were seeking a Generative AI Engineer with 3-6 years of experience to design, develop, and deploy AI models that enhance our travel platforms. Youll work on LLMs, diffusion models, NLP, and other generative techniques to build solutions like dynamic content creation, conversational AI, and predictive travel recommendations. Skills & Qualifications 3-6 years of hands-on experience in AI/ML, with at least 1-2 years focused on generative AI. Proficiency in Python, PyTorch/TensorFlow, and frameworks like LangChain, Hugging Face, or LlamaIndex. Experience with LLM fine-tuning, prompt engineering, and RAG architectures. Knowledge of cloud platforms (AWS/GCP/Azure) and MLOps tools (MLflow, Kubeflow). Familiarity with travel industry data (e.g., booking systems, customer reviews) is a plus. Strong problem-solving skills and ability to translate business needs into AI solutions. Key Responsibilities Design, train, and fine-tune generative AI models (e.g., GPT, Llama, Stable Diffusion) for travel-specific use cases. Implement NLP pipelines for chatbots, personalized recommendations, and automated content generation. Optimize models for performance, scalability, and cost-efficiency (e.g., quantization, distillation). Collaborate with product teams to integrate AI into customer-facing applications (e.g., dynamic itineraries, virtual travel assistants). Stay ahead of industry trends (e.g., multimodal AI, RAG, autonomous agents) and prototype innovative solutions. Ensure ethical AI practices, bias mitigation, and compliance with data privacy regulations. Nice-to-Have Publications or projects in generative AI (GitHub, blogs, research papers). Experience with multimodal models (text + image/video generation). Exposure to graph neural networks (GNNs) for recommendation systems. What We Offer A fast-paced, collaborative, and growth-oriented environment. Direct impact on products used by millions of global travelers. Work on real-world AI challenges in a dynamic travel-tech environment. Competitive salary, and travel perks. Flexible work culture with a focus on innovation. Why Join Us ? At Thrillophilia, you will be part of a team that is dedicated to redefining the future of travel. We have millions of users, but to reach the next milestone, we need fresh perspectives and bold ideas to perfect every product and process. Here, you wont find the typical startup clichéstheres no excess, no fluff, just the raw, exhilarating challenge of creating the future of travel. At Thrillophilia, we dont just offer a job, we offer an experience! From Holis vibrant colors to Diwalis festive lights, every moment here is a celebration of life, energy, and creativity. We believe in empowering young minds to think big, innovate, and growbecause passion drives progress. Whether it's our grand festivals or recognizing and celebrating our top performers at the RnR, we make sure success never goes unnoticed. Forget the robotic 9-to-5; at Thrillophilia, we thrive on spontaneity, collaboration, and making every day feel like a grand event. (ref:hirist.tech) Show more Show less
Posted 2 weeks ago
5.0 - 7.0 years
0 Lacs
Kochi, Kerala, India
On-site
Highly skilled Senior Machine Learning Engineer with expertise in Deep Learning, Large Language Models (LLMs), and MLOps/LLMOps to design, optimize, and deploy cutting-edge AI solutions. The ideal candidate will have hands-on experience in developing and scaling deep learning models, fine-tuning LLMs/ (e.g., GPT, Llama), and implementing robust deployment pipelines for production environments. Responsibilities Model Development & Fine-Tuning: - Design, train, fine-tune and optimize deep learning models (CNNs, RNNs, Transformers) for NLP, computer vision, or multimodal applications. - Fine-tune and adapt Large Language Models (LLMs) for domain-specific tasks (e.g., text generation, summarization, semantic similarity). - Experiment with RLHF (Reinforcement Learning from Human Feedback) and other alignment techniques. Deployment & Scalability (MLOps/LLMOps): - Build and maintain end-to-end ML pipelines for training, evaluation, and deployment. - Deploy LLMs and deep learning models in production environments using frameworks like FastAPI, vLLM, or TensorRT. - Optimize models for low-latency, high-throughput inference (eg., quantization, distillation, etc.). - Implement CI/CD workflows for ML systems using tools like MLflow, Kubeflow. Monitoring & Optimization: - Set up logging, monitoring, and alerting for model performance (drift, latency, accuracy). - Work with DevOps teams to ensure scalability, security, and cost-efficiency of deployed models. Required Skills & Qualifications: - 5-7 years of hands-on experience in Deep Learning, NLP, and LLMs. - Strong proficiency in Python, PyTorch, TensorFlow, Hugging Face Transformers, and LLM frameworks. - Experience with model deployment tools (Docker, Kubernetes, FastAPI). - Knowledge of MLOps/LLMOps best practices (model versioning, A/B testing, canary deployments). - Familiarity with cloud platforms (AWS, GCP, Azure). Preferred Qualifications: - Contributions to open-source LLM projects. Show more Show less
Posted 2 weeks ago
3.0 years
0 Lacs
Mohali, Punjab
On-site
Company: Chicmic Studios Job Role: Python Machine Learning & AI Developer Experience Required: 3+ Years We are looking for a highly skilled and experienced Python Developer to join our dynamic team. The ideal candidate will have a robust background in developing web applications using Django and Flask, with expertise in deploying and managing applications on AWS. Proficiency in Django Rest Framework (DRF), a solid understanding of machine learning concepts, and hands-on experience with tools like PyTorch, TensorFlow, and transformer architectures are essential. Key Responsibilities Develop and maintain web applications using Django and Flask frameworks. Design and implement RESTful APIs using Django Rest Framework (DRF) Deploy, manage, and optimize applications on AWS services, including EC2, S3, RDS, Lambda, and CloudFormation. Build and integrate APIs for AI/ML models into existing systems. Create scalable machine learning models using frameworks like PyTorch , TensorFlow , and scikit-learn . Implement transformer architectures (e.g., BERT, GPT) for NLP and other advanced AI use cases. Optimize machine learning models through advanced techniques such as hyperparameter tuning, pruning, and quantization. Deploy and manage machine learning models in production environments using tools like TensorFlow Serving , TorchServe , and AWS SageMaker . Ensure the scalability, performance, and reliability of applications and deployed models. Collaborate with cross-functional teams to analyze requirements and deliver effective technical solutions. Write clean, maintainable, and efficient code following best practices. Conduct code reviews and provide constructive feedback to peers. Stay up-to-date with the latest industry trends and technologies, particularly in AI/ML. Required Skills and Qualifications Bachelor’s degree in Computer Science, Engineering, or a related field. 3+ years of professional experience as a Python Developer. Proficient in Python with a strong understanding of its ecosystem. Extensive experience with Django and Flask frameworks. Hands-on experience with AWS services for application deployment and management. Strong knowledge of Django Rest Framework (DRF) for building APIs. Expertise in machine learning frameworks such as PyTorch , TensorFlow , and scikit-learn . Experience with transformer architectures for NLP and advanced AI solutions. Solid understanding of SQL and NoSQL databases (e.g., PostgreSQL, MongoDB). Familiarity with MLOps practices for managing the machine learning lifecycle. Basic knowledge of front-end technologies (e.g., JavaScript, HTML, CSS) is a plus. Excellent problem-solving skills and the ability to work independently and as part of a team. Strong communication skills and the ability to articulate complex technical concepts to non-technical stakeholders. Contact : 9875952836 Office Location: F273, Phase 8b Industrial Area Mohali, Punjab. Job Type: Full-time Schedule: Day shift Monday to Friday Work Location: In person
Posted 3 weeks ago
2.0 years
0 Lacs
India
Remote
Senior Machine Learning Engineer (AI-Powered Software Platform for Hidden Physical-Threat Detection & Real-Time Intelligence) About the Company: Aerobotics7 (A7) is a mission-driven deep-tech startup focused on developing a UAV-based next-gen sensing and advanced AI platform to detect, identify, and mitigate hidden threats like landmines, UXOs, and IEDs in real-time. We are embarking on a rapid development phase, creating innovative solutions leveraging cutting-edge technologies. Our dynamic team is committed to building impactful products through continuous learning, and close cross-collaboration. Position Overview: We are seeking a Senior Machine Learning Engineer with a strong research orientation to join our team. This role will focus on developing and refining proprietary machine learning models for drone-based landmine detection and mitigation. The ideal candidate will design, develop, and optimize advanced ML workflows with an emphasis on rigorous research, novel model development, and experimental validation in deep learning, multi-modal/sensor fusion and computer vision applications. Key Responsibilities: Lead the end-to-end AI model development process, including research, experimentation, design, and implementation. Architect, train, and deploy deep learning models on cloud (GCP) and edge devices, ensuring real-time performance. Develop and optimize multi-modal ML/DL models integrating multiple sensor inputs. Implement and fine-tune CNNs, Vision Transformers (ViTs), and other deep-learning architectures. Design and improve sensor fusion techniques for enhanced perception and decision-making. Optimize AI inference for low-latency and high-efficiency deployment on production. Cross-collaborate with software and hardware teams to integrate AI solutions into mission-critical applications. Develop scalable pipelines for model training, validation, and continuous improvement. Ensure robustness, interpretability, and security of AI models in deployment. Required Skills: • Strong expertise in deep learning frameworks (TensorFlow, PyTorch). • Experience with CNNs, ViTs, and other DL architectures. • Hands-on experience in multi-modal ML and sensor fusion techniques. • Proficiency in cloud-based AI model deployment (GCP experience preferred). • Experience with edge AI optimization (NVIDIA Jetson, TensorRT, OpenVINO). • Strong knowledge of data preprocessing, augmentation, and synthetic data generation. • Proficiency in model quantization, pruning, and optimization for real-time applications. • Familiarity with computer vision, object detection, and real-time inference techniques. • Ability to work with limited datasets, including generating synthetic data (VAEs or s similar), data annotation and augmentation strategies. • Strong coding skills in Python and C++ with experience in high-performance computing. Preferred Qualifications: • Experience: 2-4+ Years. • Experience with MLOps, including CI/CD pipelines, model versioning, and monitoring. • Knowledge of reinforcement learning techniques. • Experience in working in fast-paced startup environments. • Prior experience working on AI-driven autonomous systems, robotics, or UAVs. • Understanding of embedded systems and hardware acceleration for AI workloads. Benefits: NOTE: THIS ROLE IS UNDER AEROBOTICS7 INVENTIONS PVT. LTD., AN INDIAN ENTITY. IT IS A REMOTE INDIA-BASED ROLE WITH COMPENSATION ALIGNED TO INDIAN MARKET STANDARDS. WHILE OUR PARENT COMPANY IS US-BASED, THIS POSITION IS FOR CANDIDATES RESIDING AND WORKING IN INDIA. Competitive startup-level salary and comprehensive benefits package. Future opportunity for equity options in the company. Opportunity to work on impactful, cutting-edge technology in a collaborative startup environment. Professional growth with extensive learning and career development opportunities. Direct contribution to tangible, real-world impact. How to Apply: Interested candidates are encouraged to submit their resume along with an (optional) cover letter highlighting their relevant experience and passion for working in a dynamic startup environment. For any questions or further information, feel free to reach out to us directly by emailing us at careers@aerobotics7.com. Show more Show less
Posted 3 weeks ago
5.0 - 7.0 years
0 Lacs
India
On-site
PLEASE NOTE: THIS ROLE IS ONLY FOR CANDIDATES WITH 5 TO 7 YEARS OF EXPERIENCE About PharmSight PharmSight is a leading innovator in bio-pharma analytics, providing cutting-edge AI-powered solutions that transform product research, market intelligence, and healthcare decision-making. We are dedicated to improving patient outcomes and driving advancements in the pharmaceutical industry through the application of advanced artificial intelligence Why join PharmSight? Competitive Compensation: Best-in-class salary with structured career progression Flexible Work Environment: Option to work from anywhere, at any time Global Client Exposure: Collaborate with leading pharmaceutical companies on impactful projects Career Growth & Recognition: A flat hierarchy with ample opportunities for leadership and professional development Role Overview As an AI Developer/Engineer (LLM) at PharmSight, you will be at the forefront of designing, developing, and deploying generative AI applications using state-of-the-art large language models (LLMs). You will be instrumental in crafting innovative AI solutions that solve complex challenges in bio-pharma analytics, product research, and market intelligence, directly impacting our clients ability to make data-driven decisions. This role demands a unique combination of deep technical expertise, creative problem-solving, and a passion for advancing AI technologies within the healthcare and pharmaceutical domains Key Responsibilities Architect, implement, and optimize large language models (LLMs) such as GPT, LLaMA, and BERT, tailoring them to the specific needs of bio-pharma analytics, product research, and market intelligence Experiment with diverse model architectures, hyperparameters, and training methodologies to maximize performance for targeted healthcare and pharmaceutical applications Fine-tune pre-trained models to address domain-specific challenges, ensuring exceptional accuracy, relevance, and contextual understanding Design and refine prompts to optimize LLM performance in generating accurate, insightful, and actionable outputs Develop instruction-tuning pipelines that align model behavior with specific business objectives and user requirements Continuously iterate on prompt strategies to enhance model interpretability and mitigate the risk of hallucinations or irrelevant outputs Conduct rigorous evaluations of LLMs using industry-standard metrics such as perplexity, BLEU, ROUGE, and domain-specific accuracy scores Perform in-depth error analysis, bias detection, and fairness audits to ensure models meet the highest ethical and regulatory standards Benchmark model performance against industry best practices and competitor solutions to maintain a competitive edge and drive continuous improvement Deploy LLMs into production environments, ensuring scalability, reliability, and low-latency performance to meet the demands of real-world applications Optimize models for inference speed and resource efficiency through techniques like quantization, distillation, and pruning Implement robust monitoring systems to track model performance in real-time and deploy timely updates to address drift or degradation in output quality Collaborate closely with data engineers and analysts to seamlessly integrate LLM outputs into PharmSight’s analytics platforms Leverage graph databases (e.g., vector graphs, hybrid graphs) to enhance structured knowledge extraction from unstructured text Develop APIs and intuitive interfaces that facilitate seamless interaction between LLMs and other critical system components Remain at the forefront of LLM research, actively exploring advancements in areas such as few-shot learning, reinforcement learning from human feedback (RLHF), and multimodal models Prototype and rigorously test emerging techniques to enhance model capabilities and address novel challenges in the bio-pharma domain Contribute findings to open-source projects, publish research insights, and represent PharmSight in AI research communities Work collaboratively with cross-functional teams including data scientists, product managers, and domain experts, ensuring that LLM development is aligned with critical business goals Mentor junior developers and analysts, providing guidance on LLM techniques, coding best practices, and emerging trends in AI Requirements Educational Background: bachelor’s or master’s degree in computer science, Data Science, Artificial Intelligence, or a related field AI & ML Experience: 5-7 years of hands-on experience in AI/ML development, with a strong focus on large language models (LLMs) Expertise in Python and deep learning frameworks (e.g., TensorFlow, PyTorch) Solid understanding of prompt engineering, model optimization, and NLP techniques Healthcare/Pharma Knowledge: A solid understanding of healthcare data, bio-pharma industry dynamics, and regulatory requirements Analytical Mindset: Exceptional problem-solving skills with the ability to translate business needs into innovative AI-driven solutions Communication Skills: Excellent written and verbal communication skills, with the ability to collaborate effectively with cross-functional teams and explain complex AI concepts to non-technical stakeholders (Bonus Skill) Experience in MLOps (e.g., Docker, Kubernetes, CI/CD pipelines, model monitoring) (Bonus Skill) Proficiency in cloud platforms (AWS, Azure, or GCP) for scalable AI deployment (Bonus Skill) Experience with knowledge graph construction and multimodal data integration (Eg, Neo4j, Entity extraction, nodes extraction) Join Us PharmSight offers a competitive salary, comprehensive benefits package, and the opportunity to work on cutting-edge AI projects that are transforming the pharmaceutical industry. We are committed to fostering a collaborative and innovative work environment where you can grow your skills and make a real impact Interested? Send your CV/Resume to Careers@pharmsight.com , and we’ll get back to you soon! Show more Show less
Posted 3 weeks ago
3.0 - 5.0 years
3 - 5 Lacs
Bengaluru / Bangalore, Karnataka, India
On-site
Lead the design, development, and implementation of AI/ML solutions across multiple domains Collaborate with cross-functional teams for seamless integration of AI/ML components Mentor and coach junior engineers, offering development opportunities and guidance Resolve issues related to AI model optimization for high performance and accuracy Conduct research on AI/ML trends and innovations to adopt best practices Develop and optimize quantization techniques for efficient AI/ML model execution on Qualcomm hardware Manage project timelines, objectives, and resource allocation across functions Minimum Qualifications: Bachelor's degree in Engineering, Computer Science, or a related field and 4+ years of Software Engineering or related experience OR Master's degree in Engineering, Computer Science, or a related field and 3+ years of Software Engineering or related experience Experience in software architecture and programming languages Proficiency with tools and frameworks such as PyTorch, TensorFlow, ONNX, etc. Preferred Qualifications: Excellent development skills in C++ / Python Strong knowledge of data structures and algorithms Hands-on expertise in deep learning frameworks like ONNX, PyTorch In-depth understanding of CV, NLP, LLM, GenAI, Classification, and Object Detection models Proficient in quantization (8-bit, 4-bit) and calibration algorithms Understanding of ML compiler techniques and graph optimizations Familiarity with software design patterns and SOLID principles Strong analytical, debugging, and development skills Knowledge of ML compilers (e.g., TVM, Glow) and runtimes (ONNX Runtime, TensorFlow Runtime) is a plus
Posted 3 weeks ago
2.0 years
0 Lacs
India
Remote
ob Title: AI Full stack Developer – GenAI & NLP Location: Pune, India (Hybrid) Work Mode: Remote Experience Required: 2+ Years (Relevant AI/ML with GenAI & NLP) Salary: Up to ₹15 LPA (CTC) Employment Type: Full-time Department: AI Research & Development Role Overview We are looking for a passionate AI Developer with strong hands-on experience in Generative AI and Natural Language Processing (NLP) to help build intelligent and scalable solutions. In this role, you will design and deploy advanced AI models for tasks such as language generation, summarization, chatbot development, document analysis, and more. You’ll work with cutting-edge LLMs (Large Language Models) and contribute to impactful AI initiatives. Key Responsibilities Design, fine-tune, and deploy NLP and GenAI models using LLMs like GPT, BERT, LLaMA, or similar. Build applications for tasks like text generation, question-answering, summarization, sentiment analysis, and semantic search. Integrate language models into production systems using RESTful APIs or cloud services. Evaluate and optimize models for accuracy, latency, and cost. Collaborate with product and engineering teams to implement intelligent user-facing features. Preprocess and annotate text data, create custom datasets, and manage model pipelines. Stay updated on the latest advancements in generative AI, transformer models, and NLP frameworks. Required Skills & Qualifications Bachelor’s or Master’s degree in Computer Science, AI/ML, or a related field. Minimum 2 years of experience in fullstack development and AI/ML development, with recent work in NLP or Generative AI. Hands-on experience with models such as GPT, T5, BERT, or similar transformer-based architectures. Proficient in Python and libraries such as Hugging Face Transformers, spaCy, NLTK, or OpenAI APIs. Hands-on experience in any frontend/ backend technologies for software development. Experience with deploying models using Flask, FastAPI, or similar frameworks. Strong understanding of NLP tasks, embeddings, vector databases (e.g., FAISS, Pinecone), and prompt engineering. Familiarity with MLOps tools and cloud platforms (AWS, Azure, or GCP). Preferred Qualifications Experience with LangChain, RAG (Retrieval-Augmented Generation), or custom LLM fine-tuning. Knowledge of model compression, quantization, or inference optimization. Exposure to ethical AI, model interpretability, and data privacy practices. What We Offer Competitive salary package up to ₹15 LPA. Remote work flexibility with hybrid team collaboration in Pune. Opportunity to work on real-world generative AI and NLP applications. Access to resources for continuous learning and certification support. Inclusive, fast-paced, and innovative work culture. Skills: nltk,computer vision,inference optimization,model interpretability,gpt,bert,mlops,artificial intelligence,next.js,tensorflow,ai development,machine learning,generative ai,ml,openai,node.js,kubernetes,large language models (llms),openai apis,natural language processing,machine learning (ml),fastapi,natural language processing (nlp),java,azure,nlp tasks,model compression,embeddings,vector databases,aws,typescript,r,hugging face transformers,google cloud,hugging face,llama,ai tools,mlops tools,rag architectures,langchain,spacy,docker,retrieval-augmented generation (rag),pytorch,gcp,cloud,large language models,react.js,deep learning,python,ai technologies,flask,ci/cd,data privacy,django,quantization,javascript,ethical ai,nlp Show more Show less
Posted 3 weeks ago
3.0 years
0 Lacs
Bengaluru, Karnataka
On-site
Experience : 3+ years Job Location : Bengaluru, Karnataka Work Modality : Fulltime work from office Job Description : To develop LLM-driven products from the ground up. We are looking for enthusiastic members who would like to design cutting-edge systems and implement AI solutions that scale globally. Strong communication skills Problem-solving abilities Strong programming background Understanding of Transformer architecture Required Qualifications : 3+ years of hands-on experience in AI/ML, with proven projects using Transformers (e.g., BERT, GPT, T5, ViTs, Small LLMs) Strong proficiency in Python and deep learning frameworks (PyTorch or TensorFlow) Ability to independently analyze open sources and code repositories Experience in fine-tuning Transformer models for NLP (e.g., text classification, summarization) or Computer Vision (e.g., image generation, recognition) Knowledge of GPU acceleration, optimization techniques, and model quantization Experience in deploying models using Flask, FastAPI, or cloud-based inference services Familiarity with data pre-processing, feature engineering, and training workflows
Posted 3 weeks ago
0 years
0 Lacs
Hyderabad, Telangana, India
On-site
Company Description UAE-based ZySec AI provides cutting-edge cybersecurity solutions to help enterprises tackle evolving security challenges at scale. Utilizing an autonomous AI workforce, ZySec AI enhances operational efficiency by automating repetitive, resource-intensive tasks, enabling security teams to focus on strategic priorities. Our mission is to make AI more efficient, accessible, and private for security professionals.mWe're building the future of Autonomous Data Intelligence at CyberPod AI and were looking for a deeply technical, hands-on AI Engineer to push the boundaries of whats possible with Large Language Models (LLMs). This role is for someone whos already been in the trenches: fine-tuned foundation models, experimented with quantization and performance tuning, and knows PyTorch inside out. If youre passionate about optimizing LLMs, crafting efficient reasoning architectures, and contributing to open-source communities like Hugging Face, this is your playground. Role Description Fine-tune Large Language Models (LLMs) on custom datasets for specialized reasoning tasks. Design and run benchmarking pipelines across accuracy, speed, token throughput, and energy efficiency. Implement quantization, pruning, and distillation techniques for model compression and deployment readiness. Evaluate and extend agentic RAG (Retrieval-Augmented Generation) pipelines and reasoning agents. Contribute to SOTA model architectures for multi-hop, temporal, and multimodal reasoning. Collaborate closely with the data engineering, infra, and applied research teams to bring ideas from paper to production. Own and drive experiments, ablations, and performance dashboards end-to-end. Requirements Hands-on experience working with deep learning and large models, particularly LLMs. Strong understanding of PyTorch internals: autograd, memory profiling, efficient dataloaders, mixed precision. Proven track record in fine-tuning LLMs (e.g., LLaMA, Falcon, Mistral, Open LLaMA, T5, etc.) on real-world use cases. Benchmarking skills: can run standardized evals (e.g., MMLU, GSM8K, HELM, TruthfulQA) and interpret metrics. Deep familiarity with quantization techniques: GPTQ, AWQ, QLoRA, bitsandbytes, and low-bit inference. Working knowledge of Hugging Face ecosystem (Transformers, Accelerate, Datasets, Evaluate). Active Hugging Face profile with at least one public model/repo published. Experience in training and optimizing multi-modal models (vision-language/audio) is a big plus. Published work (arXiv, GitHub, blogs) or open-source contributions preferred. If you are passionate about AI and want to be a part of a dynamic and innovative team, then ZySec AI is the perfect place for you. Apply now and join us in shaping the future of artificial intelligence. Show more Show less
Posted 3 weeks ago
3.0 years
0 Lacs
Bengaluru, Karnataka, India
On-site
A career within our Infrastructure practice will provide you with the opportunity to design, build, coordinate and maintain the IT environments for clients to run internal operations, collect data, monitor, develop and launch products. Infrastructure management consists of hardware, storage, compute, network and software layers. As a part of our Infrastructure Engineering team, you will be responsible for maintaining the critical IT systems which includes build, run and maintenance while providing technical support and training that aligns to industry leading practices. To really stand out and make us fit for the future in a constantly changing world, each and every one of us at PwC needs to be a purpose-led and values-driven leader at every level. To help us achieve this we have the PwC Professional; our global leadership development framework. It gives us a single set of expectations across our lines, geographies and career paths, and provides transparency on the skills we need as individuals to be successful and progress in our careers, now and in the future. Responsibilities As a Senior Associate, you'll work as part of a team of problem solvers, helping to solve complex business issues from strategy to execution. PwC Professional skills and responsibilities for this management level include but are not limited to: Use feedback and reflection to develop self awareness, personal strengths and address development areas. Delegate to others to provide stretch opportunities, coaching them to deliver results. Demonstrate critical thinking and the ability to bring order to unstructured problems. Use a broad range of tools and techniques to extract insights from current industry or sector trends. Review your work and that of others for quality, accuracy and relevance. Know how and when to use tools available for a given situation and can explain the reasons for this choice. Seek and embrace opportunities which give exposure to different situations, environments and perspectives. Use straightforward communication, in a structured way, when influencing and connecting with others. Able to read situations and modify behavior to build quality relationships. Uphold the firm's code of ethics and business conduct. AI Engineer Overview We are seeking an exceptional AI Engineer to drive the development, optimization, and deployment of cutting-edge generative AI solutions for our clients. This role is at the forefront of applying generative models to solve real-world business challenges, requiring deep expertise in both the theoretical underpinnings and practical applications of generative AI. Core Qualifications Advanced degree (MS/PhD) in Computer Science, Machine Learning, or related field with a focus on generative models 3+ years of hands-on experience developing and deploying AI models in production environments with 1 year of experience in developing generative AI pilots, proofs of concept, and prototypes Deep understanding of state-of-the-art AI architectures (e.g., Transformers, VAEs, GANs, Diffusion Models) Expertise in PyTorch or TensorFlow, with a preference for experience in both Proficiency in Python and software engineering best practices for AI systems Technical Skills Required Demonstrated experience with large language models (LLMs) such as GPT, BERT, T5, etc. Practical understanding of generative AI frameworks (e.g., Hugging Face Transformers, OpenAI GPT, DALL-E) Familiarity with prompt engineering and few-shot learning techniques Expertise in MLOps and LLMOps practices, including CI/CD for ML models Strong knowledge of one or more cloud-based AI services (e.g., AWS SageMaker, Azure ML, Google Vertex AI) Preferred Proficiency in optimizing generative models for inference (quantization, pruning, distillation) Experience with distributed training of large-scale AI models Experience with model serving technologies (e.g., TorchServe, TensorFlow Serving, Triton Inference Server) Key Responsibilities Architect and implement end-to-end generative AI solutions, from data preparation to production deployment Develop custom AI models and fine-tune pre-trained models for specific client use cases Optimize generative models for production, balancing performance, latency, and resource utilization Design and implement efficient data pipelines for training and serving generative models Develop strategies for effective prompt engineering and few-shot learning in production systems Implement robust evaluation frameworks for generative AI outputs Collaborate with cross-functional teams to integrate generative AI capabilities into existing systems Address challenges related to bias, fairness, and ethical considerations in generative AI applications Project Delivery Lead the technical aspects of generative AI projects from pilot to production Develop proof-of-concepts and prototypes to demonstrate the potential of generative AI in solving client problems Conduct technical feasibility studies for applying generative AI to novel use cases Implement monitoring and observability solutions for deployed generative models Troubleshoot and optimize generative AI systems in production environments Client Engagement Provide expert technical guidance on generative AI capabilities and limitations to clients Collaborate with solution architects to design generative AI-powered solutions that meet client needs Present technical approaches and results to both technical and non-technical stakeholders Assist in scoping and estimating generative AI projects Innovation and Knowledge Sharing Stay at the forefront of generative AI research and industry trends Contribute to the company's intellectual property through patents or research publications Develop internal tools and frameworks to accelerate generative AI development Mentor junior team members on generative AI technologies and best practices Contribute to technical blog posts and whitepapers on generative AI applications The ideal candidate will have a proven track record of successfully deploying AI models in production environments, a deep understanding of the latest advancements in generative AI, and the ability to apply this knowledge to solve complex business problems. They should be passionate about pushing the boundaries of what's possible with generative AI and excited about the opportunity to shape the future of AI-driven solutions for our clients. Show more Show less
Posted 3 weeks ago
3.0 years
0 Lacs
Bengaluru, Karnataka, India
On-site
It's fun to work in a company where people truly BELIEVE in what they are doing! We're committed to bringing passion and customer focus to the business. About Fractal Fractal is a strategic AI partner to Fortune 500 companies with a vision to power every human decision in the enterprise. Fractal is building a world where individual choices, freedom, and diversity are the greatest assets; an ecosystem where human imagination is at the heart of every decision. Where no possibility is written off, only challenged to get better. We believe that a true Fractalite is the one who empowers imagination with intelligence. Fractal has been featured as a Great Place to Work by The Economic Times in partnership with the Great Place to Work® Institute and recognized as a ‘Cool Vendor’ and a ‘Vendor to Watch’ by Gartner. Job Description As an NLP Lead Data Scientist, you will be building solutions that require analyzing and transforming natural language data into useful outcomes using NLP techniques. To succeed in this role, you should possess outstanding skills in statistical analysis, machine learning methods and text representation techniques and language models, consulting experience in FS, banking domain, good communication and client handling skill. Responsibilities Build Solutions that identify intent and entities, other features from user comments, chat transcript and other unstructured text data. Design and implement advanced solutions utilizing NLP traditional Models and Large Language Models (LLMs). Demonstrate self-driven initiative by taking ownership and creating end-to-end solutions. Conduct research and stay informed about the latest developments in generative AI and LLMs. Train and fine-tune pre-trained NLP models and run evaluation experiments. Perform statistical analysis of results and refine models. Basic logical pseudo code writing. Develop and maintain code libraries, tools, and frameworks to support generative AI development. Participate in code reviews and contribute to maintaining high code quality standards. Engage in the entire software development lifecycle, from design and testing to deployment and maintenance. Collaborate closely with cross-functional teams to align messaging, contribute to roadmaps, and integrate software into different repositories for core system compatibility. Possess strong analytical and problem-solving skills. Demonstrate excellent communication skills and the ability to work effectively in a team environment. Primary Skills Natural Language Processing (NLP): Hands-on experience in use case classification, topic modeling, Q&A and chatbots, search, Document AI, summarization, and content generation. AND/OR Computer Vision and Audio: Hands-on experience in image classification, object detection, segmentation, image generation, audio, and video analysis. Generative AI: Proficiency with SaaS LLMs, including Lang chain, llama index, vector databases, Prompt engineering (COT, TOT, ReAct, agents). Experience with Azure OpenAI, Google Vertex AI, AWS Bedrock for text/audio/image/video modalities. Familiarity with Open-source LLMs, including tools like TensorFlow/Pytorch and huggingface. Techniques such as quantization, LLM finetuning using PEFT, RLHF, data annotation workflow, and GPU utilization. Cloud: Hands-on experience with cloud platforms such as Azure, AWS, and GCP. Cloud certification is preferred. Application Development: Proficiency in Python, Docker, FastAPI/Django/Flask, and Git Domain Skill: Consulting experience with FS,Banking domain for at least 3 years. Required Qualifications 16-20 years into Deep Learning framewokrs like Keras, Pytorch, TensorFlow Experience of design, building and deployment of ML/NLP Solutions Proficient in Python & SQL Experience in NLP tools like Genism, spacy, Stanford NLP, HuggingFace Must have excellent project/program management skills and have experience managing multiple work streams and projects at one time Have business acumen to manage revenues profitably and meet financial goals consistently. Able to quantify business value for clients and create win-win commercial propositions. Preferred Qualifications Experience automating data within Tableau/Qlik/ggplot/Shiny to tell a story through interactive visualizations Client relationship management: Build deep client relationship, network & be a thought partner. Anticipate business problems & deliver par excellence. Sales Support & account growth: Actively focus on opportunities to grow the client along with the senior engagement manager. Support the sales team as required for RFPs and regular sales pitches Firm building: Contribute to firm growth by participating and conducting training sessions. Coaching & grooming: Coach & groom the team on gaining knowledge & skills on first principles of analytics techniques, problem-solving, project management, client relationship management EDUCATION: B.E/B.Tech/M.Tech in Computer Science or related technical degree OR Equivalent If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us! Not the right fit? Let us know you're interested in a future opportunity by clicking Introduce Yourself in the top-right corner of the page or create an account to set up email alerts as new job postings become available that meet your interest! Show more Show less
Posted 3 weeks ago
4.0 years
0 Lacs
Bengaluru, Karnataka, India
On-site
Job Category: AIML Job Type: Full Time Job Location: Bengaluru Mangalore Experience: 4-8 Years Skills: AI AWS/AZURE/GCP Azure ML C computer vision data analytics Data Modeling Data Visualization deep learning Descriptive Analytics GenAI Image processing Java LLM models ML ONNX Predictive Analytics Python R Regression/Classification Models SageMaker SQL TensorFlow Position Overview We are looking for an experienced AI/ML Engineer to join our team in Bengaluru. The ideal candidate will bring a deep understanding of machine learning, artificial intelligence, and big data technologies, with proven expertise in developing scalable AI/ML solutions. You will lead technical efforts, mentor team members, and collaborate with cross-functional teams to design, develop, and deploy cutting edge AI/ML applications. Job Details Job Category: AI/ML Engineer. Job Type: Full-Time Job Location: Bengaluru Experience Required: 4-8 Years About Us We are a multi-award-winning creative engineering company. Since 2011, we have worked with our customers as a design and technology enablement partner, guiding them on their digital transformation journeys. Roles And Responsibilities Design, develop, and deploy deep learning models for object classification, detection, and segmentation using CNNs and Transfer Learning. Implement image preprocessing and advanced computer vision pipelines. Optimize deep learning models using pruning, quantization, and ONNX for deployment on edge devices. Work with PyTorch, TensorFlow, and ONNX frameworks to develop and convert models. Accelerate model inference using GPU programming with CUDA and cuDNN. Port and test models on embedded and edge hardware platforms. ( Orin, Jetson, Hailo ) Conduct research and experiments to evaluate and integrate GenAI technologies in computer vision tasks. Explore and implement cloud-based AI workflows, particularly using AWS/Azure AI/ML services. Collaborate with cross-functional teams for data analytics, data processing, and large-scale model training. Required Skills Strong programming experience in Python. Solid background in deep learning, CNNs, and transfer learning and Machine learning basics. Expertise in object detection, classification, segmentation. Proficiency with PyTorch, TensorFlow, and ONNX. Experience with GPU acceleration (CUDA, cuDNN). Hands-on knowledge of model optimization (pruning, quantization). Experience deploying models to edge devices (e.g., Jetson, mobile, Orin, Hailo ) Understanding of image processing techniques. Familiarity with data pipelines, data preprocessing, and data analytics. Willingness to explore and contribute to Generative AI and cloud-based AI solutions. Good problem-solving and communication skills. Preferred (Nice-to-Have) Experience with C/C++. Familiarity with AWS Cloud AI/ML tools (e.g., SageMaker, Rekognition). Exposure to GenAI frameworks like OpenAI, Stable Diffusion, etc. Knowledge of real-time deployment systems and streaming analytics. Qualifications Graduation/Post-graduation in Computers, Engineering, or Statistics from a reputed institute. What We Offer Competitive salary and benefits package. Opportunity to work in a dynamic and innovative environment. Professional development and learning opportunities. Visit us on: CodeCraft Technologies LinkedIn : CodeCraft Technologies LinkedIn Instagram : CodeCraft Technologies Instagram Show more Show less
Posted 3 weeks ago
3.0 years
0 Lacs
Bengaluru, Karnataka, India
Remote
Summary Gainwell is seeking LLM Ops Engineers and ML Ops Engineers to join our growing AI/ML team. This role is responsible for developing, deploying, and maintaining scalable infrastructure and pipelines for Machine Learning (ML) models and Large Language Models (LLMs). You will play a critical role in ensuring smooth model lifecycle management, performance monitoring, version control, and compliance while collaborating closely with Data Scientists, DevOps, and Role Description Core LLM Ops Responsibilities: Develop and manage scalable deployment strategies specifically tailored for LLMs (GPT, Llama, Claude, etc.). Optimize LLM inference performance, including model parallelization, quantization, pruning, and fine-tuning pipelines. Integrate prompt management, version control, and retrieval-augmented generation (RAG) pipelines. Manage vector databases, embedding stores, and document stores used in conjunction with LLMs. Monitor hallucination rates, token usage, and overall cost optimization for LLM APIs or on-prem deployments. Continuously monitor models for its performance and ensure alert system in place. Ensure compliance with ethical AI practices, privacy regulations, and responsible AI guidelines in LLM workflows. Core ML Ops Responsibilities: Design, build, and maintain robust CI/CD pipelines for ML model training, validation, deployment, and monitoring. Implement version control, model registry, and reproducibility strategies for ML models. Automate data ingestion, feature engineering, and model retraining workflows. Monitor model performance, drift, and ensure proper alerting systems are in place. Implement security, compliance, and governance protocols for model deployment. Collaborate with Data Scientists to streamline model development and experimentation. What We’re Looking For Bachelor's or Master's degree or higher in Computer Science, Data Sciences-Machine Learning, Engineering, or related fields. Strong experience with ML Ops tools (Kubeflow, ML flow, TFX, Sage Maker, etc.). Experience with LLM-specific tools and frameworks ( LangChain, Lang Graph, LlamaIndex, Hugging Face, OpenAI APIs, Vector DBs like Pinecone, FAISS, Weavite, Chroma DB etc.). Solid experience in deploying models in cloud (AWS, Azure, GCP) and on-prem environments. Proficient in containerization (Docker, Kubernetes) and CI/CD practices. Familiarity with monitoring tools like Prometheus, Grafana, and ML observability platforms. Strong coding skills in Python, Bash, and familiarity with infrastructure-as-code tools (Terraform, Helm, etc.).Knowledge of healthcare AI applications and regulatory compliance (HIPAA, CMS) is a plus. Strong skills in Giskard, Deepeval etc. Qualifications Bachelor or Masters or Higher in Computer Sciences, Data Sciences, or any related field 3+ years to 7 Years of experience in deploying ML/DL and LLM based solutions in large scale deployment environment or related experience Experience with fine-tuning LLMs and serving them in production at scale. Knowledge of model compression techniques for LLMs (LoRA, QLoRA, quantization-aware training). Experience with distributed systems and high-performance computing for large-scale model serving. Awareness of AI fairness, explainability, and governance frameworks. What You Should Expect in This Role Fully Remote Opportunity – Work from anywhere in the U.S. / India Minimal Travel Required – Occasional travel opportunities (0-10%). Opportunity to Work on Cutting-Edge AI Solutions in a mission-driven healthcare technology environment. Show more Show less
Posted 3 weeks ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
Accenture
36723 Jobs | Dublin
Wipro
11788 Jobs | Bengaluru
EY
8277 Jobs | London
IBM
6362 Jobs | Armonk
Amazon
6322 Jobs | Seattle,WA
Oracle
5543 Jobs | Redwood City
Capgemini
5131 Jobs | Paris,France
Uplers
4724 Jobs | Ahmedabad
Infosys
4329 Jobs | Bangalore,Karnataka
Accenture in India
4290 Jobs | Dublin 2