
153 Quantization Jobs - Page 2

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

0 years

0 Lacs

India

On-site


About Us: Soul AI is a pioneering company founded by IIT Bombay and IIM Ahmedabad alumni, with a strong founding team from IITs, NITs, and BITS. We specialize in delivering high-quality human-curated data and AI-first scaled operations services. Based in San Francisco and Hyderabad, we are a fast-moving team on a mission to build AI for Good, driving innovation and societal impact.
Role Overview: We’re looking for a Generative AI Engineer to join our client’s team and build intelligent systems powered by large language models and other generative AI architectures. This role involves developing and deploying LLM-based features, integrating vector search, fine-tuning models, and collaborating with product and engineering teams to ship robust, scalable GenAI applications. You’ll work across the GenAI stack, from prompt design to inference optimization, and shape how generative models are used in real-world products.
Responsibilities:
• Fine-tune and deploy LLMs (e.g., GPT, LLaMA, Mistral) using frameworks like Hugging Face Transformers or LangChain
• Build and optimize Retrieval-Augmented Generation (RAG) pipelines using vector databases (e.g., Pinecone, FAISS)
• Engineer prompts for structured, reliable outputs across use cases (chatbots, summarization, coding copilots, etc.)
• Implement scalable inference pipelines and optimize latency, throughput, and cost using techniques like quantization or model distillation
• Collaborate with product, design, and frontend teams to integrate GenAI into user-facing features
• Monitor, evaluate, and continuously improve model performance, safety, and accuracy in production
• Ensure compliance with privacy, safety, and responsible AI practices (e.g., content filtering, output sanitization)
Required Skills:
• Strong programming skills in Python, with familiarity in modern ML tooling
• Practical experience with LLM frameworks (e.g., Hugging Face Transformers, LangChain, LlamaIndex)
• Experience building or deploying RAG pipelines, including handling embeddings and vector search
• Understanding of transformer models, prompt engineering, and tokenization strategies
• Hands-on with APIs (OpenAI, Anthropic, Cohere, etc.) and model serving (FastAPI, Flask, etc.)
• Experience deploying ML models using Docker, Kubernetes, and/or cloud services (AWS/GCP/Azure)
• Comfortable with model evaluation, monitoring, and troubleshooting inference pipelines
Nice to Have:
• Experience with multimodal models (e.g., diffusion models, TTS, image/video generation)
• Knowledge of RLHF, safety alignment, or model fine-tuning best practices
• Familiarity with open-source LLMs (e.g., Mistral, LLaMA, Falcon, Mixtral) and optimization (LoRA, quantization)
• Experience with LangChain agents, tool usage, and memory management
• Contributions to open-source GenAI projects or published demos/blogs on generative AI
• Exposure to frontend technologies (React/Next.js) for prototyping GenAI tools
Educational Qualifications:
• Bachelor's or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, Data Science, or a related technical field
• Candidates with relevant project experience or open-source contributions may be considered regardless of formal degree

Posted 5 days ago


3.0 years

0 Lacs

Hyderābād

On-site

Company: Qualcomm India Private Limited
Job Area: Engineering Group, Engineering Group > Software Engineering
General Summary: Join the exciting Generative AI team at Qualcomm focused on integrating cutting-edge GenAI models on Qualcomm chipsets. The team uses Qualcomm chips’ extensive heterogeneous computing capabilities to allow inference of GenAI models on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. Utilize this power-efficient hardware and software stack to run Large Language Models (LLMs) and Large Vision Models (LVM) at near GPU speeds!
Responsibilities: In this role, you will spearhead the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK on Qualcomm SoCs. As an AI inferencing expert, you'll push the limits of performance from large models. Your mastery in deploying large C/C++ software stacks using best practices will be essential. You'll stay on the cutting edge of GenAI advancements, understanding LLMs/Transformers and the nuances of edge-based GenAI deployment. Most importantly, your passion for the role of edge in AI's evolution will be your driving force.
Requirements:
• Master’s/Bachelor’s degree in computer science or equivalent.
• 3+ years of relevant work experience in software development.
• Strong understanding of Generative AI models – LLM, LVM and LLMs and building blocks.
• Floating-point, fixed-point representations and quantization concepts.
• Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU).
• Strong development skills in C/C++.
• Excellent analytical and debugging skills.
• Good communication skills (verbal, presentation, written).
• Ability to collaborate across a globally diverse team and multiple interests.
Preferred Qualifications:
• Strong understanding of SIMD processor architecture and system design.
• Proficiency in object-oriented software development.
• Familiarity with Linux and Windows environments.
• Strong background in kernel development for SIMD architectures.
• Familiarity with frameworks like llama.cpp, MLX, and MLC is a plus.
• Good knowledge of PyTorch, TFLite, and ONNX Runtime is preferred.
• Experience with parallel computing systems and Assembly is a plus.
Minimum Qualifications:
• Bachelor's degree in Engineering, Information Systems, Computer Science, or related field.
Applicants: Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able to participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries). Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.
To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications. If you would like more information about this role, please contact Qualcomm Careers.

Posted 5 days ago


5.0 years

0 Lacs

Thiruvananthapuram

On-site

Brief Description: The Technical Lead – AI, Embedded & Edge Systems is responsible for leading cross-functional engineering efforts in the design, development, and integration of AI-powered solutions across embedded hardware, edge/server devices, system software, and infrastructure components. The role bridges hardware-software co-design, system architecture, and execution leadership, ensuring delivery of scalable, high-performance, and secure computer vision, artificial intelligence, and robotics products.
Key Responsibilities
1. Technical Architecture & Design
• Lead end-to-end system design covering embedded hardware, edge/server platforms, and AI model integration.
• Define scalable and secure architectural frameworks for hardware-software interoperability, real-time processing, and data flow.
2. Embedded Systems & Hardware Solutioning
• Architect and develop embedded hardware systems (MCUs, SoCs, AI accelerators) for edge and server products.
• Supervise schematic reviews, PCB design validation, and bring-up of edge devices aligned with AI/CV use cases.
3. AI & Computer Vision Integration
• Coordinate with ML engineers to deploy and optimize CV models on heterogeneous hardware (GPU, IPU, NPU, SoCs, etc.).
• Support model quantization, inference optimization, and real-time processing strategies on edge platforms.
4. Edge and Server Application Development
• Oversee development of core applications, services, middleware, and utilities on both embedded and server systems.
• Ensure fault-tolerant, modular, and interoperable implementations across deployment environments.
5. Infrastructure Recommendation & Integration
• Recommend compute, storage, and networking infrastructure for on-prem, hybrid, or cloud deployments based on application demands.
• Collaborate with DevOps and infra teams to establish deployment pipelines, configuration profiles, and monitoring strategies.
7. Team Leadership & Mentorship
• Lead and mentor a team of embedded engineers, application developers, and integration specialists.
• Conduct technical reviews, guide debugging sessions, and promote engineering best practices.
8. Stakeholder Collaboration
• Communicate and negotiate on technology and development aspects with product managers, architects, QA, and peers in other organizations to ensure aligned development and seamless integration.
• Translate business requirements and user stories into actionable technical work plans.
Preferred Skills
• Bachelor’s or Master’s degree in Electronics, Electrical, Computer Engineering, or related field.
• 5+ years of product development experience, preferably with embedded systems design and systems programming, with at least 3 years in a technical leadership role.
• Strong expertise in embedded systems, circuit design, board bring-up, and firmware development.
• Proficient in Python, C/C++, and system-level programming with real-time constraints.
• Hands-on experience with edge AI deployment frameworks, embedded Linux, and driver development.
• Familiarity with infrastructure components (Docker, Kubernetes, CI/CD tools).
Preferred Attributes
• Experience working on mission-critical or real-time AI/CV products (e.g., surveillance, robotics, smart infrastructure).
• Ability to navigate between low-level hardware issues and high-level application architecture.
• Strong documentation, communication, and team coordination skills.
• Exposure to regulatory or compliance-heavy product domains is an advantage.
Job Types: Full-time, Permanent
Schedule: Fixed shift, Monday to Friday
Education: Bachelor's (Preferred)
Experience: total work: 5 years (Preferred); C++: 5 years (Preferred)
Work Location: In person
Expected Start Date: 25/07/2025

Posted 5 days ago


0 years

0 Lacs

Chandigarh, India

On-site


Job Description: AI/ML Specialist
Overview: We are looking for a highly skilled and experienced AI/ML Specialist to join our dynamic team. The ideal candidate will have a robust background in developing web applications using Django and Flask, with expertise in deploying and managing applications on AWS. Proficiency in Django Rest Framework (DRF), a solid understanding of machine learning concepts, and hands-on experience with tools like PyTorch, TensorFlow, and transformer architectures are essential.
Key Responsibilities
• Develop and maintain web applications using Django and Flask frameworks.
• Design and implement RESTful APIs using Django Rest Framework (DRF).
• Deploy, manage, and optimize applications on AWS services, including EC2, S3, RDS, Lambda, and CloudFormation.
• Build and integrate APIs for AI/ML models into existing systems.
• Create scalable machine learning models using frameworks like PyTorch, TensorFlow, and scikit-learn.
• Implement transformer architectures (e.g., BERT, GPT) for NLP and other advanced AI use cases.
• Optimize machine learning models through advanced techniques such as hyperparameter tuning, pruning, and quantization.
• Deploy and manage machine learning models in production environments using tools like TensorFlow Serving, TorchServe, and AWS SageMaker.
• Ensure the scalability, performance, and reliability of applications and deployed models.
• Collaborate with cross-functional teams to analyze requirements and deliver effective technical solutions.
• Write clean, maintainable, and efficient code following best practices.
• Conduct code reviews and provide constructive feedback to peers.
• Stay up-to-date with the latest industry trends and technologies, particularly in AI/ML.
Required Skills And Qualifications
• Bachelor's degree in Computer Science, Engineering, or a related field.
• Proficient in Python with a strong understanding of its ecosystem.
• Extensive experience with Django and Flask frameworks.
• Hands-on experience with AWS services for application deployment and management.
• Strong knowledge of Django Rest Framework (DRF) for building APIs.
• Expertise in machine learning frameworks such as PyTorch, TensorFlow, and scikit-learn.
• Experience with transformer architectures for NLP and advanced AI solutions.
• Solid understanding of SQL and NoSQL databases (e.g., PostgreSQL, MongoDB).
• Familiarity with MLOps practices for managing the machine learning lifecycle.
• Basic knowledge of front-end technologies (e.g., JavaScript, HTML, CSS) is a plus.
• Excellent problem-solving skills and the ability to work independently and as part of a team.
• Strong communication skills and the ability to articulate complex technical concepts to non-technical stakeholders.
(ref:hirist.tech)

Posted 6 days ago


3.0 years

0 Lacs

Hyderabad, Telangana, India

On-site


Company: Qualcomm India Private Limited
Job Area: Engineering Group, Engineering Group > Software Engineering
General Summary: Join the exciting Generative AI team at Qualcomm focused on integrating cutting-edge GenAI models on Qualcomm chipsets. The team uses Qualcomm chips’ extensive heterogeneous computing capabilities to allow inference of GenAI models on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. Utilize this power-efficient hardware and software stack to run Large Language Models (LLMs) and Large Vision Models (LVM) at near GPU speeds!
Responsibilities: In this role, you will spearhead the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK on Qualcomm SoCs. As an AI inferencing expert, you'll push the limits of performance from large models. Your mastery in deploying large C/C++ software stacks using best practices will be essential. You'll stay on the cutting edge of GenAI advancements, understanding LLMs/Transformers and the nuances of edge-based GenAI deployment. Most importantly, your passion for the role of edge in AI's evolution will be your driving force.
Requirements:
• Master’s/Bachelor’s degree in computer science or equivalent.
• 3+ years of relevant work experience in software development.
• Strong understanding of Generative AI models – LLM, LVM and LLMs and building blocks.
• Floating-point, fixed-point representations and quantization concepts.
• Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU).
• Strong development skills in C/C++.
• Excellent analytical and debugging skills.
• Good communication skills (verbal, presentation, written).
• Ability to collaborate across a globally diverse team and multiple interests.
Preferred Qualifications:
• Strong understanding of SIMD processor architecture and system design.
• Proficiency in object-oriented software development.
• Familiarity with Linux and Windows environments.
• Strong background in kernel development for SIMD architectures.
• Familiarity with frameworks like llama.cpp, MLX, and MLC is a plus.
• Good knowledge of PyTorch, TFLite, and ONNX Runtime is preferred.
• Experience with parallel computing systems and Assembly is a plus.
Minimum Qualifications:
• Bachelor's degree in Engineering, Information Systems, Computer Science, or related field.
Applicants: Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able to participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries). Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.
To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications. If you would like more information about this role, please contact Qualcomm Careers.
3076120

Posted 6 days ago


5.0 years

0 Lacs

Kolkata, West Bengal, India

On-site


Roles and Responsibilities: As an Associate Manager - Senior Data Scientist, you will solve some of the most impactful business problems for our clients using a variety of AI and ML technologies. You will collaborate with business partners and domain experts to design and develop innovative solutions on the data to achieve predefined outcomes.
• Engage with clients to understand current and future business goals and translate business problems into analytical frameworks
• Develop custom models based on in-depth understanding of underlying data, data structures, and business problems to ensure deliverables meet client needs
• Create repeatable, interpretable and scalable models
• Effectively communicate the analytics approach and insights to a larger business audience
• Collaborate with team members, peers and leadership at Tredence and client companies
Qualification:
• Bachelor's or Master's degree in a quantitative field (CS, machine learning, mathematics, statistics) or equivalent experience.
• 5+ years of experience in data science, building hands-on ML models
• Experience with LMs (Llama (1/2/3), T5, Falcon, LangChain or a similar framework)
• Candidate must be aware of the entire evolution history of NLP (Traditional Language Models to Modern Large Language Models), training data creation, training set-up and fine-tuning
• Candidate must be comfortable interpreting research papers and architecture diagrams of Language Models
• Candidate must be comfortable with LoRA, RAG, Instruct fine-tuning, Quantization, etc.
• Experience leading the end-to-end design, development, and deployment of predictive modeling solutions.
• Excellent programming skills in Python. Strong working knowledge of Python’s numerical, data analysis, or AI frameworks such as NumPy, Pandas, Scikit-learn, Jupyter, etc.
• Advanced SQL skills with SQL Server and Spark experience.
• Knowledge of predictive/prescriptive analytics including Machine Learning algorithms (Supervised and Unsupervised), deep learning algorithms, and Artificial Neural Networks
• Experience with Natural Language Processing (NLTK) and text analytics for information extraction, parsing and topic modeling.
• Excellent verbal and written communication. Strong troubleshooting and problem-solving skills. Thrive in a fast-paced, innovative environment
• Experience with data visualization tools (PowerBI, Tableau, R Shiny, etc.) preferred
• Experience with cloud platforms such as Azure, AWS is preferred but not required.

Posted 6 days ago


10.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site


Fractal is one of the most prominent players in the Artificial Intelligence space. Fractal's mission is to power every human decision in the enterprise, and it brings AI, engineering, and design to help the world's most admired Fortune 500® companies. Fractal's products include Qure.ai to assist radiologists in making better diagnostic decisions, Crux Intelligence to assist CEOs and senior executives make better tactical and strategic decisions, Theremin.ai to improve investment decisions, Eugenie.ai to find anomalies in high-velocity data, Samya.ai to drive next-generation Enterprise Revenue Growth Management, and Senseforth.ai to automate customer interactions at scale to grow top-line and bottom-line. Analytics Vidhya is the largest Analytics and Data Science community offering industry-focused training programs. Fractal has more than 3600 employees across 16 global locations, including the United States, UK, Ukraine, India, Singapore, and Australia. Fractal has consistently been rated as one of India's best companies to work for by The Great Place to Work® Institute, featured as a leader in Customer Analytics Service Providers Wave™ 2021, Computer Vision Consultancies Wave™ 2020 & Specialized Insights Service Providers Wave™ 2020 by Forrester Research, a leader in Analytics & AI Services Specialists Peak Matrix 2021 by Everest Group, and recognized as an "Honorable Vendor" in 2022 Magic Quadrant™ for data & analytics by Gartner. For more information, visit fractal.ai
Job Description: (Senior Data Scientist – Generative AI) We’re looking for a passionate Data Scientist – Generative AI who thrives at the intersection of AI research & real-world applications. This role is ideal for someone who’s eager to build, experiment & scale LLM-powered solutions in enterprise environments. This role blends hands-on problem solving, research, engineering & collaboration across multidisciplinary teams, driving innovation across industries/domains.
Responsibilities:
• Design and implement advanced solutions utilizing Large Language Models (LLMs).
• Demonstrate self-driven initiative by taking ownership and creating end-to-end solutions.
• Conduct research and stay informed about the latest developments in generative AI and LLMs.
• Develop and maintain code libraries, tools, and frameworks to support generative AI development.
• Participate in code reviews and contribute to maintaining high code quality standards.
• Engage in the entire software development lifecycle, from design and testing to deployment and maintenance.
• Collaborate closely with cross-functional teams to align messaging, contribute to roadmaps, and integrate software into different repositories for core system compatibility.
• Possess strong analytical and problem-solving skills.
• Demonstrate excellent communication skills and the ability to work effectively in a team environment.
Primary Skills:
• Natural Language Processing (NLP): Hands-on experience in use case classification, topic modeling, Q&A and chatbots, search, Document AI, summarization, and content generation. AND/OR
• Computer Vision and Audio: Hands-on experience in image classification, object detection, segmentation, image generation, audio, and video analysis.
• Generative AI:
  o Proficiency with SaaS LLMs, including LangChain, LlamaIndex, vector databases, and prompt engineering (COT, TOT, ReAct, agents)
  o Experience with Azure OpenAI, Google Vertex AI, AWS Bedrock for text/audio/image/video modalities.
  o Familiarity with open-source LLMs, including tools like TensorFlow/PyTorch and Hugging Face.
  o Techniques such as quantization, LLM fine-tuning using PEFT, RLHF, data annotation workflow, and GPU utilization.
• Cloud: Hands-on experience with cloud platforms such as Azure, AWS, and GCP. Cloud certification is preferred.
• Application Development: Proficiency in Python, Docker, FastAPI/Django/Flask, and Git
Must Have Skills
• 5–10 years of experience in Data Science - NLP, with at least 2 years in GenAI/LLMs
• Proficiency in Python, SQL & ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face)
• Hands-on experience with GenAI tools like LangChain, LlamaIndex, RAG pipelines, Prompt Engineering & vector databases (e.g., FAISS, ChromaDB)
• Strong understanding of NLP techniques including embeddings, topic modeling, text classification, semantic search, summarization, Q&A, chatbots, etc.
• Experience with cloud platforms (GCP, AWS, or Azure) and CI/CD pipelines
• Experience integrating LLMs via Azure OpenAI, Google Vertex AI or AWS Bedrock
• Ability to work independently & drive projects end-to-end from development to production
• Strong problem-solving, data storytelling & communication skills

Posted 6 days ago


4.0 years

11 Lacs

Mohali

On-site

Skill Sets:
• Expertise in ML/DL, model lifecycle management, and MLOps (MLflow, Kubeflow)
• Proficiency in Python, TensorFlow, PyTorch, Scikit-learn, and Hugging Face models
• Strong experience in NLP, fine-tuning transformer models, and dataset preparation
• Hands-on with cloud platforms (AWS, GCP, Azure) and scalable ML deployment (SageMaker, Vertex AI)
• Experience in containerization (Docker, Kubernetes) and CI/CD pipelines
• Knowledge of distributed computing (Spark, Ray), vector databases (FAISS, Milvus), and model optimization (quantization, pruning)
• Familiarity with model evaluation, hyperparameter tuning, and model monitoring for drift detection
Roles and Responsibilities:
• Design and implement end-to-end ML pipelines from data ingestion to production
• Develop, fine-tune, and optimize ML models, ensuring high performance and scalability
• Compare and evaluate models using key metrics (F1-score, AUC-ROC, BLEU, etc.)
• Automate model retraining, monitoring, and drift detection
• Collaborate with engineering teams for seamless ML integration
• Mentor junior team members and enforce best practices
Job Type: Full-time
Pay: Up to ₹1,100,000.00 per year
Schedule: Day shift, Monday to Friday
Application Question(s): How soon can you join us?
Experience: Total: 4 years (Required); Data Science roles: 3 years (Required)
Work Location: In person

Posted 6 days ago


3.0 years

0 Lacs

Kolkata, West Bengal, India

On-site


Role - Data Scientist / Senior Data Scientist / Data Science Manager / Data Science Senior Manager / Data Science Architect
Location - Kolkata / Bangalore / Hyderabad / Pune / Chennai / Gurgaon
Work Mode - Hybrid
Interview Mode - F2F (Kolkata candidates) & Virtual (only for outstation candidates)
Specification - Classical ML / GenAI
Qualification:
• Bachelor's or Master's degree in a quantitative field (CS, machine learning, mathematics, statistics) or equivalent experience.
• 3+ years of experience in data science, building hands-on ML models / GenAI models
• Experience with LMs (Llama (1/2/3), T5, Falcon, LangChain or a similar framework)
• Candidate must be aware of the entire evolution history of NLP (Traditional Language Models to Modern Large Language Models), training data creation, training set-up and fine-tuning
• Candidate must be comfortable interpreting research papers and architecture diagrams of Language Models
• Candidate must be comfortable with LoRA, RAG, Instruct fine-tuning, Quantization, etc.
• Experience leading the end-to-end design, development, and deployment of predictive modeling solutions.
• Excellent programming skills in Python. Strong working knowledge of Python’s numerical, data analysis, or AI frameworks such as NumPy, Pandas, Scikit-learn, Jupyter, etc.
• Advanced SQL skills with SQL Server and Spark experience.
• Knowledge of predictive/prescriptive analytics including Machine Learning algorithms (Supervised and Unsupervised), deep learning algorithms, and Artificial Neural Networks
• Experience with Natural Language Processing (NLTK) and text analytics for information extraction, parsing and topic modeling.
• Excellent verbal and written communication. Strong troubleshooting and problem-solving skills. Thrive in a fast-paced, innovative environment
• Experience with data visualization tools (PowerBI, Tableau, R Shiny, etc.) preferred
• Experience with cloud platforms such as Azure, AWS is preferred but not required.

Posted 6 days ago


3.0 years

0 Lacs

Hyderabad, Telangana, India

On-site


Job Title: Data Science
Location: Hyderabad, Kondapur
Experience: 3-7 Years
Skills: Data Science, Machine Learning, Gen AI, LLM
Key Responsibilities:
• Design and implement sophisticated AI applications leveraging state-of-the-art LLM technologies
• Develop efficient solutions for LLM integration, fine-tuning, and deployment
• Optimize model performance, latency, and resource utilization
• Build and maintain robust data pipelines for training and inference
• Implement advanced prompt engineering techniques and retrieval-augmented generation (RAG)
• Develop evaluation frameworks to measure AI system performance and output quality
• Collaborate with cross-functional teams to understand requirements and deliver solutions
• Mentor junior developers and share AI/LLM knowledge across the organization
• Participate in code reviews and ensure adherence to best practices.
Requirements:
• 5+ years of software development experience with at least 3 years focused on AI/ML technologies
• Strong experience working with transformer-based models and LLM APIs
• Proficiency in Python and relevant AI/ML frameworks (PyTorch, TensorFlow, Hugging Face)
• Experience with vector databases and semantic search technologies
• Solid understanding of prompt engineering, RAG, and fine-tuning techniques
• Familiarity with cloud platforms (AWS, Azure, GCP) for AI model deployment
• Strong problem-solving skills and attention to detail.
• Experience with LLM optimization techniques like quantization and distillation
• Knowledge of AI evaluation metrics and benchmarking methodologies
• Understanding of multimodal AI systems (text, image, audio)
• Experience with containerization and orchestration tools (Docker, Kubernetes)
• Contributions to open-source AI projects or research publications
• Familiarity with AI ethics and responsible AI development
Qualifications:
• Bachelor's degree in computer science, AI, Machine Learning, or related field
• Master's in AI, B.Tech, MCA, or related field

Posted 1 week ago


3.0 years

0 Lacs

Mohali, Punjab

On-site


Company: Chicmic Studios
Job Role: Python Machine Learning & AI Developer
Experience Required: 3+ Years
We are looking for a highly skilled and experienced Python Developer to join our dynamic team. The ideal candidate will have a robust background in developing web applications using Django and Flask, with expertise in deploying and managing applications on AWS. Proficiency in Django Rest Framework (DRF), a solid understanding of machine learning concepts, and hands-on experience with tools like PyTorch, TensorFlow, and transformer architectures are essential.
Key Responsibilities
• Develop and maintain web applications using Django and Flask frameworks.
• Design and implement RESTful APIs using Django Rest Framework (DRF).
• Deploy, manage, and optimize applications on AWS services, including EC2, S3, RDS, Lambda, and CloudFormation.
• Build and integrate APIs for AI/ML models into existing systems.
• Create scalable machine learning models using frameworks like PyTorch, TensorFlow, and scikit-learn.
• Implement transformer architectures (e.g., BERT, GPT) for NLP and other advanced AI use cases.
• Optimize machine learning models through advanced techniques such as hyperparameter tuning, pruning, and quantization.
• Deploy and manage machine learning models in production environments using tools like TensorFlow Serving, TorchServe, and AWS SageMaker.
• Ensure the scalability, performance, and reliability of applications and deployed models.
• Collaborate with cross-functional teams to analyze requirements and deliver effective technical solutions.
• Write clean, maintainable, and efficient code following best practices.
• Conduct code reviews and provide constructive feedback to peers.
• Stay up-to-date with the latest industry trends and technologies, particularly in AI/ML.
Required Skills and Qualifications
• Bachelor’s degree in Computer Science, Engineering, or a related field.
• 3+ years of professional experience as a Python Developer.
• Proficient in Python with a strong understanding of its ecosystem.
• Extensive experience with Django and Flask frameworks.
• Hands-on experience with AWS services for application deployment and management.
• Strong knowledge of Django Rest Framework (DRF) for building APIs.
• Expertise in machine learning frameworks such as PyTorch, TensorFlow, and scikit-learn.
• Experience with transformer architectures for NLP and advanced AI solutions.
• Solid understanding of SQL and NoSQL databases (e.g., PostgreSQL, MongoDB).
• Familiarity with MLOps practices for managing the machine learning lifecycle.
• Basic knowledge of front-end technologies (e.g., JavaScript, HTML, CSS) is a plus.
• Excellent problem-solving skills and the ability to work independently and as part of a team.
• Strong communication skills and the ability to articulate complex technical concepts to non-technical stakeholders.
Contact: 9875952836
Office Location: F273, Phase 8b Industrial Area Mohali, Punjab.
Job Type: Full-time
Schedule: Day shift, Monday to Friday
Work Location: In person

Posted 1 week ago

5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

Hello, Truecaller is calling you from Bangalore, India! Ready to pick up? Our goal is to make communication smarter, safer, and more efficient, all while building trust everywhere. We're all about bringing you smart services with a big social impact, keeping you safe from fraud, harassment, scam calls or messages, so you can focus on the conversations that matter. Top 20 most downloaded apps globally, and world’s #1 caller ID and spam-blocking service for Android and iOS, with extensive AI capabilities, with more than 400 million active users per month. Founded in 2009, listed on Nasdaq OMX Stockholm and is categorized as a Large Cap. Our focus on innovation, operational excellence, sustainable growth, and collaboration has resulted in consistently high profitability and strong EBITDA margins. A team of 400 people from ~35 different nationalities spread across our headquarters in Stockholm and offices in Bangalore, Mumbai, Gurgaon and Tel Aviv with high ambitions . We in the Insights Team are responsible for SMS Categorization, Fraud detection and other Smart SMS features within the Truecaller app. The OTP & bank notifications, bill & travel reminder alerts are some examples of the Smart SMS features. The team has developed a patented offline text parser that powers all these features and the team is also exploring cutting edge technologies like LLM to enhance the Smart SMS features. The team’s mission is to become the World’s most loved and trusted SMS app which is aligned with Truecaller’s vision to make communication safe and efficient. Smart SMS is used by over 90M users every day. As a Senior Data Scientist, you will be responsible for collecting, organizing, analyzing, and interpreting Truecaller data with a focus on NLP. In this role, you will be pivotal in advancing our work with large language models and on-device models across diverse regions. 
Your expertise will enhance our natural language processing, machine learning, and predictive analytics capabilities. What you bring in: 5+ years of experience in designing, developing, and deploying ML models at scale, with a focus on NLP-driven solutions. Strong background in Natural Language Processing (NLP), including text classification, entity recognition, language modeling, and transformer-based architectures. Experience in building and deploying models at scale, handling millions of messages efficiently while maintaining performance and accuracy, as well as working with on-device models. Ability to not only build ML models but also take ownership of deploying them into production, ensuring scalability, reliability, and monitoring. Knowledge of anomaly detection, adversarial ML techniques, and risk modeling to identify and prevent spam and fraudulent messaging activities. Strong ability to take ML models from research and experimentation to production, working closely with ML engineers and data engineers. Expertise in machine learning libraries such as TensorFlow, PyTorch, pandas and Scikit-learn, along with NLP-specific tools like Hugging Face Transformers and spaCy, with experience in TFLite and ONNX. Hands-on experience fine-tuning LLMs including transformer-based architectures (BERT, GPT, LLaMA, T5, etc.) for domain-specific applications, including knowledge distillation, quantization, and model compression for efficiency. Strong ability to design, refine, and optimize prompts for LLM-based applications, ensuring high-quality responses and reduced model hallucinations. Ability to drive data-driven decisions through experimentation and statistical analysis to improve models and business outcomes. Strong understanding of designing, testing, and optimizing prompts for LLM-based applications to improve model accuracy and efficiency. Programming knowledge in at least one language, such as Python or R, preferably Python. Expert knowledge of machine learning algorithms. 
Familiarity with database modelling and data warehousing principles with a working knowledge of SQL Experience in building and optimizing large-scale data processing systems using Spark/PySpark Strong ability to work cross-functionally with engineers, product managers, and business stakeholders to align ML solutions with company objectives. The impact you will create: Take a loosely defined business problem and break it into tractable data problems. For each data problem, clearly articulate the value of solving it, its impact, and its complexity. Collaborate with Product and Engineering to scope, design, and implement systems that solve complex business problems ensuring they are delivered on time and within scope. Design, develop, and optimize state-of-the-art NLP models for large-scale message classification, fraud detection, and spam filtering, impacting millions of users globally. Take full ownership of ML model development, deployment, and monitoring, ensuring models are production-ready, scalable, and cost-efficient. Lead data science projects from ideation to deployment, ensuring alignment with business objectives and timelines. Manage and analyze large datasets collected from multiple countries, ensuring data integrity and consistency. Stay updated on industry best practices and emerging technologies to drive innovation within the Data Team. You work collaboratively across systems and teams to solve user and business problems. You are expected to help define success and design and build the systems to achieve it. To work with the Product to decide on priorities and set direction, design solutions, and help the team implement them. It would be great if you also have: Understanding of Conversational AI Deploying NLP models in production Working knowledge of GCP components Life at Truecaller - Behind the code: https://www.instagram.com/lifeattruecaller/ Sounds like your dream job? 
We will fill the position as soon as we find the right candidate, so please send your application as soon as possible. As part of the recruitment process, we will conduct a background check. This position is based in Bangalore, India. We only accept applications in English. What we offer: A smart, talented and agile team: An international team where ~35 nationalities are working together in several locations and time zones with a learning, sharing and fun environment. A great compensation package: Competitive salary, 30 days of paid vacation, flexible working hours, private health insurance, parental leave, telephone bill reimbursement, Udemy membership to keep learning and improving and Wellness allowance. Great tech tools: Pick the computer and phone that you fancy the most within our budget ranges. Office life: We strongly believe in in-person collaboration and follow an office-first approach while offering some flexibility. Enjoy your days with great colleagues with loads of good stuff to learn from, daily lunch and breakfast and a wide range of healthy snacks and beverages. In addition, every now and then check out the playroom for a fun break or join our exciting parties and team activities such as Lab days, sports meetups etc. There's something for everyone! Come as you are: Truecaller is diverse, equal and inclusive. We need a wide variety of backgrounds, perspectives, beliefs and experiences in order to keep building our great products. No matter where you are based, which language you speak, your accent, race, religion, color, nationality, gender, sexual orientation, age, marital status, etc. All those things make you who you are, and that’s why we would love to meet you.

Posted 1 week ago

0 years

9 - 10 Lacs

Bengaluru

On-site

MTS Software System Design Eng. Bangalore, India Engineering 66058 Job Description WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. AMD together we advance_ THE ROLE: AMD is looking for an experienced engineer for an exciting role in Server CPU software development team. This person will be a member of a core team and will work with the latest hardware and software technology. The person will interact closely with key AMD technical experts to ensure the best possible performance and results on AMD platforms. THE PERSON: The successful candidate for this position will be interacting with software and hardware technologists working across many locations. This is a great opportunity to work as a part of highly regarded team to deliver leading edge solutions. KEY RESPONSIBILITIES: Problem solving across multiple software layers, (user space, kernel, applications, libraries) and hardware. Optimization/development of the CPU performance stack (applications, libraries) for AMD server and workstation processors on Windows platform. Analyze and solve performance, scalability bottlenecks when code is running on multi-core, multi-node deployments. Innovate and publish papers, patents and participate in technical conferences to advance AMD technologies. Continuously learn and grow along with evolving X86 server CPU architecture and application landscape. 
PREFERRED EXPERIENCE: Image processing skills: Color format conversions, Image Filtering and Enhancement operations, Morphological operations, Image transforms and statistical operations. Good understanding in Image Detection, Segmentation, Recognition, Restoration and Medical Imaging. Knowledge in Signal Processing theory like Sampling, Quantization, DFT and FFT. Multi-threaded FFT computing, Distributed FFT computing Very strong data structure and algorithmic skills. Experience in identifying performance bottlenecks, and designing/implementing optimizations to relieve analyzed bottlenecks. Strong Windows internals with experience in software development using C/C++ and debugging skills on multicore systems (preferably using OpenMP). Experience in performance analysis for data center, HPC (High Performance Computing), MPI (Message passing Interface) applications. Experience in x86 (or other architecture based) optimizations. Understanding of Cache sub-system, Instruction Set Architecture, pipeline (for any CPU). Bonus skills: Experience on Intel MKL libraries, Linear Algebra, x86 assembly programming (vector/SIMD), porting source code from Linux to Windows, development on Windows servers Knowledge of one or more CPU Profiling tools (preferably in Windows). ACADEMIC CREDENTIALS: B.Tech/M.Tech/MS/B.E/M.E in computer science or related fields #LI-NS2 AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. 
We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Posted 1 week ago

0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Linkedin logo

Position Title: AI/ML Engineer Company : Cyfuture India Pvt. Ltd. Industry : IT Services and IT Consulting Location : Sector 81, NSEZ, Noida (5 Days Work From Office) Website : www.cyfuture.com About Cyfuture Cyfuture is a trusted name in IT services and cloud infrastructure, offering state-of-the-art data center solutions and managed services across platforms like AWS, Azure, and VMWare. We are expanding rapidly in system integration and managed services, building strong alliances with global OEMs like VMWare, AWS, Azure, HP, Dell, Lenovo, and Palo Alto. Position Overview We are hiring an experienced AI/ML Engineer to lead and shape our AI/ML initiatives. The ideal candidate will have hands-on experience in machine learning and artificial intelligence, with strong leadership capabilities and a passion for delivering production-ready solutions. This role involves end-to-end ownership of AI/ML projects, from strategy development to deployment and optimization of large-scale systems. Key Responsibilities Lead and mentor a high-performing AI/ML team. Design and execute AI/ML strategies aligned with business goals. Collaborate with product and engineering teams to identify impactful AI opportunities. Build, train, fine-tune, and deploy ML models in production environments. Manage operations of LLMs and other AI models using modern cloud and MLOps tools. Implement scalable and automated ML pipelines (e.g., with Kubeflow or MLRun). Handle containerization and orchestration using Docker and Kubernetes. Optimize GPU/TPU resources for training and inference tasks. Develop efficient RAG pipelines with low latency and high retrieval accuracy. Automate CI/CD workflows for continuous integration and delivery of ML systems. Key Skills & Expertise 1. Cloud Computing & Deployment Proficiency in AWS, Google Cloud, or Azure for scalable model deployment. Familiarity with cloud-native services like AWS SageMaker, Google Vertex AI, or Azure ML. 
Expertise in Docker and Kubernetes for containerized deployments. Experience with Infrastructure as Code (IaC) using tools like Terraform or CloudFormation. 2. Machine Learning & Deep Learning Strong command of frameworks: TensorFlow, PyTorch, Scikit-learn, XGBoost. Experience with MLOps tools for integration, monitoring, and automation. Expertise in pre-trained models, transfer learning, and designing custom architectures. 3. Programming & Software Engineering Strong skills in Python (NumPy, Pandas, Matplotlib, SciPy) for ML development. Backend/API development with FastAPI, Flask, or Django. Database handling with SQL and NoSQL (PostgreSQL, MongoDB, BigQuery). Familiarity with CI/CD pipelines (GitHub Actions, Jenkins). 4. Scalable AI Systems Proven ability to build AI-driven applications at scale. Handle large datasets, high-throughput requests, and real-time inference. Knowledge of distributed computing: Apache Spark, Dask, Ray. 5. Model Monitoring & Optimization Hands-on with model compression, quantization, and pruning. A/B testing and performance tracking in production. Knowledge of model retraining pipelines for continuous learning. 6. Resource Optimization Efficient use of compute resources: GPUs, TPUs, CPUs. Experience with serverless architectures to reduce cost. Auto-scaling and load balancing for high-traffic systems. 7. Problem-Solving & Collaboration Translate complex ML models into user-friendly applications. Work effectively with data scientists, engineers, and product teams. Write clear technical documentation and architecture reports.

Posted 1 week ago

4.0 years

0 Lacs

India

Remote

Linkedin logo

About the Role We are hiring an AI/ML Engineer based in India. You will help design, develop, optimize, and deploy multimodal AI models for eye disease screening using image and tabular/textual data. You will collaborate closely with AI researchers, engineers, and product teams to build and translate cutting-edge models. This is an opportunity to work at the frontier of clinical AI with purpose-driven colleagues and powerful social impact. Job Type: Full-Time, Remote Start Date: Immediate Compensation: 35-50 lakh INR, commensurate with experience Responsibilities Design and implement multimodal deep learning models combining image encoders and language models. Train, fine-tune, and optimize models using annotated eye images and structured clinical data. Implement instruction-tuned outputs for diagnosis, referral decisions, and patient counseling in English, Hindi, and Tamil. Optimize inference performance using quantization, pruning, and model distillation for deployment on smartphones or edge devices. Work with mobile and backend engineers to integrate models with our telemedicine app and cloud-based infrastructure. Contribute to model evaluation across clinical sites using real-world patient data to measure accuracy, bias, and latency. Support development of responsible AI pipelines: privacy, bias mitigation, versioning, and drift detection. Qualifications Must-Have: Bachelor’s or Master’s degree in Computer Science, AI, Biomedical Engineering, or related field. 4+ years of experience with deep learning frameworks (Master’s degree can substitute for experience). Hands-on experience training or fine-tuning transformer models (e.g., LLaMA, T5, GPT). Hands-on experience working with vision models (CNNs, ViTs). Fluency with version control (Git), collaborative workflows, and cloud-based development. Passion for using AI in global health or social impact domains. Preferred: Experience with multimodal fusion techniques (cross-attention, late fusion, MLP). 
Experience implementing or optimizing Retrieval-Augmented Generation (RAG) pipelines for domain-specific applications (e.g., medical QA, knowledge-grounded generation). Experience working with multilingual NLP/NLG (especially Hindi and Tamil). Prior work with model optimization for edge deployment using ONNX, Core ML, TensorFlow Lite, or quantized PyTorch models, or knowledge of hybrid cloud/on-device design. Proven ability to mentor junior engineers and foster a culture of technical growth and collaboration. Strong written and verbal communication skills with cross-functional stakeholders (e.g., product, clinical, design). Why Join Us Work on a mission-driven project with real clinical impact across underserved communities in India. Flexibility to work remotely while contributing to a globally recognized AI/health project. (Note: Must be available for 2–3 regularly scheduled Zoom meetings per week and one in-person week per year.) Join a startup-minded team with stable long-term partnerships and funding. Apply here: https://www.visilant.org/careers/aiml-engineer-india-job-post Note: Applications through LinkedIn will not be reviewed. About Visilant: Visilant is a digital health social enterprise spun out of Johns Hopkins University. Visilant builds smartphone-based imaging, telemedicine, and artificial intelligence to empower non-eye care specialists to screen patients for leading causes of blindness. Visilant has already screened over 30,000 patients in partnership with the largest eye care systems in India.

Posted 1 week ago

0 years

0 Lacs

India

On-site

Linkedin logo

About the Job As an AI/ML Engineer, you will be responsible for designing, validating, and integrating cutting-edge machine learning models and algorithms. Collaborate closely with cross-functional teams, including data scientists, to recognize and establish project objectives. Oversee data infrastructure maintenance, ensuring streamlined and scalable data operations. Stay updated with advancements in AI and propose their integration for operational enhancement. Effectively convey detailed data insights to non-technical stakeholders. Uphold stringent data privacy and security protocols. Engage in the full lifecycle of AI projects, spanning from ideation through deployment and continuous upkeep. Core Responsibilities Develop, validate, and implement machine learning models and algorithms. Collaborate with data scientists and other stakeholders to understand and define project goals. Maintain data infrastructure and ensure scalability and efficiency of data-related operations. Stay abreast of the latest developments in the field of AI/ML and recommend ways to implement them in our operations. Communicate complex data findings in a clear and understandable manner to non-technical stakeholders. Adhere to data privacy and security guidelines. Participate in the entire AI project lifecycle, from concept to deployment and maintenance. Required Skills Strong grasp of computer architecture, data structures, system software, and machine learning fundamentals. Solid theoretical understanding of machine learning. Experience with mapping NLP models (BERT and GPT) to accelerators and awareness of trade-offs across memory, bandwidth, and compute. Experience with vector databases like Chroma DB, Pinecone, PGVector or similar. Experience with Large Language Models like GPT-3.5, GPT-4, GPT-4o, Llama, Gemini, Mistral etc. 
Experience in LLM integration frameworks like LangChain, LlamaIndex, AgentGPT etc. Experience with ML models from definition to deployment, including training, quantization, sparsity, model preprocessing, and deployment. Proficiency in Python development in a Linux environment and using standard development tools. Experience with deep learning frameworks (such as PyTorch, TensorFlow, Keras, Spark). Working knowledge of Artificial Neural Networks (ANNs), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and Generative Adversarial Networks (GANs). Experience in training, tuning, and deploying ML models for Computer Vision (e.g., ResNet), and/or Recommendation Systems (e.g., DLRM). Experience deploying ML workloads on distributed systems. Self-motivated team player with a strong sense of ownership and leadership. Strong verbal, written, and organizational skills for effective communication and documentation. Research background with a publication record. Work experience at a cloud provider or AI compute/sub-system company. Knowledge of cloud computing platforms and services, such as AWS, Azure, or Google Cloud. Experience with information security and secure development best practices. Qualifications Bachelor's or higher degree in Computer Science, Engineering, Mathematics, Statistics, Physics, or a related field. 2-4 yrs of hands-on experience in AI/ML

Posted 1 week ago

0 years

0 Lacs

India

On-site

Linkedin logo

Compensation: INR 2 crore per year including incentives. Please do NOT apply if you have not built 1-2 SLMs for clients before. Multiplier AI is a leader in AI accelerators for life sciences and is due for listing. About the Role We are seeking a seasoned and forward-thinking Head for AI and SLM to spearhead Small Language Model (SLM) implementation projects across enterprise and industry-specific use cases. This is a high-impact leadership role that combines deep technical expertise with strategic consulting to deliver scalable, efficient, and secure SLM solutions. Key Responsibilities Lead end-to-end design and deployment of Small Language Models (SLMs) in production environments. Define architecture for on-device or private-cloud SLM deployments, optimizing for latency, token cost, and privacy. Collaborate with cross-functional teams (data, MLOps, product, security) to integrate SLMs into existing systems and workflows. Select and fine-tune open-source or custom SLMs (e.g., Phi-3, TinyLlama, Mistral) for targeted business use cases. Mentor engineering and data science teams on best practices in efficient prompt engineering, RAG pipelines, quantization, and distillation techniques. Act as a thought partner to leadership and clients on GenAI roadmap, risk management, and responsible AI design. Required Skills & Experience Proven experience in deploying Small Language Models in production (not just large-scale LLMs). This is essential; do not apply if you have not done this. Strong understanding of transformer architecture, tokenizer design, and parameter-efficient fine-tuning (LoRA, QLoRA). Hands-on with HuggingFace, ONNX, GGUF, and GPU/CPU/edge model optimization techniques. Experience integrating SLMs into real-world systems: mobile apps, secure enterprise workflows, or embedded devices. Background in Python, PyTorch/TensorFlow, and familiarity with MLOps tools like Weights & Biases, MLflow, and LangChain. Strategic mindset to balance model performance vs. cost vs. 
explainability. Preferred Qualifications Prior consulting experience with AI/ML deployments in pharma, finance, or regulated sectors. Familiarity with privacy-preserving AI, federated learning, or differential privacy. Contributions to open-source LLM/SLM projects. What We Offer Leadership in shaping the future of lightweight AI. Exposure to cutting-edge GenAI applications across industries. Competitive compensation and equity options (for permanent roles).

Posted 1 week ago

5.0 years

0 Lacs

Nilanga, Maharashtra, India

On-site

Linkedin logo

Role Description Who we are: At UST, we help the world’s best organizations grow and succeed through transformation. Bringing together the right talent, tools, and ideas, we work with our client to co-create lasting change. Together, with over 26,000 employees in 25 countries, we build for boundless impact, touching billions of lives in the process. Visit us at . You Are We are looking for an experienced candidate with around 5 years in artificial intelligence and deep learning frameworks. Develop and deploy deep learning architectures using TensorFlow and PyTorch. Optimize model performance, including hyperparameter tuning and quantization. Collaborate with data scientists, engineers, and business teams to define ML use cases. Monitor and improve model performance post-deployment. Therefore, it’s essential that you are skilled at problem solving, solution design, and high-quality coding. The Opportunity Research and evaluate new algorithms and techniques to improve model performance. Contribute to the development of innovative machine learning solutions. Develop AI/ML applications and data pipelines for data preprocessing and feature engineering. Engage in research activities to explore the latest advancements in generative AI techniques and algorithms. Stay updated with current literature and emerging trends in the field. Troubleshoot and resolve technical issues related to machine learning models and software systems. Communicate complex technical concepts effectively to both technical and non-technical audiences. What You Need Experience with TensorFlow, PyTorch and deep learning model development. Experience with ML model deployment (Docker, Kubernetes, TensorFlow Serving, TorchScript). Knowledge of CNNs, RNNs, Transformers, GANs and other advanced models. Understanding of data structures, data modeling and software architecture. Ability to write robust code in Python. 
Outstanding analytical and problem-solving skills. What We Believe We’re proud to embrace the same values that have shaped UST since the beginning. Since day one, we’ve been building enduring relationships and a culture of integrity. And today, it's those same values that are inspiring us to encourage innovation from everyone, to champion diversity and inclusion and to place people at the center of everything we do. Humility We will listen, learn, be empathetic and help selflessly in our interactions with everyone. Humanity Through business, we will better the lives of those less fortunate than ourselves. Integrity We honor our commitments and act with responsibility in all our relationships. Equal Employment Opportunity Statement UST is an Equal Opportunity Employer. We believe that no one should be discriminated against because of their differences, such as age, disability, ethnicity, gender, gender identity and expression, religion or sexual orientation. All employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, veteran status, sexual orientation, gender identity or expression, genetic information, marital status, citizenship status or any other basis as protected by federal, state, or local law. UST reserves the right to periodically redefine your roles and responsibilities based on the requirements of the organization and/or your performance. To support and promote the values of UST. Comply with all Company policies and procedures. Skills Artificial Intelligence, Machine Learning, CNN, Python

Posted 1 week ago

8.0 years

0 - 0 Lacs

Vellore, Tamil Nadu, India

Remote

Linkedin logo

Experience : 8.00 + years Salary : USD 4074-4814 / month (based on experience) Expected Notice Period : 15 Days Shift : (GMT+05:30) Asia/Kolkata (IST) Opportunity Type : Remote Placement Type : Full Time Contract for 5 Months(40 hrs a week/160 hrs a month) (*Note: This is a requirement for one of Uplers' client - A leading US-based digital consultancy with a track record of excellence) What do you need for this opportunity? Must have skills required: FastAPI, Hugging Face, Knowledge graphs, MLOps, Quantization, TensorFlow, AI, ChatGPT, LLM Fine-tuning, Rag (retrieval-augmented generation), Vector databases, Python A leading US-based digital consultancy with a track record of excellence is Looking for: Role : Senior Python / AI Engineer: Hybrid (Mumbai) Experience : 6+years Work Location : Mumbai : Hybrid (One week in a month) Engagement : Contract To Hire (Initially 5 months to start) : Start date : Immediate Timing : 2pm to 11 pm IST Interview process: 2 Rounds(Aptitude round + Technical round ) Job Description : Overall Job Mission: To design, develop, implement, and optimize AI-driven solutions by effectively leveraging and integrating existing Large Language Models and related technologies. Outcomes (What does the person need to achieve?) LLM Integration & Application Development: Successfully integrate existing LLMs (e.g., GPT, LLaMA, Mistral, Claude, Gemini) into Python-based applications to deliver AI-powered features. (e.g., Develop and deploy 3 applications with LLM-driven functionality within the first 6 months with a user satisfaction rating of 4.5/5). Prompt Engineering & Optimization: Design, implement, and rigorously test prompts to maximize the effectiveness and accuracy of existing LLMs for specific application requirements. (e.g., Improve the accuracy of LLM-driven features by 20% through prompt engineering best practices). 
AI Solution Optimization: Optimize the performance, efficiency, and scalability of AI solutions built with LLMs, focusing on factors like response time, cost-effectiveness, and resource utilization. (e.g., Reduce the average response time of LLM-based applications by 15% while maintaining accuracy). Data Handling & Retrieval: Implement effective data processing, including preprocessing and cleaning of text datasets, and utilize vector databases to enable efficient information retrieval for LLM applications. (e.g., Achieve a 90% success rate in retrieving relevant information from vector databases for LLM queries). Deployment & Scalability: Deploy and scale LLM-powered applications on cloud platforms to support a growing user base and ensure high availability. (e.g., Successfully scale LLM applications to handle a 50% increase in user traffic without performance degradation). Competencies (How does the person need to behave?) LLM Application Expertise: Possesses strong skills in integrating and applying existing LLMs through APIs and libraries, with a focus on prompt engineering and application development. Python Development & AI Frameworks: Demonstrates proficiency in Python programming and AI/ML frameworks (Hugging Face, PyTorch, TensorFlow) for building and deploying LLM-based solutions. Problem Solving & Adaptability: Exhibits the ability to solve challenges related to LLM integration, optimize performance, and adapt to the evolving landscape of LLM technologies. Collaboration & Communication: Effectively communicates technical solutions and collaborates with cross-functional teams to deliver impactful AI applications. Results Orientation: Focuses on delivering functional, efficient, and scalable AI solutions that meet business needs and user expectations. Required Skills & Experience- Must-Have Hands-on Experience: Python programming with AI/ML frameworks (Hugging Face, PyTorch, TensorFlow). Hands-on experience working with LLMs and fine-tuning. 
Experience in prompt engineering and optimizing AI model outputs. Building APIs with FastAPI or Flask for AI model integration. Familiarity with vector databases and embedding models. Experience with LangChain, LlamaIndex, or Retrieval-Augmented Generation (RAG). Nice to Have (or Learn on the Job): Knowledge of quantization and related optimization tooling (LoRA, GPTQ, vLLM, ONNX) for efficient model deployment. Experience working with knowledge graphs and reasoning-based AI. Background in MLOps for tracking and managing AI models. How to apply for this opportunity? Step 1: Click on Apply, then register or log in on our portal. Step 2: Complete the screening form and upload your updated resume. Step 3: Increase your chances of getting shortlisted and meet the client for the interview. About Uplers: Our goal is to make hiring reliable, simple, and fast. Our role is to help all our talents find and apply for relevant contractual onsite opportunities and progress in their careers. We will support you with any grievances or challenges you may face during the engagement. (Note: There are many more opportunities on the portal apart from this one. Depending on the assessments you clear, you can apply for them as well.) So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

Posted 1 week ago

5.0 years

0 Lacs

Bengaluru

On-site

About Us: We are an innovative company revolutionising retail checkout experiences by replacing traditional barcodes with cutting-edge Computer Vision technology. Our platform enables seamless, faster, and smarter checkout processes, enhancing the shopping experience for both retailers and consumers. We're growing rapidly and are looking for an experienced Android/Cross-Platform App Developer to join our team and help us build the future of retail technology. Key Responsibilities: Lead the research, design, and development of advanced computer vision models for tasks like object detection, tracking, segmentation, OCR, scene understanding, and 3D vision. Translate business needs into scalable scientific solutions using state-of-the-art deep learning and classical computer vision techniques. Design and implement experiments to evaluate performance, robustness, and accuracy of CV models in real-world production scenarios. Collaborate with cross-functional teams including software engineering, product, and data teams to integrate vision models into applications. Drive innovation through internal IP generation (patents, publications) and contribute to the long-term AI/ML roadmap. Provide scientific and technical leadership, mentoring junior scientists and reviewing designs and architectures. Stay up to date with the latest developments in AI, deep learning, and computer vision through academic and industrial research, and with industry best practices and emerging technologies, to drive continuous improvement. Qualifications: M.S. or Ph.D. in Computer Science, Electrical Engineering, or a related field with specialization in Computer Vision or Machine Learning. 5+ years of hands-on experience in building and deploying production-grade computer vision models. Strong theoretical background and applied experience in deep learning frameworks (e.g., PyTorch) and model architectures (e.g., CNNs, Vision Transformers, Diffusion Models). 
Experience in working with large-scale datasets, training pipelines, and performance evaluation metrics. Proficiency in Python and scientific computing libraries (e.g., NumPy, OpenCV, scikit-learn). Experience with model optimization for edge deployment (ONNX, TensorRT, pruning/quantization) is a strong plus. Strong written and verbal communication skills, with a track record of mentoring and collaboration. Preferred Qualifications: Experience with computer vision in real-time systems (e.g., AR/VR, robotics, automotive, surveillance). Published research papers in top-tier conferences (CVPR, ICCV, NeurIPS, etc.). Exposure to MLOps or ML model lifecycle in production environments. Familiarity with cloud platforms (AWS/GCP/Azure), containerization tools (Docker, Kubernetes), and basic Bash scripting.

Posted 1 week ago

8.0 years

0 - 0 Lacs

Faridabad, Haryana, India

Remote

Experience: 8+ years. Salary: USD 4074-4814 / month (based on experience). Expected Notice Period: 15 days. Shift: (GMT+05:30) Asia/Kolkata (IST). Opportunity Type: Remote. Placement Type: Full-time contract for 5 months (40 hrs a week / 160 hrs a month). (*Note: This is a requirement for one of Uplers' clients, a leading US-based digital consultancy with a track record of excellence.) What do you need for this opportunity? Must-have skills: FastAPI, Hugging Face, knowledge graphs, MLOps, quantization, TensorFlow, AI, ChatGPT, LLM fine-tuning, RAG (retrieval-augmented generation), vector databases, Python. A leading US-based digital consultancy with a track record of excellence is looking for: Role: Senior Python / AI Engineer, Hybrid (Mumbai). Experience: 6+ years. Work Location: Mumbai, hybrid (one week in a month). Engagement: Contract to hire (initially 5 months to start). Start date: Immediate. Timing: 2 pm to 11 pm IST. Interview process: 2 rounds (aptitude round + technical round). Job Description: Overall Job Mission: To design, develop, implement, and optimize AI-driven solutions by effectively leveraging and integrating existing Large Language Models and related technologies. Outcomes (What does the person need to achieve?) LLM Integration & Application Development: Successfully integrate existing LLMs (e.g., GPT, LLaMA, Mistral, Claude, Gemini) into Python-based applications to deliver AI-powered features. (e.g., Develop and deploy 3 applications with LLM-driven functionality within the first 6 months with a user satisfaction rating of 4.5/5). Prompt Engineering & Optimization: Design, implement, and rigorously test prompts to maximize the effectiveness and accuracy of existing LLMs for specific application requirements. (e.g., Improve the accuracy of LLM-driven features by 20% through prompt engineering best practices). 
AI Solution Optimization: Optimize the performance, efficiency, and scalability of AI solutions built with LLMs, focusing on factors like response time, cost-effectiveness, and resource utilization. (e.g., Reduce the average response time of LLM-based applications by 15% while maintaining accuracy). Data Handling & Retrieval: Implement effective data processing, including preprocessing and cleaning of text datasets, and utilize vector databases to enable efficient information retrieval for LLM applications. (e.g., Achieve a 90% success rate in retrieving relevant information from vector databases for LLM queries). Deployment & Scalability: Deploy and scale LLM-powered applications on cloud platforms to support a growing user base and ensure high availability. (e.g., Successfully scale LLM applications to handle a 50% increase in user traffic without performance degradation). Competencies (How does the person need to behave?) LLM Application Expertise: Possesses strong skills in integrating and applying existing LLMs through APIs and libraries, with a focus on prompt engineering and application development. Python Development & AI Frameworks: Demonstrates proficiency in Python programming and AI/ML frameworks (Hugging Face, PyTorch, TensorFlow) for building and deploying LLM-based solutions. Problem Solving & Adaptability: Exhibits the ability to solve challenges related to LLM integration, optimize performance, and adapt to the evolving landscape of LLM technologies. Collaboration & Communication: Effectively communicates technical solutions and collaborates with cross-functional teams to deliver impactful AI applications. Results Orientation: Focuses on delivering functional, efficient, and scalable AI solutions that meet business needs and user expectations. Required Skills & Experience- Must-Have Hands-on Experience: Python programming with AI/ML frameworks (Hugging Face, PyTorch, TensorFlow). Hands-on experience working with LLMs and fine-tuning. 
Experience in prompt engineering and optimizing AI model outputs. Building APIs with FastAPI or Flask for AI model integration. Familiarity with vector databases and embedding models. Experience with LangChain, LlamaIndex, or Retrieval-Augmented Generation (RAG). Nice to Have (or Learn on the Job): Knowledge of quantization and related optimization tooling (LoRA, GPTQ, vLLM, ONNX) for efficient model deployment. Experience working with knowledge graphs and reasoning-based AI. Background in MLOps for tracking and managing AI models. How to apply for this opportunity? Step 1: Click on Apply, then register or log in on our portal. Step 2: Complete the screening form and upload your updated resume. Step 3: Increase your chances of getting shortlisted and meet the client for the interview. About Uplers: Our goal is to make hiring reliable, simple, and fast. Our role is to help all our talents find and apply for relevant contractual onsite opportunities and progress in their careers. We will support you with any grievances or challenges you may face during the engagement. (Note: There are many more opportunities on the portal apart from this one. Depending on the assessments you clear, you can apply for them as well.) So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

Posted 1 week ago

8.0 years

0 - 0 Lacs

Cuttack, Odisha, India

Remote

Experience: 8+ years. Salary: USD 4074-4814 / month (based on experience). Expected Notice Period: 15 days. Shift: (GMT+05:30) Asia/Kolkata (IST). Opportunity Type: Remote. Placement Type: Full-time contract for 5 months (40 hrs a week / 160 hrs a month). (*Note: This is a requirement for one of Uplers' clients, a leading US-based digital consultancy with a track record of excellence.) What do you need for this opportunity? Must-have skills: FastAPI, Hugging Face, knowledge graphs, MLOps, quantization, TensorFlow, AI, ChatGPT, LLM fine-tuning, RAG (retrieval-augmented generation), vector databases, Python. A leading US-based digital consultancy with a track record of excellence is looking for: Role: Senior Python / AI Engineer, Hybrid (Mumbai). Experience: 6+ years. Work Location: Mumbai, hybrid (one week in a month). Engagement: Contract to hire (initially 5 months to start). Start date: Immediate. Timing: 2 pm to 11 pm IST. Interview process: 2 rounds (aptitude round + technical round). Job Description: Overall Job Mission: To design, develop, implement, and optimize AI-driven solutions by effectively leveraging and integrating existing Large Language Models and related technologies. Outcomes (What does the person need to achieve?) LLM Integration & Application Development: Successfully integrate existing LLMs (e.g., GPT, LLaMA, Mistral, Claude, Gemini) into Python-based applications to deliver AI-powered features. (e.g., Develop and deploy 3 applications with LLM-driven functionality within the first 6 months with a user satisfaction rating of 4.5/5). Prompt Engineering & Optimization: Design, implement, and rigorously test prompts to maximize the effectiveness and accuracy of existing LLMs for specific application requirements. (e.g., Improve the accuracy of LLM-driven features by 20% through prompt engineering best practices). 
AI Solution Optimization: Optimize the performance, efficiency, and scalability of AI solutions built with LLMs, focusing on factors like response time, cost-effectiveness, and resource utilization. (e.g., Reduce the average response time of LLM-based applications by 15% while maintaining accuracy). Data Handling & Retrieval: Implement effective data processing, including preprocessing and cleaning of text datasets, and utilize vector databases to enable efficient information retrieval for LLM applications. (e.g., Achieve a 90% success rate in retrieving relevant information from vector databases for LLM queries). Deployment & Scalability: Deploy and scale LLM-powered applications on cloud platforms to support a growing user base and ensure high availability. (e.g., Successfully scale LLM applications to handle a 50% increase in user traffic without performance degradation). Competencies (How does the person need to behave?) LLM Application Expertise: Possesses strong skills in integrating and applying existing LLMs through APIs and libraries, with a focus on prompt engineering and application development. Python Development & AI Frameworks: Demonstrates proficiency in Python programming and AI/ML frameworks (Hugging Face, PyTorch, TensorFlow) for building and deploying LLM-based solutions. Problem Solving & Adaptability: Exhibits the ability to solve challenges related to LLM integration, optimize performance, and adapt to the evolving landscape of LLM technologies. Collaboration & Communication: Effectively communicates technical solutions and collaborates with cross-functional teams to deliver impactful AI applications. Results Orientation: Focuses on delivering functional, efficient, and scalable AI solutions that meet business needs and user expectations. Required Skills & Experience- Must-Have Hands-on Experience: Python programming with AI/ML frameworks (Hugging Face, PyTorch, TensorFlow). Hands-on experience working with LLMs and fine-tuning. 
Experience in prompt engineering and optimizing AI model outputs. Building APIs with FastAPI or Flask for AI model integration. Familiarity with vector databases and embedding models. Experience with LangChain, LlamaIndex, or Retrieval-Augmented Generation (RAG). Nice to Have (or Learn on the Job): Knowledge of quantization and related optimization tooling (LoRA, GPTQ, vLLM, ONNX) for efficient model deployment. Experience working with knowledge graphs and reasoning-based AI. Background in MLOps for tracking and managing AI models. How to apply for this opportunity? Step 1: Click on Apply, then register or log in on our portal. Step 2: Complete the screening form and upload your updated resume. Step 3: Increase your chances of getting shortlisted and meet the client for the interview. About Uplers: Our goal is to make hiring reliable, simple, and fast. Our role is to help all our talents find and apply for relevant contractual onsite opportunities and progress in their careers. We will support you with any grievances or challenges you may face during the engagement. (Note: There are many more opportunities on the portal apart from this one. Depending on the assessments you clear, you can apply for them as well.) So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

Posted 1 week ago

8.0 years

0 - 0 Lacs

Bhubaneswar, Odisha, India

Remote

Experience: 8+ years. Salary: USD 4074-4814 / month (based on experience). Expected Notice Period: 15 days. Shift: (GMT+05:30) Asia/Kolkata (IST). Opportunity Type: Remote. Placement Type: Full-time contract for 5 months (40 hrs a week / 160 hrs a month). (*Note: This is a requirement for one of Uplers' clients, a leading US-based digital consultancy with a track record of excellence.) What do you need for this opportunity? Must-have skills: FastAPI, Hugging Face, knowledge graphs, MLOps, quantization, TensorFlow, AI, ChatGPT, LLM fine-tuning, RAG (retrieval-augmented generation), vector databases, Python. A leading US-based digital consultancy with a track record of excellence is looking for: Role: Senior Python / AI Engineer, Hybrid (Mumbai). Experience: 6+ years. Work Location: Mumbai, hybrid (one week in a month). Engagement: Contract to hire (initially 5 months to start). Start date: Immediate. Timing: 2 pm to 11 pm IST. Interview process: 2 rounds (aptitude round + technical round). Job Description: Overall Job Mission: To design, develop, implement, and optimize AI-driven solutions by effectively leveraging and integrating existing Large Language Models and related technologies. Outcomes (What does the person need to achieve?) LLM Integration & Application Development: Successfully integrate existing LLMs (e.g., GPT, LLaMA, Mistral, Claude, Gemini) into Python-based applications to deliver AI-powered features. (e.g., Develop and deploy 3 applications with LLM-driven functionality within the first 6 months with a user satisfaction rating of 4.5/5). Prompt Engineering & Optimization: Design, implement, and rigorously test prompts to maximize the effectiveness and accuracy of existing LLMs for specific application requirements. (e.g., Improve the accuracy of LLM-driven features by 20% through prompt engineering best practices). 
AI Solution Optimization: Optimize the performance, efficiency, and scalability of AI solutions built with LLMs, focusing on factors like response time, cost-effectiveness, and resource utilization. (e.g., Reduce the average response time of LLM-based applications by 15% while maintaining accuracy). Data Handling & Retrieval: Implement effective data processing, including preprocessing and cleaning of text datasets, and utilize vector databases to enable efficient information retrieval for LLM applications. (e.g., Achieve a 90% success rate in retrieving relevant information from vector databases for LLM queries). Deployment & Scalability: Deploy and scale LLM-powered applications on cloud platforms to support a growing user base and ensure high availability. (e.g., Successfully scale LLM applications to handle a 50% increase in user traffic without performance degradation). Competencies (How does the person need to behave?) LLM Application Expertise: Possesses strong skills in integrating and applying existing LLMs through APIs and libraries, with a focus on prompt engineering and application development. Python Development & AI Frameworks: Demonstrates proficiency in Python programming and AI/ML frameworks (Hugging Face, PyTorch, TensorFlow) for building and deploying LLM-based solutions. Problem Solving & Adaptability: Exhibits the ability to solve challenges related to LLM integration, optimize performance, and adapt to the evolving landscape of LLM technologies. Collaboration & Communication: Effectively communicates technical solutions and collaborates with cross-functional teams to deliver impactful AI applications. Results Orientation: Focuses on delivering functional, efficient, and scalable AI solutions that meet business needs and user expectations. Required Skills & Experience- Must-Have Hands-on Experience: Python programming with AI/ML frameworks (Hugging Face, PyTorch, TensorFlow). Hands-on experience working with LLMs and fine-tuning. 
Experience in prompt engineering and optimizing AI model outputs. Building APIs with FastAPI or Flask for AI model integration. Familiarity with vector databases and embedding models. Experience with LangChain, LlamaIndex, or Retrieval-Augmented Generation (RAG). Nice to Have (or Learn on the Job): Knowledge of quantization and related optimization tooling (LoRA, GPTQ, vLLM, ONNX) for efficient model deployment. Experience working with knowledge graphs and reasoning-based AI. Background in MLOps for tracking and managing AI models. How to apply for this opportunity? Step 1: Click on Apply, then register or log in on our portal. Step 2: Complete the screening form and upload your updated resume. Step 3: Increase your chances of getting shortlisted and meet the client for the interview. About Uplers: Our goal is to make hiring reliable, simple, and fast. Our role is to help all our talents find and apply for relevant contractual onsite opportunities and progress in their careers. We will support you with any grievances or challenges you may face during the engagement. (Note: There are many more opportunities on the portal apart from this one. Depending on the assessments you clear, you can apply for them as well.) So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

Posted 1 week ago

8.0 years

0 - 0 Lacs

Kolkata, West Bengal, India

Remote

Experience: 8+ years. Salary: USD 4074-4814 / month (based on experience). Expected Notice Period: 15 days. Shift: (GMT+05:30) Asia/Kolkata (IST). Opportunity Type: Remote. Placement Type: Full-time contract for 5 months (40 hrs a week / 160 hrs a month). (*Note: This is a requirement for one of Uplers' clients, a leading US-based digital consultancy with a track record of excellence.) What do you need for this opportunity? Must-have skills: FastAPI, Hugging Face, knowledge graphs, MLOps, quantization, TensorFlow, AI, ChatGPT, LLM fine-tuning, RAG (retrieval-augmented generation), vector databases, Python. A leading US-based digital consultancy with a track record of excellence is looking for: Role: Senior Python / AI Engineer, Hybrid (Mumbai). Experience: 6+ years. Work Location: Mumbai, hybrid (one week in a month). Engagement: Contract to hire (initially 5 months to start). Start date: Immediate. Timing: 2 pm to 11 pm IST. Interview process: 2 rounds (aptitude round + technical round). Job Description: Overall Job Mission: To design, develop, implement, and optimize AI-driven solutions by effectively leveraging and integrating existing Large Language Models and related technologies. Outcomes (What does the person need to achieve?) LLM Integration & Application Development: Successfully integrate existing LLMs (e.g., GPT, LLaMA, Mistral, Claude, Gemini) into Python-based applications to deliver AI-powered features. (e.g., Develop and deploy 3 applications with LLM-driven functionality within the first 6 months with a user satisfaction rating of 4.5/5). Prompt Engineering & Optimization: Design, implement, and rigorously test prompts to maximize the effectiveness and accuracy of existing LLMs for specific application requirements. (e.g., Improve the accuracy of LLM-driven features by 20% through prompt engineering best practices). 
AI Solution Optimization: Optimize the performance, efficiency, and scalability of AI solutions built with LLMs, focusing on factors like response time, cost-effectiveness, and resource utilization. (e.g., Reduce the average response time of LLM-based applications by 15% while maintaining accuracy). Data Handling & Retrieval: Implement effective data processing, including preprocessing and cleaning of text datasets, and utilize vector databases to enable efficient information retrieval for LLM applications. (e.g., Achieve a 90% success rate in retrieving relevant information from vector databases for LLM queries). Deployment & Scalability: Deploy and scale LLM-powered applications on cloud platforms to support a growing user base and ensure high availability. (e.g., Successfully scale LLM applications to handle a 50% increase in user traffic without performance degradation). Competencies (How does the person need to behave?) LLM Application Expertise: Possesses strong skills in integrating and applying existing LLMs through APIs and libraries, with a focus on prompt engineering and application development. Python Development & AI Frameworks: Demonstrates proficiency in Python programming and AI/ML frameworks (Hugging Face, PyTorch, TensorFlow) for building and deploying LLM-based solutions. Problem Solving & Adaptability: Exhibits the ability to solve challenges related to LLM integration, optimize performance, and adapt to the evolving landscape of LLM technologies. Collaboration & Communication: Effectively communicates technical solutions and collaborates with cross-functional teams to deliver impactful AI applications. Results Orientation: Focuses on delivering functional, efficient, and scalable AI solutions that meet business needs and user expectations. Required Skills & Experience- Must-Have Hands-on Experience: Python programming with AI/ML frameworks (Hugging Face, PyTorch, TensorFlow). Hands-on experience working with LLMs and fine-tuning. 
Experience in prompt engineering and optimizing AI model outputs. Building APIs with FastAPI or Flask for AI model integration. Familiarity with vector databases and embedding models. Experience with LangChain, LlamaIndex, or Retrieval-Augmented Generation (RAG). Nice to Have (or Learn on the Job): Knowledge of quantization and related optimization tooling (LoRA, GPTQ, vLLM, ONNX) for efficient model deployment. Experience working with knowledge graphs and reasoning-based AI. Background in MLOps for tracking and managing AI models. How to apply for this opportunity? Step 1: Click on Apply, then register or log in on our portal. Step 2: Complete the screening form and upload your updated resume. Step 3: Increase your chances of getting shortlisted and meet the client for the interview. About Uplers: Our goal is to make hiring reliable, simple, and fast. Our role is to help all our talents find and apply for relevant contractual onsite opportunities and progress in their careers. We will support you with any grievances or challenges you may face during the engagement. (Note: There are many more opportunities on the portal apart from this one. Depending on the assessments you clear, you can apply for them as well.) So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

Posted 1 week ago

8.0 years

0 - 0 Lacs

Guwahati, Assam, India

Remote

Experience: 8+ years. Salary: USD 4074-4814 / month (based on experience). Expected Notice Period: 15 days. Shift: (GMT+05:30) Asia/Kolkata (IST). Opportunity Type: Remote. Placement Type: Full-time contract for 5 months (40 hrs a week / 160 hrs a month). (*Note: This is a requirement for one of Uplers' clients, a leading US-based digital consultancy with a track record of excellence.) What do you need for this opportunity? Must-have skills: FastAPI, Hugging Face, knowledge graphs, MLOps, quantization, TensorFlow, AI, ChatGPT, LLM fine-tuning, RAG (retrieval-augmented generation), vector databases, Python. A leading US-based digital consultancy with a track record of excellence is looking for: Role: Senior Python / AI Engineer, Hybrid (Mumbai). Experience: 6+ years. Work Location: Mumbai, hybrid (one week in a month). Engagement: Contract to hire (initially 5 months to start). Start date: Immediate. Timing: 2 pm to 11 pm IST. Interview process: 2 rounds (aptitude round + technical round). Job Description: Overall Job Mission: To design, develop, implement, and optimize AI-driven solutions by effectively leveraging and integrating existing Large Language Models and related technologies. Outcomes (What does the person need to achieve?) LLM Integration & Application Development: Successfully integrate existing LLMs (e.g., GPT, LLaMA, Mistral, Claude, Gemini) into Python-based applications to deliver AI-powered features. (e.g., Develop and deploy 3 applications with LLM-driven functionality within the first 6 months with a user satisfaction rating of 4.5/5). Prompt Engineering & Optimization: Design, implement, and rigorously test prompts to maximize the effectiveness and accuracy of existing LLMs for specific application requirements. (e.g., Improve the accuracy of LLM-driven features by 20% through prompt engineering best practices). 
AI Solution Optimization: Optimize the performance, efficiency, and scalability of AI solutions built with LLMs, focusing on factors like response time, cost-effectiveness, and resource utilization. (e.g., Reduce the average response time of LLM-based applications by 15% while maintaining accuracy). Data Handling & Retrieval: Implement effective data processing, including preprocessing and cleaning of text datasets, and utilize vector databases to enable efficient information retrieval for LLM applications. (e.g., Achieve a 90% success rate in retrieving relevant information from vector databases for LLM queries). Deployment & Scalability: Deploy and scale LLM-powered applications on cloud platforms to support a growing user base and ensure high availability. (e.g., Successfully scale LLM applications to handle a 50% increase in user traffic without performance degradation). Competencies (How does the person need to behave?) LLM Application Expertise: Possesses strong skills in integrating and applying existing LLMs through APIs and libraries, with a focus on prompt engineering and application development. Python Development & AI Frameworks: Demonstrates proficiency in Python programming and AI/ML frameworks (Hugging Face, PyTorch, TensorFlow) for building and deploying LLM-based solutions. Problem Solving & Adaptability: Exhibits the ability to solve challenges related to LLM integration, optimize performance, and adapt to the evolving landscape of LLM technologies. Collaboration & Communication: Effectively communicates technical solutions and collaborates with cross-functional teams to deliver impactful AI applications. Results Orientation: Focuses on delivering functional, efficient, and scalable AI solutions that meet business needs and user expectations. Required Skills & Experience- Must-Have Hands-on Experience: Python programming with AI/ML frameworks (Hugging Face, PyTorch, TensorFlow). Hands-on experience working with LLMs and fine-tuning. 
Experience in prompt engineering and optimizing AI model outputs. Building APIs with FastAPI or Flask for AI model integration. Familiarity with vector databases and embedding models. Experience with LangChain, LlamaIndex, or Retrieval-Augmented Generation (RAG). Nice to Have (or Learn on the Job): Knowledge of quantization and related optimization tooling (LoRA, GPTQ, vLLM, ONNX) for efficient model deployment. Experience working with knowledge graphs and reasoning-based AI. Background in MLOps for tracking and managing AI models. How to apply for this opportunity? Step 1: Click on Apply, then register or log in on our portal. Step 2: Complete the screening form and upload your updated resume. Step 3: Increase your chances of getting shortlisted and meet the client for the interview. About Uplers: Our goal is to make hiring reliable, simple, and fast. Our role is to help all our talents find and apply for relevant contractual onsite opportunities and progress in their careers. We will support you with any grievances or challenges you may face during the engagement. (Note: There are many more opportunities on the portal apart from this one. Depending on the assessments you clear, you can apply for them as well.) So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

Posted 1 week ago

Featured Companies