Jobs
Interviews

319 Quantization Jobs - Page 6

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

7.0 years

20 - 30 Lacs

Hyderābād

On-site

About the Role We are seeking a visionary and hands-on AI Lead to architect, build, and scale next-generation Generative and Agentic AI systems. In this role, you will drive the end-to-end lifecycle—from research and prototyping to production deployment—guiding a team of AI engineers and collaborating cross-functionally to deliver secure, scalable, and impactful AI solutions across multimodal and LLM-based ecosystems. Key Responsibilities Architect and oversee the development of GenAI and Agentic AI workflows, including multi-agent systems and LLM-based pipelines. Guide AI engineers in best practices for RAG (Retrieval-Augmented Generation), prompt engineering, and agent design. Evaluate and implement the right technology stack: open source (Hugging Face, LangChain, LlamaIndex) vs. closed source (OpenAI, Anthropic, Mistral). Lead fine-tuning and adapter-based training (e.g., LoRA, QLoRA, PEFT). Drive inference optimization using quantization, ONNX, TensorRT, and related tools. Build and refine RAG pipelines using embedding models, vector DBs (FAISS, Qdrant), chunking strategies, and hybrid knowledge graph systems. Manage LLMOps with tools like Weights & Biases, MLflow, and ClearML, ensuring experiment reproducibility and model versioning. Design and implement evaluation frameworks for truthfulness, helpfulness, toxicity, and hallucinations. Integrate guardrails, content filtering, and data privacy best practices into GenAI systems. Lead development of multi-modal AI systems (VLMs, CLIP, LLaVA, video-text fusion models). Oversee synthetic data generation for fine-tuning in low-resource domains. Design APIs and services for Model-as-a-Service (MaaS) and AI agent orchestration. Collaborate with product, cloud, and infrastructure teams to align on deployment, GPU scaling, and cost optimization. Translate cutting-edge AI research into usable product capabilities, from prototyping to production. Mentor and grow the AI team, establishing R&D best practices and benchmarks. Stay up-to-date with emerging trends (arXiv, Papers With Code) to keep the organization ahead of the curve. Required Skills & Expertise AI & ML Foundations: Generative AI, LLMs, Diffusion Models, Agentic AI Systems, Multi-Agent Planning, Prompt Engineering, Feedback Loops, Task Decomposition Ecosystem & Frameworks: Hugging Face, LangChain, OpenAI, Anthropic, Mistral, LLaMA, GPT, Claude, Mixtral, Falcon, etc. Fine-tuning & Inference: LoRA, QLoRA, PEFT, ONNX, TensorRT, DeepSpeed, vLLM Data & Retrieval Systems: FAISS, Qdrant, Chroma, Pinecone, Hybrid RAG + Knowledge Graphs MLOps & Evaluation: Weights & Biases, ClearML, MLflow, Evaluation metrics (truthfulness, helpfulness, hallucination) Security & Governance: Content moderation, data privacy, model alignment, ethical constraints Deployment & Ops: Cloud (AWS, GCP, Azure) with GPU scaling, Serverless LLMs, API-based inference, Docker/Kubernetes Other: Multi-modal AI (images, video, audio), API Design (Swagger/OpenAPI), Research translation and POC delivery Preferred Qualifications 7+ years in AI/ML roles, with at least 2–3 years in a technical leadership capacity Proven experience deploying LLM-powered systems at scale Experience working with cross-functional product and infrastructure teams Contributions to open-source AI projects or published research papers (a plus) Strong communication skills to articulate complex AI concepts to diverse stakeholders Why Join Us? Work at the forefront of AI innovation with opportunities to publish, build, and scale impactful systems Lead a passionate team of engineers and researchers Shape the future of ethical, explainable, and usable AI products Ready to shape the next wave of AI? Apply now and join us on this journey! Job Type: Full-time Pay: ₹2,000,000.01 - ₹3,002,234.14 per year Benefits: Flexible schedule Health insurance Paid time off Provident Fund Schedule: Day shift Monday to Friday Supplemental Pay: Yearly bonus Work Location: In person

Posted 1 month ago

Apply

5.0 years

3 - 7 Lacs

Ahmedabad

On-site

About the Role: Grade Level (for internal use): 10 The Team: The Capital IQ Solutions Data Science team supports the S&P Capital IQ Pro platform with innovative Data Science and Machine Learning solutions, utilizing the most advanced NLP Generative AI models. This role presents a unique opportunity for hands-on ML/NLP/Gen AI/LLM scientists and engineers to advance to the next step in their career journey and apply their technical expertise in NLP, deep learning, Gen AI, and LLMs to drive business value for multiple stakeholders while conducting cutting-edge applied research in LLMs, Gen AI, and related areas. Responsibilities and Impact: Design solutions utilizing NLP models including chat assistants and RAG systems. Design and develop custom NLP LLM Models including both prompt engineering techniques and model fine-tunning and alignment (SFT, RLHF, DPO) NLP Model evaluation using both human-supported and synthetic evaluation methods and metrics. Deploy NLP models ensuring latency, reliability, and scalability. Discover new methods for prompt engineering, model fine-tuning, quantization and latency optimization, document embeddings and chunking. Collaborate closely with product teams, business stakeholders, and engineers to ensure smooth integration of NLP models into production systems. Troubleshoot complex issues related to machine learning model development and data pipelines and develop innovative solutions. Actively research, explore and identify the latest relevant methods and technologies What We’re Looking For : Basic Required Qualifications : Degree in Computer Science, Mathematics or Statistics, Computational linguistics, Engineering, or a related field. Good understanding of machine learning and deep learning methods and their mathematical foundations 5-8 years of professional experience in Advanced Analytics / Data Science / Machine Learning 5-8 years hands-on experience developing NLP models, ideally with transformer architectures. Demonstrated experience with Python, PyTorch, Hugging Face or similar tools. Mastery of Python and ability to write robust and high standard, testable code Knowledge of developing or tuning LLMS Additional Preferred Qualifications : 3+ years of experience with implementing information retrieval systems. Experience with contributing to Open Source initiatives or in research projects and/or participation in Kaggle competitions. Publications related to Machine Learning or Deep Learning Ability to work in a team Able to report progress and summarize issues to a less technical audience Curious and open-minded attitude to new approaches About S&P Global Market Intelligence At S&P Global Market Intelligence, a division of S&P Global we understand the importance of accurate, deep and insightful information. Our team of experts delivers unrivaled insights and leading data and technology solutions, partnering with customers to expand their perspective, operate with confidence, and make decisions with conviction. For more information, visit www.spglobal.com/marketintelligence . What’s In It For You? Our Purpose: Progress is not a self-starter. It requires a catalyst to be set in motion. Information, imagination, people, technology–the right combination can unlock possibility and change the world. Our world is in transition and getting more complex by the day. We push past expected observations and seek out new levels of understanding so that we can help companies, governments and individuals make an impact on tomorrow. At S&P Global we transform data into Essential Intelligence®, pinpointing risks and opening possibilities. We Accelerate Progress. Our People: We're more than 35,000 strong worldwide—so we're able to understand nuances while having a broad perspective. Our team is driven by curiosity and a shared belief that Essential Intelligence can help build a more prosperous future for us all. From finding new ways to measure sustainability to analyzing energy transition across the supply chain to building workflow solutions that make it easy to tap into insight and apply it. We are changing the way people see things and empowering them to make an impact on the world we live in. We’re committed to a more equitable future and to helping our customers find new, sustainable ways of doing business. We’re constantly seeking new solutions that have progress in mind. Join us and help create the critical insights that truly make a difference. Our Values: Integrity, Discovery, Partnership At S&P Global, we focus on Powering Global Markets. Throughout our history, the world's leading organizations have relied on us for the Essential Intelligence they need to make confident decisions about the road ahead. We start with a foundation of integrity in all we do, bring a spirit of discovery to our work, and collaborate in close partnership with each other and our customers to achieve shared goals. Benefits: We take care of you, so you can take care of business. We care about our people. That’s why we provide everything you—and your career—need to thrive at S&P Global. Our benefits include: Health & Wellness: Health care coverage designed for the mind and body. Flexible Downtime: Generous time off helps keep you energized for your time on. Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills. Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs. Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families. Beyond the Basics: From retail discounts to referral incentive awards—small perks can make a big difference. For more information on benefits by country visit: https://spgbenefits.com/benefit-summaries Global Hiring and Opportunity at S&P Global: At S&P Global, we are committed to fostering a connected and engaged workplace where all individuals have access to opportunities based on their skills, experience, and contributions. Our hiring practices emphasize fairness, transparency, and merit, ensuring that we attract and retain top talent. By valuing different perspectives and promoting a culture of respect and collaboration, we drive innovation and power global markets. Recruitment Fraud Alert: If you receive an email from a spglobalind.com domain or any other regionally based domains, it is a scam and should be reported to reportfraud@spglobal.com . S&P Global never requires any candidate to pay money for job applications, interviews, offer letters, “pre-employment training” or for equipment/delivery of equipment. Stay informed and protect yourself from recruitment fraud by reviewing our guidelines, fraudulent domains, and how to report suspicious activity here . ----------------------------------------------------------- Equal Opportunity Employer S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment. If you need an accommodation during the application process due to a disability, please send an email to: EEO.Compliance@spglobal.com and your request will be forwarded to the appropriate person. US Candidates Only: The EEO is the Law Poster http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf describes discrimination protections under federal law. Pay Transparency Nondiscrimination Provision - https://www.dol.gov/sites/dolgov/files/ofccp/pdf/pay-transp_%20English_formattedESQA508c.pdf ----------------------------------------------------------- 20 - Professional (EEO-2 Job Categories-United States of America), IFTECH202.1 - Middle Professional Tier I (EEO Job Group), SWP Priority – Ratings - (Strategic Workforce Planning) Job ID: 317453 Posted On: 2025-06-30 Location: Ahmedabad, Gujarat, India

Posted 1 month ago

Apply

6.0 years

0 Lacs

Bengaluru East, Karnataka, India

On-site

Organization: At CommBank, we never lose sight of the role we play in other people’s financial wellbeing. Our focus is to help people and businesses move forward to progress. To make the right financial decisions and achieve their dreams, targets, and aspirations. Regardless of where you work within our organisation, your initiative, talent, ideas, and energy all contribute to the impact that we can make with our work. Together we can achieve great things. Job Title: Data Scientist Location: Bangalore Business & Team: BB Advanced Analytics and Artificial Intelligence COE Impact & contribution: As a Senior Data Scientist, you will be instrumental in pioneering Gen AI and multi-agentic systems at scale within CommBank. You will architect, build, and operationalize advanced generative AI solutions—leveraging large language models (LLMs), collaborative agentic frameworks, and state-of-the-art toolchains. You will drive innovation, helping set the organizational strategy for advanced AI, multi-agent collaboration, and responsible next-gen model deployment. Roles & Responsibilities: Gen AI Solution Development: Lead end-to-end development, fine-tuning, and evaluation of state-of-the-art LLMs and multi-modal generative models (e.g., transformers, GANs, VAEs, Diffusion Models) tailored for financial domains. Multi-Agentic System Engineering: Architect, implement, and optimize multi-agent systems, enabling swarms of AI agents (utilizing frameworks like Lang chain, Lang graph, and MCP) to dynamically collaborate, chain, reason, critique, and autonomously execute tasks. LLM-Backed Application Design: Develop robust, scalable GenAI-powered APIs and agent workflows using Fast API, Semantic Kernel, and orchestration tools. Integrate observability and evaluation using Lang fuse for tracing, analytics, and prompt/response feedback loops. Guardrails & Responsible AI: Employ frameworks like Guardrails AI to enforce robust safety, compliance, and reliability in LLM deployments. Establish programmatic checks for prompt injections, hallucinations, and output boundaries. Enterprise-Grade Deployment: Productionize and manage at-scale Gen AI and agent systems with cloud infrastructure (GCP/AWS/Azure), utilizing model optimization (quantization, pruning, knowledge distillation) for latency/throughput trade offs. Toolchain Innovation: Leverage and contribute to open source projects in the Gen AI ecosystem (e.g., Lang Chain, Lang Graph, Semantic Kernel, Lang fuse, Hugging face, Fast API). Continuously experiment with emerging frameworks and research. Stakeholder Collaboration: Partner with product, engineering, and business teams to define high-impact use cases for Gen AI and agentic automation; communicate actionable technical strategies and drive proof-of-value experiments into production. Mentorship & Thought Leadership: Guide junior team members in best practices for Gen AI, prompt engineering, agentic orchestration, responsible deployment, and continuous learning. Represent CommBank in the broader AI community through papers, patents, talks, and open-source. Essential Skills: 6+ years of hands-on experience in Machine Learning, Deep Learning, or Generative AI domains, including practical expertise with LLMs, multi-agent frameworks, and prompt engineering. Proficient in building and scaling multi-agent AI systems using Lang Chain, Lang Graph, Semantic Kernel, MCP, or similar agentic orchestration tools. Advanced experience developing and deploying Gen AI APIs using Fast API; operational familiarity with Lang fuse for LLM evaluation, tracing, and error analytics. Demonstrated ability to apply Guardrails to enforce model safety, explainability, and compliance in production environments. Experience with transformer architectures (BERT/GPT, etc.), fine-tuning LLMs, and model optimization (distillation/quantization/pruning). Strong software engineering background (Python), with experience in enterprise-grade codebases and cloud-native AI deployments. Experience integrating open and commercial LLM APIs and building retrieval-augmented generation (RAG) pipelines. Exposure to agent-based reinforcement learning, agent simulation, and swarm-based collaborative AI. Familiarity with robust experimentation using tools like Lang Smith, GitHub Copilot, and experiment tracking systems. Proven track record of driving Gen AI innovation and adoption in cross-functional teams. Papers, patents, or open-source contributions to the Gen AI/LLM/Agentic AI ecosystem. Experience with financial services or regulated industries for secure and responsible deployment of AI. Education Qualifications: Bachelor’s or Master’s degree in Computer Science, Engineering, Information Technology. If you're already part of the Commonwealth Bank Group (including Bankwest, x15ventures), you'll need to apply through Sidekick to submit a valid application. We’re keen to support you with the next step in your career. We're aware of some accessibility issues on this site, particularly for screen reader users. We want to make finding your dream job as easy as possible, so if you require additional support please contact HR Direct on 1800 989 696. Advertising End Date: 01/07/2025

Posted 1 month ago

Apply

5.0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

About The Role Grade Level (for internal use): 10 The Team The Capital IQ Solutions Data Science team supports the S&P Capital IQ Pro platform with innovative Data Science and Machine Learning solutions, utilizing the most advanced NLP Generative AI models. This role presents a unique opportunity for hands-on ML/NLP/Gen AI/LLM scientists and engineers to advance to the next step in their career journey and apply their technical expertise in NLP, deep learning, Gen AI, and LLMs to drive business value for multiple stakeholders while conducting cutting-edge applied research in LLMs, Gen AI, and related areas. Responsibilities And Impact Design solutions utilizing NLP models including chat assistants and RAG systems. Design and develop custom NLP LLM Models including both prompt engineering techniques and model fine-tunning and alignment (SFT, RLHF, DPO) NLP Model evaluation using both human-supported and synthetic evaluation methods and metrics. Deploy NLP models ensuring latency, reliability, and scalability. Discover new methods for prompt engineering, model fine-tuning, quantization and latency optimization, document embeddings and chunking. Collaborate closely with product teams, business stakeholders, and engineers to ensure smooth integration of NLP models into production systems. Troubleshoot complex issues related to machine learning model development and data pipelines and develop innovative solutions. Actively research, explore and identify the latest relevant methods and technologies What We’re Looking For Basic Required Qualifications : Degree in Computer Science, Mathematics or Statistics, Computational linguistics, Engineering, or a related field. Good understanding of machine learning and deep learning methods and their mathematical foundations 5-8 years of professional experience in Advanced Analytics / Data Science / Machine Learning 5-8 years hands-on experience developing NLP models, ideally with transformer architectures. Demonstrated experience with Python, PyTorch, Hugging Face or similar tools. Mastery of Python and ability to write robust and high standard, testable code Knowledge of developing or tuning LLMS Additional Preferred Qualifications 3+ years of experience with implementing information retrieval systems. Experience with contributing to Open Source initiatives or in research projects and/or participation in Kaggle competitions. Publications related to Machine Learning or Deep Learning Ability to work in a team Able to report progress and summarize issues to a less technical audience Curious and open-minded attitude to new approaches About S&P Global Market Intelligence At S&P Global Market Intelligence, a division of S&P Global we understand the importance of accurate, deep and insightful information. Our team of experts delivers unrivaled insights and leading data and technology solutions, partnering with customers to expand their perspective, operate with confidence, and make decisions with conviction. For more information, visit www.spglobal.com/marketintelligence. What’s In It For You? Our Purpose Progress is not a self-starter. It requires a catalyst to be set in motion. Information, imagination, people, technology–the right combination can unlock possibility and change the world. Our world is in transition and getting more complex by the day. We push past expected observations and seek out new levels of understanding so that we can help companies, governments and individuals make an impact on tomorrow. At S&P Global we transform data into Essential Intelligence®, pinpointing risks and opening possibilities. We Accelerate Progress. Our People We're more than 35,000 strong worldwide—so we're able to understand nuances while having a broad perspective. Our team is driven by curiosity and a shared belief that Essential Intelligence can help build a more prosperous future for us all. From finding new ways to measure sustainability to analyzing energy transition across the supply chain to building workflow solutions that make it easy to tap into insight and apply it. We are changing the way people see things and empowering them to make an impact on the world we live in. We’re committed to a more equitable future and to helping our customers find new, sustainable ways of doing business. We’re constantly seeking new solutions that have progress in mind. Join us and help create the critical insights that truly make a difference. Our Values Integrity, Discovery, Partnership At S&P Global, we focus on Powering Global Markets. Throughout our history, the world's leading organizations have relied on us for the Essential Intelligence they need to make confident decisions about the road ahead. We start with a foundation of integrity in all we do, bring a spirit of discovery to our work, and collaborate in close partnership with each other and our customers to achieve shared goals. Benefits We take care of you, so you can take care of business. We care about our people. That’s why we provide everything you—and your career—need to thrive at S&P Global. Our Benefits Include Health & Wellness: Health care coverage designed for the mind and body. Flexible Downtime: Generous time off helps keep you energized for your time on. Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills. Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs. Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families. Beyond the Basics: From retail discounts to referral incentive awards—small perks can make a big difference. For more information on benefits by country visit: https://spgbenefits.com/benefit-summaries Global Hiring And Opportunity At S&P Global At S&P Global, we are committed to fostering a connected and engaged workplace where all individuals have access to opportunities based on their skills, experience, and contributions. Our hiring practices emphasize fairness, transparency, and merit, ensuring that we attract and retain top talent. By valuing different perspectives and promoting a culture of respect and collaboration, we drive innovation and power global markets. Recruitment Fraud Alert If you receive an email from a spglobalind.com domain or any other regionally based domains, it is a scam and should be reported to reportfraud@spglobal.com. S&P Global never requires any candidate to pay money for job applications, interviews, offer letters, “pre-employment training” or for equipment/delivery of equipment. Stay informed and protect yourself from recruitment fraud by reviewing our guidelines, fraudulent domains, and how to report suspicious activity here. Equal Opportunity Employer S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment. If you need an accommodation during the application process due to a disability, please send an email to: EEO.Compliance@spglobal.com and your request will be forwarded to the appropriate person. US Candidates Only: The EEO is the Law Poster http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf describes discrimination protections under federal law. Pay Transparency Nondiscrimination Provision - https://www.dol.gov/sites/dolgov/files/ofccp/pdf/pay-transp_%20English_formattedESQA508c.pdf 20 - Professional (EEO-2 Job Categories-United States of America), IFTECH202.1 - Middle Professional Tier I (EEO Job Group), SWP Priority – Ratings - (Strategic Workforce Planning) Job ID: 317453 Posted On: 2025-06-30 Location: Ahmedabad, Gujarat, India

Posted 1 month ago

Apply

2.0 years

0 Lacs

India

Remote

Senior Machine Learning Engineer (AI-Powered Software Platform for Hidden Physical-Threat Detection & Real-Time Intelligence) About the Company: Aerobotics7 (A7) is a mission-driven deep-tech startup focused on developing a UAV-based next-gen sensing and advanced AI platform to detect, identify, and mitigate hidden threats like landmines, UXOs, and IEDs in real-time. We are embarking on a rapid development phase, creating innovative solutions leveraging cutting-edge technologies. Our dynamic team is committed to building impactful products through continuous learning, and close cross-collaboration. Position Overview: We are seeking a Senior Machine Learning Engineer with a strong research orientation to join our team. This role will focus on developing and refining proprietary machine learning models for drone-based landmine detection and mitigation. The ideal candidate will design, develop, and optimize advanced ML workflows with an emphasis on rigorous research, novel model development, and experimental validation in deep learning, multi-modal/sensor fusion and computer vision applications. Key Responsibilities: Lead the end-to-end AI model development process, including research, experimentation, design, and implementation. Architect, train, and deploy deep learning models on cloud (GCP) and edge devices, ensuring real-time performance. Develop and optimize multi-modal ML/DL models integrating multiple sensor inputs. Implement and fine-tune CNNs, Vision Transformers (ViTs), and other deep-learning architectures. Design and improve sensor fusion techniques for enhanced perception and decision-making. Optimize AI inference for low-latency and high-efficiency deployment on production. Cross-collaborate with software and hardware teams to integrate AI solutions into mission-critical applications. Develop scalable pipelines for model training, validation, and continuous improvement. Ensure robustness, interpretability, and security of AI models in deployment. Required Skills: • Strong expertise in deep learning frameworks (TensorFlow, PyTorch). • Experience with CNNs, ViTs, and other DL architectures. • Hands-on experience in multi-modal ML and sensor fusion techniques. • Proficiency in cloud-based AI model deployment (GCP experience preferred). • Experience with edge AI optimization (NVIDIA Jetson, TensorRT, OpenVINO). • Strong knowledge of data preprocessing, augmentation, and synthetic data generation. • Proficiency in model quantization, pruning, and optimization for real-time applications. • Familiarity with computer vision, object detection, and real-time inference techniques. • Ability to work with limited datasets, including generating synthetic data (VAEs or s similar), data annotation and augmentation strategies. • Strong coding skills in Python and C++ with experience in high-performance computing. Preferred Qualifications: • Experience: 2-4+ Years. • Experience with MLOps, including CI/CD pipelines, model versioning, and monitoring. • Knowledge of reinforcement learning techniques. • Experience in working in fast-paced startup environments. • Prior experience working on AI-driven autonomous systems, robotics, or UAVs. • Understanding of embedded systems and hardware acceleration for AI workloads. Benefits: NOTE: THIS ROLE IS UNDER AEROBOTICS7 INVENTIONS PVT. LTD., AN INDIAN ENTITY. IT IS A REMOTE INDIA-BASED ROLE WITH COMPENSATION ALIGNED TO INDIAN MARKET STANDARDS. WHILE OUR PARENT COMPANY IS US-BASED, THIS POSITION IS FOR CANDIDATES RESIDING AND WORKING IN INDIA. Competitive startup-level salary and comprehensive benefits package. Future opportunity for equity options in the company. Opportunity to work on impactful, cutting-edge technology in a collaborative startup environment. Professional growth with extensive learning and career development opportunities. Direct contribution to tangible, real-world impact. How to Apply: Interested candidates are encouraged to submit their resume along with an (optional) cover letter highlighting their relevant experience and passion for working in a dynamic startup environment. For any questions or further information, feel free to reach out to us directly by emailing us at careers@aerobotics7.com.

Posted 1 month ago

Apply

0.0 - 8.0 years

0 Lacs

Ahmedabad, Gujarat

On-site

About the Role: Grade Level (for internal use): 10 The Team: The Capital IQ Solutions Data Science team supports the S&P Capital IQ Pro platform with innovative Data Science and Machine Learning solutions, utilizing the most advanced NLP Generative AI models. This role presents a unique opportunity for hands-on ML/NLP/Gen AI/LLM scientists and engineers to advance to the next step in their career journey and apply their technical expertise in NLP, deep learning, Gen AI, and LLMs to drive business value for multiple stakeholders while conducting cutting-edge applied research in LLMs, Gen AI, and related areas. Responsibilities and Impact: Design solutions utilizing NLP models including chat assistants and RAG systems. Design and develop custom NLP LLM Models including both prompt engineering techniques and model fine-tunning and alignment (SFT, RLHF, DPO) NLP Model evaluation using both human-supported and synthetic evaluation methods and metrics. Deploy NLP models ensuring latency, reliability, and scalability. Discover new methods for prompt engineering, model fine-tuning, quantization and latency optimization, document embeddings and chunking. Collaborate closely with product teams, business stakeholders, and engineers to ensure smooth integration of NLP models into production systems. Troubleshoot complex issues related to machine learning model development and data pipelines and develop innovative solutions. Actively research, explore and identify the latest relevant methods and technologies What We’re Looking For : Basic Required Qualifications : Degree in Computer Science, Mathematics or Statistics, Computational linguistics, Engineering, or a related field. Good understanding of machine learning and deep learning methods and their mathematical foundations 5-8 years of professional experience in Advanced Analytics / Data Science / Machine Learning 5-8 years hands-on experience developing NLP models, ideally with transformer architectures. Demonstrated experience with Python, PyTorch, Hugging Face or similar tools. Mastery of Python and ability to write robust and high standard, testable code Knowledge of developing or tuning LLMS Additional Preferred Qualifications : 3+ years of experience with implementing information retrieval systems. Experience with contributing to Open Source initiatives or in research projects and/or participation in Kaggle competitions. Publications related to Machine Learning or Deep Learning Ability to work in a team Able to report progress and summarize issues to a less technical audience Curious and open-minded attitude to new approaches About S&P Global Market Intelligence At S&P Global Market Intelligence, a division of S&P Global we understand the importance of accurate, deep and insightful information. Our team of experts delivers unrivaled insights and leading data and technology solutions, partnering with customers to expand their perspective, operate with confidence, and make decisions with conviction. For more information, visit www.spglobal.com/marketintelligence . What’s In It For You? Our Purpose: Progress is not a self-starter. It requires a catalyst to be set in motion. Information, imagination, people, technology–the right combination can unlock possibility and change the world. Our world is in transition and getting more complex by the day. We push past expected observations and seek out new levels of understanding so that we can help companies, governments and individuals make an impact on tomorrow. At S&P Global we transform data into Essential Intelligence®, pinpointing risks and opening possibilities. We Accelerate Progress. Our People: We're more than 35,000 strong worldwide—so we're able to understand nuances while having a broad perspective. Our team is driven by curiosity and a shared belief that Essential Intelligence can help build a more prosperous future for us all. From finding new ways to measure sustainability to analyzing energy transition across the supply chain to building workflow solutions that make it easy to tap into insight and apply it. We are changing the way people see things and empowering them to make an impact on the world we live in. We’re committed to a more equitable future and to helping our customers find new, sustainable ways of doing business. We’re constantly seeking new solutions that have progress in mind. Join us and help create the critical insights that truly make a difference. Our Values: Integrity, Discovery, Partnership At S&P Global, we focus on Powering Global Markets. Throughout our history, the world's leading organizations have relied on us for the Essential Intelligence they need to make confident decisions about the road ahead. We start with a foundation of integrity in all we do, bring a spirit of discovery to our work, and collaborate in close partnership with each other and our customers to achieve shared goals. Benefits: We take care of you, so you can take care of business. We care about our people. That’s why we provide everything you—and your career—need to thrive at S&P Global. Our benefits include: Health & Wellness: Health care coverage designed for the mind and body. Flexible Downtime: Generous time off helps keep you energized for your time on. Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills. Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs. Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families. Beyond the Basics: From retail discounts to referral incentive awards—small perks can make a big difference. For more information on benefits by country visit: https://spgbenefits.com/benefit-summaries Global Hiring and Opportunity at S&P Global: At S&P Global, we are committed to fostering a connected and engaged workplace where all individuals have access to opportunities based on their skills, experience, and contributions. Our hiring practices emphasize fairness, transparency, and merit, ensuring that we attract and retain top talent. By valuing different perspectives and promoting a culture of respect and collaboration, we drive innovation and power global markets. Recruitment Fraud Alert: If you receive an email from a spglobalind.com domain or any other regionally based domains, it is a scam and should be reported to reportfraud@spglobal.com . S&P Global never requires any candidate to pay money for job applications, interviews, offer letters, “pre-employment training” or for equipment/delivery of equipment. Stay informed and protect yourself from recruitment fraud by reviewing our guidelines, fraudulent domains, and how to report suspicious activity here . ----------------------------------------------------------- Equal Opportunity Employer S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment. If you need an accommodation during the application process due to a disability, please send an email to: EEO.Compliance@spglobal.com and your request will be forwarded to the appropriate person. US Candidates Only: The EEO is the Law Poster http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf describes discrimination protections under federal law. Pay Transparency Nondiscrimination Provision - https://www.dol.gov/sites/dolgov/files/ofccp/pdf/pay-transp_%20English_formattedESQA508c.pdf ----------------------------------------------------------- 20 - Professional (EEO-2 Job Categories-United States of America), IFTECH202.1 - Middle Professional Tier I (EEO Job Group), SWP Priority – Ratings - (Strategic Workforce Planning) Job ID: 317453 Posted On: 2025-06-30 Location: Ahmedabad, Gujarat, India

Posted 1 month ago

Apply

0.0 - 8.0 years

0 Lacs

Ahmedabad, Gujarat

On-site

Senior Data Scientist Ahmedabad, India; Bangalore, India Information Technology 317453 Job Description About The Role: Grade Level (for internal use): 10 The Team: The Capital IQ Solutions Data Science team supports the S&P Capital IQ Pro platform with innovative Data Science and Machine Learning solutions, utilizing the most advanced NLP Generative AI models. This role presents a unique opportunity for hands-on ML/NLP/Gen AI/LLM scientists and engineers to advance to the next step in their career journey and apply their technical expertise in NLP, deep learning, Gen AI, and LLMs to drive business value for multiple stakeholders while conducting cutting-edge applied research in LLMs, Gen AI, and related areas. Responsibilities and Impact: Design solutions utilizing NLP models including chat assistants and RAG systems. Design and develop custom NLP LLM Models including both prompt engineering techniques and model fine-tunning and alignment (SFT, RLHF, DPO) NLP Model evaluation using both human-supported and synthetic evaluation methods and metrics. Deploy NLP models ensuring latency, reliability, and scalability. Discover new methods for prompt engineering, model fine-tuning, quantization and latency optimization, document embeddings and chunking. Collaborate closely with product teams, business stakeholders, and engineers to ensure smooth integration of NLP models into production systems. Troubleshoot complex issues related to machine learning model development and data pipelines and develop innovative solutions. Actively research, explore and identify the latest relevant methods and technologies What We’re Looking For : Basic Required Qualifications : Degree in Computer Science, Mathematics or Statistics, Computational linguistics, Engineering, or a related field. Good understanding of machine learning and deep learning methods and their mathematical foundations 5-8 years of professional experience in Advanced Analytics / Data Science / Machine Learning 5-8 years hands-on experience developing NLP models, ideally with transformer architectures. Demonstrated experience with Python, PyTorch, Hugging Face or similar tools. Mastery of Python and ability to write robust and high standard, testable code Knowledge of developing or tuning LLMS Additional Preferred Qualifications : 3+ years of experience with implementing information retrieval systems. Experience with contributing to Open Source initiatives or in research projects and/or participation in Kaggle competitions. Publications related to Machine Learning or Deep Learning Ability to work in a team Able to report progress and summarize issues to a less technical audience Curious and open-minded attitude to new approaches About S&P Global Market Intelligence At S&P Global Market Intelligence, a division of S&P Global we understand the importance of accurate, deep and insightful information. Our team of experts delivers unrivaled insights and leading data and technology solutions, partnering with customers to expand their perspective, operate with confidence, and make decisions with conviction. For more information, visit www.spglobal.com/marketintelligence. What’s In It For You? Our Purpose: Progress is not a self-starter. It requires a catalyst to be set in motion. Information, imagination, people, technology–the right combination can unlock possibility and change the world. Our world is in transition and getting more complex by the day. We push past expected observations and seek out new levels of understanding so that we can help companies, governments and individuals make an impact on tomorrow. At S&P Global we transform data into Essential Intelligence®, pinpointing risks and opening possibilities. We Accelerate Progress. Our People: We're more than 35,000 strong worldwide—so we're able to understand nuances while having a broad perspective. Our team is driven by curiosity and a shared belief that Essential Intelligence can help build a more prosperous future for us all. From finding new ways to measure sustainability to analyzing energy transition across the supply chain to building workflow solutions that make it easy to tap into insight and apply it. We are changing the way people see things and empowering them to make an impact on the world we live in. We’re committed to a more equitable future and to helping our customers find new, sustainable ways of doing business. We’re constantly seeking new solutions that have progress in mind. Join us and help create the critical insights that truly make a difference. Our Values: Integrity, Discovery, Partnership At S&P Global, we focus on Powering Global Markets. Throughout our history, the world's leading organizations have relied on us for the Essential Intelligence they need to make confident decisions about the road ahead. We start with a foundation of integrity in all we do, bring a spirit of discovery to our work, and collaborate in close partnership with each other and our customers to achieve shared goals. Benefits: We take care of you, so you can take care of business. We care about our people. That’s why we provide everything you—and your career—need to thrive at S&P Global. Our benefits include: Health & Wellness: Health care coverage designed for the mind and body. Flexible Downtime: Generous time off helps keep you energized for your time on. Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills. Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs. Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families. Beyond the Basics: From retail discounts to referral incentive awards—small perks can make a big difference. For more information on benefits by country visit: https://spgbenefits.com/benefit-summaries Global Hiring and Opportunity at S&P Global: At S&P Global, we are committed to fostering a connected and engaged workplace where all individuals have access to opportunities based on their skills, experience, and contributions. Our hiring practices emphasize fairness, transparency, and merit, ensuring that we attract and retain top talent. By valuing different perspectives and promoting a culture of respect and collaboration, we drive innovation and power global markets. Recruitment Fraud Alert: If you receive an email from a spglobalind.com domain or any other regionally based domains, it is a scam and should be reported to reportfraud@spglobal.com. S&P Global never requires any candidate to pay money for job applications, interviews, offer letters, “pre-employment training” or for equipment/delivery of equipment. Stay informed and protect yourself from recruitment fraud by reviewing our guidelines, fraudulent domains, and how to report suspicious activity here. - Equal Opportunity Employer S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment. If you need an accommodation during the application process due to a disability, please send an email to: EEO.Compliance@spglobal.com and your request will be forwarded to the appropriate person. US Candidates Only: The EEO is the Law Poster http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf describes discrimination protections under federal law. Pay Transparency Nondiscrimination Provision - https://www.dol.gov/sites/dolgov/files/ofccp/pdf/pay-transp_%20English_formattedESQA508c.pdf - 20 - Professional (EEO-2 Job Categories-United States of America), IFTECH202.1 - Middle Professional Tier I (EEO Job Group), SWP Priority – Ratings - (Strategic Workforce Planning) Job ID: 317453 Posted On: 2025-06-30 Location: Ahmedabad, Gujarat, India

Posted 1 month ago

Apply

2.0 years

0 Lacs

Hyderabad, Telangana

On-site

Hyderabad, Telangana, India Job Type Full Time About the Role About the Role We are looking for a hands-on and technically proficient Embedded Software Team Lead to drive the development of intelligent edge systems that combine embedded firmware, machine learning inference, and hardware acceleration. This role is perfect for someone who thrives at the intersection of real-time firmware design, AI model deployment, and hardware-software co-optimization. You will lead a team delivering modular, scalable, and efficient firmware pipelines that run quantized ML models on accelerators like Hailo, Coral, Torrent (BlackHole), Kendryte, and other emerging chipsets. Your focus will include model runtime integration, low-latency sensor processing, OTA-ready firmware stacks, and CI/CD pipelines for embedded products at scale Requirements Key Responsibilities Technical Leadership & Planning Own the firmware lifecycle across multiple AI-based embedded product lines. Define system and software architecture in collaboration with hardware, ML, and cloud teams. Lead sprint planning, code reviews, performance debugging, and mentor junior engineers. ️ ML Model Deployment & Runtime Integration Collaborate with ML engineers to port, quantize, and deploy models using TFLite , ONNX , or HailoRT . Build runtime pipelines that connect model inference with real-time sensor data (vision, IMU, acoustic). Optimize memory and compute flows for edge model execution under power/bandwidth constraints. Firmware Development & Validation Build production-grade embedded stacks using RTOS (FreeRTOS/Zephyr) or embedded Linux . Implement secure bootloaders, OTA update mechanisms, and encrypted firmware interfaces. Interface with a variety of peripherals including cameras, IMUs, analog sensors, and radios (BLE/Wi-Fi/LoRa). ️ CI/CD, DevOps & Tooling for Embedded Set up and manage CI/CD pipelines for firmware builds, static analysis, and validation. Integrate Docker-based toolchains, hardware-in-loop (HIL) testing setups, and simulators/emulators. Ensure codebase quality, maintainability, and test coverage across the embedded stack. Required Qualifications ‍ Education: BE/B.Tech/M.Tech in Embedded Systems, Electronics, Computer Engineering, or related fields. Experience: Minimum 4+ years of embedded systems experience. Minimum 2 years in a technical lead or architect role. Hands-on experience in ML model runtime optimization and embedded system integration. Technical Skills Required Embedded Development & Tools Expert-level C/C++ , hands-on with RTOS and Yocto-based Linux . Proficient with toolchains like GCC/Clang, OpenOCD, JTAG/SWD, Logic Analyzers. Familiarity with OTA , bootloaders , and memory management (heap/stack analysis, linker scripts). ML Model Integration Proficiency in TFLite , ONNX Runtime , HailoRT , or EdgeTPU runtimes . Experience with model conversion, quantization (INT8, FP16), runtime optimization. Ability to read/modify model graphs and connect to inference APIs. Connectivity & Peripherals Working knowledge of BLE, Wi-Fi, LoRa, RS485 , USB, and CAN protocols. Integration of camera modules , MIPI CSI , IMUs , and custom analog sensors . ️ DevOps for Embedded Hands-on with GitLab/GitHub CI, Docker, and containerized embedded builds. Build system expertise: CMake , Make , Bazel , or Yocto preferred. Experience in automated firmware testing (HIL, unit, integration). Preferred (Bonus) Skills Familiarity with machine vision pipelines , ISP tuning , or video/audio codec integration . Prior work on battery-operated devices , energy-aware scheduling , or deep sleep optimization . Contributions to embedded ML open-source projects or model deployment tools. Why Join Us? At EURTH TECHTRONICS PVT LTD , we go beyond firmware—we’re designing and deploying embedded intelligence on every device, from industrial gateways to smart consumer wearables. Build and lead teams working on cutting-edge real-time firmware + ML integration . Work on full-stack embedded ML systems using the latest AI accelerators and embedded chipsets . Drive product-ready, scalable software platforms that power IoT, defense, medical , and consumer electronics . How to Apply Send your updated resume + GitHub/portfolio links to: jobs@eurthtech.com About the Company About EURTH TECHTRONICS PVT LTD EURTH TECHTRONICS PVT LTD is a cutting-edge Electronics Product Design and Engineering firm specializing in embedded systems, IoT solutions, and high-performance hardware development. We provide end-to-end product development services—from PCB design, firmware development, and system architecture to manufacturing and scalable deployment. With deep expertise in embedded software, signal processing, AI-driven edge computing, RF communication, and ultra-low-power design, we build next-generation industrial automation, consumer electronics, and smart infrastructure solutions. Our Core Capabilities Embedded Systems & Firmware Engineering – Architecting robust, real-time embedded solutions with RTOS, Linux, and MCU/SoC-based firmware. IoT & Wireless Technologies – Developing LoRa, BLE, Wi-Fi, UWB, and 5G-based connected solutions for industrial and smart city applications. Hardware & PCB Design – High-performance PCB layout, signal integrity optimization, and design for manufacturing (DFM/DFA). Product Prototyping & Manufacturing – Accelerating concept-to-market with rapid prototyping, design validation, and scalable production. AI & Edge Computing – Implementing real-time AI/ML on embedded devices for predictive analytics, automation, and security. Security & Cryptography – Integrating post-quantum cryptography, secure boot, and encrypted firmware updates. Our Industry Impact ✅ IoT & Smart Devices – Powering the next wave of connected solutions for industrial automation, logistics, and smart infrastructure. ✅ Medical & Wearable Tech – Designing low-power biomedical devices with precision sensor fusion and embedded intelligence. ✅ Automotive & Industrial Automation – Developing AI-enhanced control systems, predictive maintenance tools, and real-time monitoring solutions. ✅ Scalable Enterprise & B2B Solutions – Delivering custom embedded hardware and software tailored to OEMs, manufacturers, and system integrators. Our Vision We are committed to advancing technology and innovation in embedded product design. With a focus on scalability, security, and efficiency, we empower businesses with intelligent, connected, and future-ready solutions. We currently cater to B2B markets, offering customized embedded development services, with a roadmap to expand into direct-to-consumer (B2C) solutions.

Posted 1 month ago

Apply

0.0 - 4.0 years

0 Lacs

Hyderabad, Telangana

On-site

Hyderabad, Telangana, India Job Type Full Time About the Role About the Role We are seeking a passionate and skilled Embedded ML Engineer to work on cutting-edge ML inference pipelines for low-power, real-time embedded platforms. You will help design and deploy highly efficient ML models on custom hardware accelerators like Hailo, Coral (Edge TPU), Kendryte K210, and Torrent/BlackHole in real-world IoT systems. This role combines model optimization, embedded firmware development, and toolchain management. You will be responsible for translating large ML models into efficient quantized versions, benchmarking them on custom hardware, and integrating them with embedded firmware pipelines that interact with real-world sensors and peripherals. Requirements Key Responsibilities ML Model Optimization & Conversion Convert, quantize, and compile models built in TensorFlow, PyTorch , or ONNX to hardware-specific formats. Work with compilers and deployment frameworks like TFLite , HailoRT , EdgeTPU Compiler , TVM , or ONNX Runtime . Use techniques such as post-training quantization , pruning , distillation , and model slicing . ️ Embedded Integration & Inference Deployment Integrate ML runtimes in C/C++ or Python into firmware stacks built on RTOS or embedded Linux . Handle real-time sensor inputs (camera, accelerometer, microphone) and pass them through inference engines. Manage memory, DMA transfers, inference buffers, and timing loops for deterministic behavior. Benchmarking & Performance Tuning Profile and optimize models for latency, memory usage, compute load , and power draw . Work with runtime logs, inference profilers, and vendor SDKs to squeeze maximum throughput on edge hardware. Conduct accuracy vs performance trade-off studies for different model variants. Testing & Validation Design unit, integration, and hardware-in-loop (HIL) tests to validate model execution on actual devices. Collaborate with hardware and firmware teams to debug runtime crashes, inference failures, and edge cases. Build reproducible benchmarking scripts and test data pipelines. Required Qualifications ‍ Education: BE/B.Tech/M.Tech in Electronics, Embedded Systems, Computer Science, or related disciplines. Experience: 2–4 years in embedded ML, edge AI, or firmware development with ML inference integration. Technical Skills Required Embedded Firmware & Runtime Strong experience in C/C++ , basic Python scripting. Experience with RTOS (FreeRTOS, Zephyr) or embedded Linux. Understanding of memory-mapped I/O, ring buffers, circular queues, and real-time execution cycles. ML Model Toolchains Experience with TensorFlow Lite , ONNX Runtime , HailoRT , EdgeTPU , uTensor , or TinyML . Knowledge of quantization-aware training or post-training quantization techniques. Familiarity with model conversion pipelines and hardware-aware model profiling. Media & Sensor Stack Ability to work with input/output streams from cameras , IMUs , microphones , etc. Experience integrating inference with V4L2, GStreamer, or custom ISP preprocessors is a plus. Tooling & Debugging Git, Docker, cross-compilation toolchains (Yocto, CMake). Debugging with SWD/JTAG, GDB, or serial console-based logging. Profiling with memory maps, timing charts, and inference logs. Preferred (Bonus) Skills Previous work with low-power vision devices , audio keyword spotting , or sensor fusion ML . Familiarity with edge security (encrypted models, secure firmware pipelines). Hands-on with simulators/emulators for ML testing (Edge Impulse, Hailo’s HEF emulator, etc.). Participation in TinyML forums , open-source ML toolkits, or ML benchmarking communities. Why Join Us? At EURTH TECHTRONICS PVT LTD , we're not just building IoT firmware—we're deploying machine learning intelligence on ultra-constrained edge platforms , powering real-time decisions at the edge. Get exposure to full-stack embedded ML pipelines — from model quantization to runtime integration. Work with a world-class team focused on ML efficiency, power optimization, and embedded system scalability .️ Contribute to mission-critical products used in industrial automation, medical wearables, smart infrastructure , and more. How to Apply Send your updated resume + GitHub/portfolio links to: jobs@eurthtech.com About the Company About EURTH TECHTRONICS PVT LTD EURTH TECHTRONICS PVT LTD is a cutting-edge Electronics Product Design and Engineering firm specializing in embedded systems, IoT solutions, and high-performance hardware development. We provide end-to-end product development services—from PCB design, firmware development, and system architecture to manufacturing and scalable deployment. With deep expertise in embedded software, signal processing, AI-driven edge computing, RF communication, and ultra-low-power design, we build next-generation industrial automation, consumer electronics, and smart infrastructure solutions. Our Core Capabilities Embedded Systems & Firmware Engineering – Architecting robust, real-time embedded solutions with RTOS, Linux, and MCU/SoC-based firmware. IoT & Wireless Technologies – Developing LoRa, BLE, Wi-Fi, UWB, and 5G-based connected solutions for industrial and smart city applications. Hardware & PCB Design – High-performance PCB layout, signal integrity optimization, and design for manufacturing (DFM/DFA). Product Prototyping & Manufacturing – Accelerating concept-to-market with rapid prototyping, design validation, and scalable production. AI & Edge Computing – Implementing real-time AI/ML on embedded devices for predictive analytics, automation, and security. Security & Cryptography – Integrating post-quantum cryptography, secure boot, and encrypted firmware updates. Our Industry Impact ✅ IoT & Smart Devices – Powering the next wave of connected solutions for industrial automation, logistics, and smart infrastructure. ✅ Medical & Wearable Tech – Designing low-power biomedical devices with precision sensor fusion and embedded intelligence. ✅ Automotive & Industrial Automation – Developing AI-enhanced control systems, predictive maintenance tools, and real-time monitoring solutions. ✅ Scalable Enterprise & B2B Solutions – Delivering custom embedded hardware and software tailored to OEMs, manufacturers, and system integrators. Our Vision We are committed to advancing technology and innovation in embedded product design. With a focus on scalability, security, and efficiency, we empower businesses with intelligent, connected, and future-ready solutions. We currently cater to B2B markets, offering customized embedded development services, with a roadmap to expand into direct-to-consumer (B2C) solutions.

Posted 1 month ago

Apply

3.0 years

0 Lacs

Gurgaon, Haryana, India

On-site

As a Senior Machine Learning Engineer, you will be responsible for designing, developing, and deploying cutting-edge models for end-to-end content generation, including AI-driven image/video generation, lip syncing, and multimodal AI systems. You will work on the latest advancements in deep generative modeling to create highly realistic and controllable AI-generated media. Responsibilities Research and Develop: Design and implement state-of-the-art generative models, including Diffusion Models, 3D VAEs, and GANs for AI-powered media synthesis. End-to-End Content Generation: Build and optimize AI pipelines for high-fidelity image/video generation and lip syncing using diffusion and autoencoder models. Speech and Video Synchronization: Develop advanced lip-syncing and multimodal generation models that integrate speech, video, and facial animation for hyper-realistic AI-driven content. Real-Time AI Systems: Implement and optimize models for real-time content generation and interactive AI applications using efficient model architectures and acceleration techniques. Scaling and Production Deployment: Work closely with software engineers to deploy models efficiently on cloud-based architectures (AWS, GCP, or Azure). Collaboration and Research: Stay ahead of the latest trends in deep generative models, diffusion models, and transformer-based vision systems to enhance AI-generated content quality. Experimentation and Validation: Design and conduct experiments to evaluate model performance, improve fidelity, realism, and computational efficiency, and refine model architectures. Code Quality and Best Practices: Participate in code reviews, improve model efficiency, and document research findings to enhance team knowledge-sharing and product development. Requirements Bachelor's or Master's degree in Computer Science, Machine Learning, or a related field. 3+ years of experience working with deep generative models, including Diffusion Models, 3D VAEs, GANs, and autoregressive models. Strong proficiency in Python and deep learning frameworks such as PyTorch. Expertise inmulti-modal AI, text-to-image, and image-to-video generation, audio to lipsync Strong understanding of machine learning principles and statistical methods. Good to have experience in real-time inference optimization, cloud deployment, and distributed training. Strong problem-solving abilities and a research-oriented mindset to stay updated with the latest AI advancements. Familiarity with generative adversarial techniques, reinforcement learning for generative models, and large-scale AI model training. Preferred Qualifications Experience with transformers and vision-language models(e. g., CLIP, BLIP, GPT-4V). Background in text-to-video generation, lip-sync generation, and real-time synthetic media applications. Experience in cloud-based AI pipelines (AWS, Google Cloud, or Azure) and model compression techniques (quantization, pruning, distillation). Contributions to open-source projects or published research in AI-generated content, speech synthesis, or video synthesis. This job was posted by Meghna Sidda from TrueFan.

Posted 1 month ago

Apply

3.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Roles and Responsibilities: As a, Data scientist / Senior Data scientist you will solve some of the most impactful business problems for our clients using a variety of AI and ML technologies. You will collaborate with business partners and domain experts to design and develop innovative solutions on the data to achieve predefined outcomes. Engage with clients to understand current and future business goals and translate business problems into analytical frameworks Develop custom models based on in-depth understanding of underlying data, data structures, and business problems to ensure deliverables meet client needs Create repeatable, interpretable and scalable models Effectively communicate the analytics approach and insights to a larger business audience Collaborate with team members, peers and leadership at Tredence and client companies Qualification: Bachelor's or Master's degree in a quantitative field (CS, machine learning, mathematics, statistics) or equivalent experience. 3+ years of experience in data science, building hands-on ML models Experience with LMs (Llama (1/2/3), T5, Falcon, Langchain or framework similar like Langchain) Candidate must be aware of entire evolution history of NLP (Traditional Language Models to Modern Large Language Models), training data creation, training set-up and finetuning Candidate must be comfortable interpreting research papers and architecture diagrams of Language Models Candidate must be comfortable with LORA, RAG, Instruct fine-tuning, Quantization, etc. Experience leading the end-to-end design, development, and deployment of predictive modeling solutions. Excellent programming skills in Python. Strong working knowledge of Python’s numerical, data analysis, or AI frameworks such as NumPy, Pandas, Scikit-learn, Jupyter, etc. Advanced SQL skills with SQL Server and Spark experience. Knowledge of predictive/prescriptive analytics including Machine Learning algorithms (Supervised and Unsupervised) and deep learning algorithms and Artificial Neural Networks Experience with Natural Language Processing (NLTK) and text analytics for information extraction, parsing and topic modeling. Excellent verbal and written communication. Strong troubleshooting and problem-solving skills. Thrive in a fast-paced, innovative environment Experience with data visualization tools — PowerBI, Tableau, R Shiny, etc. preferred Experience with cloud platforms such as Azure, AWS is preferred but not require

Posted 1 month ago

Apply

3.0 years

0 Lacs

Gurgaon, Haryana, India

Remote

Capgemini Invent Capgemini Invent is the digital innovation, consulting and transformation brand of the Capgemini Group, a global business line that combines market leading expertise in strategy, technology, data science and creative design, to help CxOs envision and build what’s next for their businesses. Your Role Job Description Edge AI Data Scientists will be responsible for designing, developing, and validating machine learning models—particularly in the domain of computer vision—for deployment on edge devices. This role involves working with data from cameras, sensors, and embedded platforms to enable real-time intelligence for applications such as object detection, activity recognition, and visual anomaly detection. The position requires close collaboration with embedded systems and AI engineers to ensure models are lightweight, efficient, and hardware-compatible. Candidate Requirements Education Bachelor's or Master’s degree in Data Science, Computer Science, or a related field. Experience 3+ years of experience in data science or machine learning with a strong focus on computer vision. Experience in developing models for edge deployment and real-time inference. Familiarity with video/image datasets and deep learning model training. Skills Proficiency in Python and libraries such as OpenCV, PyTorch, TensorFlow, and FastAI. Experience with model optimization techniques (quantization, pruning, etc.) for edge devices. Hands-on experience with deployment tools like TensorFlow Lite, ONNX, or OpenVINO. Strong understanding of computer vision techniques (e.g., object detection, segmentation, tracking). Familiarity with edge hardware platforms (e.g., NVIDIA Jetson, ARM Cortex, Google Coral). Experience in processing data from camera feeds or embedded image sensors. Strong problem-solving skills and ability to work collaboratively with cross-functional teams. Your Profile Responsibilities Develop and train computer vision models tailored for constrained edge environments. Analyze camera and sensor data to extract insights and build vision-based ML pipelines. Optimize model architecture and performance for real-time inference on edge hardware. Validate and benchmark model performance on various embedded platforms. Collaborate with embedded engineers to integrate models into real-world hardware setups. Stay up-to-date with state-of-the-art computer vision and Edge AI advancements. Document models, experiments, and deployment configurations. What You Will Love About Working Here· We recognize the significance of flexible work arrangements to provide support. Be it remote work, or flexible work hours, you will get an environment to maintain healthy work life balance. At the heart of our mission is your career growth. Our array of career growth programs and diverse professions are crafted to support you in exploring a world of opportunities. Equip yourself with valuable certifications in the latest technologies such as Generative AI. About Capgemini Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital andiCa sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, cloud and data, combined with its deep industry expertise and partner ecosystem. The Group reported 2023 global revenues of €22.5 billion.

Posted 1 month ago

Apply

7.0 years

25 - 35 Lacs

India

On-site

AI Lead – Generative & Agentic AI Systems Experience: 7–10 Years Location: Hyderabad (Hybrid) Employment Type: Full-Time About the Role: We are seeking a visionary and hands-on AI Lead to architect, build, and scale next-generation Generative and Agentic AI systems. In this role, you will drive the end-to-end lifecycle—from research and prototyping to production deployment—guiding a team of AI engineers and collaborating cross-functionally to deliver secure, scalable, and impactful AI solutions across multimodal and LLM-based ecosystems. Key Responsibilities: Architect and oversee the development of GenAI and Agentic AI workflows, including multi-agent systems and LLM-based pipelines. Guide AI engineers in best practices for RAG (Retrieval-Augmented Generation), prompt engineering, and agent design. Evaluate and implement the right technology stack: open source (Hugging Face, LangChain, LlamaIndex) vs. closed source (OpenAI, Anthropic, Mistral). Lead fine-tuning and adapter-based training (e.g., LoRA, QLoRA, PEFT). Drive inference optimization using quantization, ONNX, TensorRT, and related tools. Build and refine RAG pipelines using embedding models, vector DBs (FAISS, Qdrant), chunking strategies, and hybrid knowledge graph systems. Manage LLMOps with tools like Weights & Biases, MLflow, and ClearML, ensuring experiment reproducibility and model versioning. Design and implement evaluation frameworks for truthfulness, helpfulness, toxicity, and hallucinations. Integrate guardrails, content filtering, and data privacy best practices into GenAI systems. Lead development of multi-modal AI systems (VLMs, CLIP, LLaVA, video-text fusion models). Oversee synthetic data generation for fine-tuning in low-resource domains. Design APIs and services for Model-as-a-Service (MaaS) and AI agent orchestration. Collaborate with product, cloud, and infrastructure teams to align on deployment, GPU scaling, and cost optimization. Translate cutting-edge AI research into usable product capabilities, from prototyping to production. Mentor and grow the AI team, establishing R&D best practices and benchmarks. Stay up-to-date with emerging trends (arXiv, Papers With Code) to keep the organization ahead of the curve. Required Skills & Expertise: AI & ML Foundations: Generative AI, LLMs, Diffusion Models, Agentic AI Systems, Multi-Agent Planning, Prompt Engineering, Feedback Loops, Task Decomposition Ecosystem & Frameworks: Hugging Face, LangChain, OpenAI, Anthropic, Mistral, LLaMA, GPT, Claude, Mixtral, Falcon, etc. Fine-tuning & Inference: LoRA, QLoRA, PEFT, ONNX, TensorRT, DeepSpeed, vLLM Data & Retrieval Systems: FAISS, Qdrant, Chroma, Pinecone, Hybrid RAG + Knowledge Graphs MLOps & Evaluation: Weights & Biases, ClearML, MLflow, Evaluation metrics (truthfulness, helpfulness, hallucination) Security & Governance: Content moderation, data privacy, model alignment, ethical constraints Deployment & Ops: Cloud (AWS, GCP, Azure) with GPU scaling, Serverless LLMs, API-based inference, Docker/Kubernetes Other: Multi-modal AI (images, video, audio), API Design (Swagger/OpenAPI), Research translation and POC delivery Preferred Qualifications: 7+ years in AI/ML roles, with at least 2–3 years in a technical leadership capacity Proven experience deploying LLM-powered systems at scale Experience working with cross-functional product and infrastructure teams Contributions to open-source AI projects or published research papers (a plus) Strong communication skills to articulate complex AI concepts to diverse stakeholders Why Join Us? Work at the forefront of AI innovation with opportunities to publish, build, and scale impactful systems Lead a passionate team of engineers and researchers Shape the future of ethical, explainable, and usable AI products Ready to shape the next wave of AI? Apply now and join us on this journey! Job Types: Full-time, Permanent Pay: ₹2,500,000.00 - ₹3,500,000.00 per year Benefits: Flexible schedule Health insurance Provident Fund Supplemental Pay: Joining bonus Work Location: In person

Posted 1 month ago

Apply

5.0 years

0 Lacs

Hyderabad, Telangana, India

Remote

We are hiring a contract-based Computer Vision Engineer in Hyderabad, India to lead deep learning model development using PyTorch. The ideal candidate will design and deploy scalable computer vision pipelines focused on image and video analytics. This is a 6-month engagement to support real-time model deployment, optimization, and automation across cloud and edge platforms. Key Responsibilities 1. Deep Learning Model Development • Build and train state-of-the-art CV models using PyTorch for classification, detection (YOLO, Faster R-CNN), and segmentation (UNet, DeepLab) • Optimize data pipelines and preprocessing strategies for high-resolution image and video feeds • Fine-tune pre-trained models and manage custom model development based on project needs 2. Model Optimization & Deployment • Optimize models using ONNX, quantization, or TensorRT for cloud and edge deployment • Deploy real-time inference endpoints using containers (Docker/Kubernetes) and cloud services (Azure, AWS, GCP) • Maintain experiment tracking, model versioning, and deployment automation workflows 3. Data Engineering & Integration • Work with data engineers to build scalable data pipelines for ingestion and preprocessing • Integrate CV models into production systems and IoT environments (e.g., Jetson, Azure IoT) 4. Governance & Performance • Ensure AI workflows are secure, auditable, and production-ready • Apply model compression and tuning for performance at scale 5. Cross-functional Collaboration • Collaborate with DevOps, product teams, and ML engineers for seamless delivery • Document architectures and ensure knowledge transfer at project milestones Required Qualifications • Bachelor’s or Master’s in Computer Science, AI/ML, or a related technical field • 5+ years of experience in AI/ML with at least 2 years in deploying PyTorch-based CV models • Expertise in PyTorch, OpenCV, Python, Git, and deep learning model deployment • Familiarity with cloud platforms (Azure preferred), Docker/Kubernetes • Hands-on experience with model lifecycle tools (MLflow, W&B, DVC, etc.) Preferred Qualifications • Experience with edge AI platforms (Jetson, Coral, Azure Percept) • Knowledge of ONNX, TensorRT, or other optimization tools • Exposure to enterprise-grade security practices and MLOps workflows Contract Details • Duration: 6 months • Location: Hyderabad (Hybrid preferred; remote may be considered for exceptional candidates) • Compensation: Competitive, based on experience and expertise How to Apply Send your resume, portfolio (if available), and GitHub/LinkedIn profiles to: Info@primeverse.in Subject: Computer Vision Engineer – Hyderabad

Posted 1 month ago

Apply

40.0 years

1 - 4 Lacs

Hyderābād

On-site

It's fun to work in a company where people truly BELIEVE in what they're doing! We're committed to bringing passion and customer focus to the business. Job Description Machine Learning Engineer – RAG & Fine-Tuning This role requires working from our local Hyderabad office 2-3x a week. Location: Hyderabad, Telangana, India ABOUT ABC FITNESS ABC Fitness (ABC) is the global market leader in providing technology solutions to the fitness industry. Built on a 40+ year reputation of excellence, ABC helps fitness providers of all sizes and backgrounds to turn their visions into seamless reality. Founded in 1981, ABC serves 40 million+ members globally, processing over $11B+ in payments annually for 31,000 clubs across 92+ countries. Our integrated suite includes best-of-breed platforms: Evo, Glofox, Ignite, and Trainerize. As a Thoma Bravo portfolio company, ABC is backed by the leading private equity firm focused on enterprise software. Learn more at abcfitness.com . ABOUT THE TEAM The AI Platform Engineering team at ABC builds scalable, high-performance AI systems that power next-generation fitness technology. We specialize in retrieval-augmented generation (RAG) architectures and fine-tuning methodologies to deliver context-aware, cost-efficient AI solutions. As our Machine Learning Engineer, you will be responsible for all retrieval and intelligence behind the LLM, delivering performant, low-cost, high-context AI features. At ABC, we love entrepreneurs because we are entrepreneurs. We roll our sleeves up, we act fast, and we learn together. WHAT YOU’LL DO Handle embeddings and chunking strategies to optimize document and data retrieval for GenAI-powered features. Manage vector stores and retrieval workflows using leading vector databases (Pinecone, FAISS, Weaviate, Azure AI Search) to ensure efficient, scalable access to unstructured and structured data. Fine-tune small and large language models using frameworks such as HuggingFace and OpenAI APIs, tailoring models to domain-specific requirements and improving performance on targeted tasks. Optimize cost and reduce latency by implementing best practices for token management, model evaluation, and cloud resource utilization. Collaborate with engineering, product, and data teams to integrate RAG pipelines into production systems, ensuring reliability, scalability, and security. Stay up-to-date with the latest advancements in retrieval-augmented generation, vector search, and LLM fine-tuning, applying new techniques to improve system performance and user experience. WHAT YOU’LL NEED 4–7 years of experience in machine learning or AI engineering, with a proven track record in RAG, vector search, and LLM fine-tuning. Deep expertise with vector databases such as Pinecone, FAISS, Weaviate, or Azure AI Search, including experience designing retrieval workflows and managing embeddings. Familiarity with HuggingFace and OpenAI fine-tuning APIs, and strong understanding of chunking strategies for optimizing retrieval. Proficiency in Python and experience with ML frameworks (PyTorch, TensorFlow) and cloud platforms (AWS, Azure). Understanding of token management, evaluation tuning, and cost optimization for large-scale AI deployments. Strong problem-solving skills, a collaborative mindset, and the ability to communicate complex technical concepts to both technical and non-technical stakeholders. AND IT’S GREAT TO HAVE Experience with NLP, NLU, and NLG techniques for conversational AI or information retrieval. Exposure to ML Ops tools for model monitoring, evaluation, and deployment (ML flow, Weights & Biases). Experience with model compression, quantization, or other efficiency techniques. Certifications in AWS Machine Learning Specialty or Microsoft AI Engineer. WHAT’S IN IT FOR YOU: Purpose led company with a Values focused culture – Best Life, One Team, Growth Mindset Time Off – competitive PTO plans with 15 Earned accrued leave, 12 days Sick leave, and 12 days Casual leave per year 11 Holidays plus 4 Days of Disconnect – once a quarter, we take a collective breather and enjoy a day off together around the globe. #oneteam Group Mediclaim insurance coverage of INR 500,000 for employee + spouse, 2 kids, and parents or parent-in-laws, and including EAP counseling Life Insurance and Personal Accident Insurance Best Life Perk – we are committed to meeting you wherever you are in your fitness journey with a quarterly reimbursement Premium Calm App – enjoy tranquility with a Calm App subscription for you and up to 4 dependents over the age of 16 Support for working women with financial aid towards crèche facility, ensuring a safe and nurturing environment for their little ones while they focus on their careers. We’re committed to diversity and passion, and encourage you to apply, even if you don’t demonstrate all the listed skillsets! ABC’S COMMITMENT TO DIVERSITY, EQUALITY, BELONGING AND INCLUSION: ABC is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We are intentional about creating an environment where employees, our clients and other stakeholders feel valued and inspired to reach their full potential and make authentic connections. We foster a workplace culture that embraces each person’s diversity, including the extent to which they are similar or different. ABC leaders believe that an equitable and inclusive culture is not only the right thing to do, it is a business imperative. Read more about our commitment to diversity, equality, belonging and inclusion at abcfitness.com ABOUT ABC: ABC Fitness (abcfitness.com) is the premier provider of software and related services for the fitness industry and has built a reputation for excellence in support for clubs and their members. ABC is the trusted provider to boost performance and create a total fitness experience for over 41 million members of clubs of all sizes whether a multi-location chain, franchise or an independent gym. Founded in 1981, ABC helps over 31,000 gyms and health clubs globally perform better and more profitably offering a comprehensive SaaS club management solution that enables club operators to achieve optimal performance. ABC Fitness is a Thoma Bravo portfolio company, a private equity firm focused on investing in software and technology companies (thomabravo.com). #LI-HYBRID If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Posted 1 month ago

Apply

3.0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

Are you excited by the challenge of building intelligent systems that live at the edge, understand natural language, and adapt to their environment? Do you thrive at the intersection of AI, embedded systems, and real-world impact? Join Mantra Softech as a Senior AI Engineer and help define the future of smart living by designing embedded AI agents , natural language interfaces , and predictive intelligence for next-generation smart devices . From LLMs to edge-deployed vision and sensor analytics, your work will power real-time decisions and intuitive user experiences - all while pushing the boundaries of machine learning innovation. About Us - Mantra Softech Founded in 2006, Mantra Softech is a global leader in high-tech hardware innovation, specializing in biometric and RFID-based solutions. As we expand into the Smart Home ecosystem , we are developing a new portfolio of IoT devices that deliver seamless, secure, and intelligent living experiences. We are building a design-led product team, and this is your opportunity to help shape the identity of a next-gen connected home. Location : Ahmedabad, (Full-Time | On-Site) About The Role We are developing the next generation of intelligent systems that combine embedded sensing, AI at the edge, predictive maintenance, and natural language interfaces. If you're excited about deploying ML pipelines in real-world hardware, turning unstructured prompts into actions or queries, and working across the spectrum of edge devices, cloud infrastructure, and AI modeling - we want to work with you. As a Senior AI Engineer, You Will i. Drive research initiatives and proof-of-concepts that push the state of the art in generative AI and large-scale machine learning for home-automation projects. ii. Develop AI agents that can interpret, plan, and act autonomously in response to multi-modal inputs iii. Design and implement high-throughput, low-latency AI/ML pipelines and to operate at global scale. iv. Work on prompt engineering and iteratively develop LLM-based solutions tailored to custom use cases like text-to-SQL, text-to-command, and context-aware queries . v. Develop and optimize AI models for deployment on embedded hardware. vi. Prototype novel generative AI solutions, integrate advancements into production, and collaborate with research partners. vii. Integrate real-time streaming data (video/audio/sensor) into analytics pipelines and AI workflows. viii. Develop MLOps pipelines : versioning, continuous training, deployment, and monitoring of models. ix. Collaborate on cloud infrastructure (AWS) for scalable backend and edge-cloud sync. Required Skills i. Solid foundation and expertise in developing and deploying statistical ML models ii. Experience with ML frameworks like Scikit-learn, PyTorch, TensorFlow,OpenCV etc. iii. Hands-on experience with real-life signal/data processing (audio, sensor data etc.) is a must iv. Understanding of MLOps tools and lifecycle (e.g., MLflow, DVC, monitoring tools) v. Strong proficiency in Python and/or C/C++ vi. Experience in building multi-modal AI systems (combining text, audio, sensor, vision inputs) vii. Familiarity with voice activity detection (VAD), wake-word detection, and speech-to-command pipelines viii. Hands-on experience with time-series analysis and forecasting models for predictive maintenance ix. Experience with GenAI, prompt engineering and LLMs along with APIs x. Experience with containerization (Docker) and deploying models via microservices architecture xi. Exposure to model quantization and embedded deployment. xii. Strong understanding of data annotation, synthetic data generation, and active learning workflows. Knowledge of code version management like Bit-bucket. xiii. Strong grasp of SQL : writing optimized queries, working with relational schemas, and integrating SQL into ML pipelines xiv. Exposure to NoSQL databases (e.g., MongoDB, Redis) for high-speed or unstructured data applications xv. Knowledge of AI safety, model robustness, and explainability techniques for production-grade systems Preferred Skills i. Experience with LangChain, LlamaIndex, or custom AI agent frameworks ii. Knowledge of digital twins, condition monitoring, or industrial telemetry iii. Exposure to event-driven edge/cloud orchestration Qualifications : Master's degree in relevant or related field of AI/ML from Tier-I/II institute with at least 3-4 years of experience (ref:hirist.tech)

Posted 1 month ago

Apply

0.0 years

0 Lacs

India

Remote

AI Bot Developer Location: Remote (India) Experience: 0-2 years Salary: ₹7 LPA base + ₹ 2 LPA performance bonus (based on skills) Type: Full-Time/Part-time/Contract/Internship About Us: We’re an early-stage startup building AI-driven digital products that blend large language models (LLMs) with scalable products. Our organisation's goal is to increase product and service visibility, user engagement and create high ROI with marketing budget on social media platforms, The ideal candidate can be a university student, recent graduate, contractor or a full-time employee, however, they should have previous part-time or full-time technical experience, have a well composed github profile or personal portfolio, be able to prove their technical capabilities and knowledge on bot development upon immediate request and show commitment to the role. Responsibilities: Design and deploy LLM-powered bots (e.g., auto-reply systems, content summarizers, viral thread generators). Integrate RAG pipelines (e.g., retrieve Reddit comments + LLM responses) with vector DBs (Pinecone, Weaviate). Fine-tune/open-source LLMs (Llama 3, Mistral) for bot-specific tasks (persona mimicry, NSFW filtering). Optimize cost/latency (model quantization, caching, hybrid rule-based + AI logic). Implement stealth measures (human-like delays, randomized phrasing, proxy rotation). Technical Stack: Core: Python (Tweepy, PRAW, AsyncIO, LangChain, LlamaIndex) LLMs: OpenAI, Anthropic, Gemini, or self-hosted (Llama.cpp, vLLM) Infra: AWS Lambda, FastAPI, Docker, Redis (for rate-limiting) Data: Vector DBs (Pinecone), PostgreSQL, Firebase Bonus: Next.js/Streamlit for bot dashboards Ideal Candidate: ✅ 1-2 years of experience building Python bots (X/Reddit API automation). ✅ Worked with LLM integrations (RAG, fine-tuning, prompt engineering). ✅ Understands detection tactics (IP rotation, CAPTCHA solvers). ✅ Pragmatic problem-solver who cares about scalability vs. cost tradeoffs. ✅ Prefered: Deployed bots at scale (10K+ reqs/day) or contributed to open-source LLM projects. Compensation & Benefits: ₹7 lakhs base salary + ₹2 lakhs performance-based bonus (58,350 INR per month, 50k INR bonus per successful quarter) 100% remote work - work from anywhere within the working hours Opportunity to work on cutting-edge AI products from ground up Flat hierarchy and direct impact on technical decisions Learning budget for courses/certificates

Posted 1 month ago

Apply

5.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

About Mihira Visual Labs Mihira Visual Labs is a research-driven CGI and VFX studio redefining filmmaking through AI- and ML-powered workflows. We specialize in the development and production of full-length animated films, empowering creators with cutting-edge tools to accelerate high-quality storytelling and IP creation. Our mission is to make world-class storytelling faster, more efficient, and more cost-effective — where human imagination is the only true differentiator. Role Overview The Lead AI/ML Engineer will spearhead the development and integration of artificial intelligence and machine learning solutions into our VFX production pipeline. You’ll work closely with pipeline developers, production managers, and creative teams to reimagine workflows, automate repetitive tasks, and push the boundaries of innovation in photorealistic rendering, stylized animation, and other CGI processes. This role combines deep technical expertise with a strong understanding of the demands of a fast-paced VFX/animation studio. Key Responsibilities AI Strategy & Roadmap Develop and maintain a strategic plan for implementing AI/ML across the VFX pipeline—covering data wrangling, rendering, asset management, animation, and post-production. Identify new approaches to innovate current workflows to increase efficiency and cut down on time. Algorithm & Model Development Research, design, and implement ML models (e.g., computer vision, generative models, style transfer) that improve artist efficiency, production speed, enhance image quality, or enable new creative possibilities. Optimize models for performance on local GPU/CPU clusters or cloud-based infrastructures. Pipeline Integration & Automation Collaborate with pipeline engineers to seamlessly integrate AI agents or tools into existing software stacks (e.g., Maya, Houdini, Nuke), ensuring minimal disruption to artists’ workflows. Develop automated solutions for tasks like rotoscoping, clean-up, crowd simulation, environment generation, or facial capture/animation. Infrastructure & Tooling Architect and maintain robust data pipelines, ensuring the secure collection and organization of high-quality datasets for training AI models. Evaluate and deploy containerization/MLOps tools (Docker, Kubernetes, MLflow, etc.) for scalable model training, inference, and monitoring. Performance Optimization Profile model performance, memory usage, and render times; implement optimizations in frameworks such as TensorFlow, PyTorch, or custom GPU pipelines. Work with DevOps/IT teams to configure and manage dedicated GPU farms or cloud compute resources. Research & Development Stay updated with state-of-the-art ML/DL techniques, particularly in generative AI, computer vision, and real-time rendering. Introduce emerging methods (e.g., stable diffusion, large language models, neural rendering) to innovate new production techniques. Mentorship & Collaboration Lead a small team of AI engineers or data scientists, providing guidance on best practices, code reviews, and architectural decisions. Educate and train production staff on AI-driven tools and workflows, fostering a culture of continuous improvement. Documentation & Reporting Create clear technical documentation for AI solutions, ensuring maintainability and scalability. Present progress, insights, and ROI to executive leadership, project stakeholders, and cross-functional teams. Qualifications & Skills Bachelor’s or Master’s degree in Computer Science, AI/ML, or related field. A PhD is a bonus but not mandatory. 5+ years of professional experience in applied machine learning or data science, with at least 2 years in a lead/managerial role. Previous experience in VFX, animation, gaming, or related entertainment industries is a bonus. Programming: Expert-level Python (C++ is a plus). ML Frameworks: Deep understanding of TensorFlow, PyTorch, scikit-learn, or similar libraries. Computer Vision & Generative Models: Familiarity with CNNs, GANs, autoencoders, stable diffusion, or neural radiance fields. Pipeline Tools: Experience with integration in VFX software (Maya, Houdini, Nuke) and plugin APIs.[Optional] DevOps & MLOps: Comfortable with containerization (Docker), orchestration (Kubernetes), CI/CD, and cloud platforms (AWS, Azure, GCP). Proven track record of translating production challenges into AI/ML solutions that deliver measurable efficiency gains or cost savings. Experience with model optimization (quantization, pruning) and GPU/CPU performance tuning. Collaboration: Excellent communication to bridge technical and creative teams, explaining complex concepts in clear, accessible language. Leadership: Ability to mentor junior engineers and foster a culture of experimentation and continuous learning. Agility: Adapts quickly to evolving project needs, production pipelines, and new AI techniques. A genuine interest in cinema, animation, or gaming—a plus if you have prior knowledge of the Baahubali IP or similar large-scale IPs. Creativity in applying AI to artistic challenges, from photorealistic digital humans to stylized animated sequences.

Posted 1 month ago

Apply

5.0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

Are you excited by the challenge of building intelligent systems that live at the edge, understand natural language, and adapt to their environment? Do you thrive at the intersection of AI, embedded systems, and real-world impact? About Us – Mantra Softech Mantra Softech is a technology leader in biometric (Face, Finger, & Iris) solutions, now building the next generation of AI-powered smart home devices . As we expand into intelligent IoT, we're forming a world-class AI team to bring real-time perception, decision-making, and automation to the edge. Join us to shape the future of connected living. Role Overview We're looking for a Senior AI Engineer who thrives at the intersection of AI, embedded systems, and real-world impact. You will lead initiatives in LLMs, Edge AI, sensor fusion, ML Models and natural language interfaces to build AI-powered experiences across a new class of smart home devices. Location 📍 Ahmedabad, (Full-Time | On-Site) About the Role We are developing the next generation of intelligent systems that combine embedded sensing, AI at the edge, predictive maintenance, and natural language interfaces. If you're excited about deploying ML pipelines in real-world hardware, turning unstructured prompts into actions or queries, and working across the spectrum of edge devices, cloud infrastructure, and AI modeling — we want to work with you. Key Responsibilities Drive research initiatives and proof-of-concepts that push the state of the art in Gen AI and large-scale machine learning for home-automation projects Build AI agents capable of interpreting multi-modal inputs (vision, voice, sensors) and performing intelligent actions Design and deploy AI/ML pipelines across edge and cloud environments Develop natural language interfaces for voice and text-based interactions Work on prompt engineering and iteratively develop LLM-based solutions tailored to custom use cases like text-to-SQL, text-to-command, and context-aware queries Optimize ML models for embedded deployment and real-time execution Prototype novel generative AI solutions, integrate advancements into production, and collaborate with research partners. Integrate real-time streaming data (video/audio/sensor) into analytics pipelines and AI workflows Implement MLOPs workflows : model versioning, retraining, monitoring Collaborate on cloud infrastructure (AWS/Azure/GCP) for scalable backend and edge-cloud sync. Required Skills 3–5 years of hands-on experience in AI/ML system development Expertise in Python (and/or C/C++), PyTorch/TensorFlow, and ML frameworks Solid foundation and expertise in developing and deploying statistical ML models Experience with ML frameworks like Scikit-learn, PyTorch, TensorFlow,OpenCV etc. Hands-on experience with real-life signal/data processing (audio, sensor data, etc.) is a must Understanding of MLOps tools and lifecycle (e.g., MLflow, DVC, monitoring tools) Hands-on experience with time-series analysis and forecasting models for predictive maintenance Experience with LLMs, prompt engineering, and GenAI pipelines Deployment experience using Docker, microservices, and embedded platforms Exposure to model optimization (quantization, pruning) for edge AI Strong understanding of data annotation, synthetic data generation, and active learning workflows Strong grasp of SQL and exposure to NoSQL databases (e.g., Redis, MongoDB) Preferred Skills Experience with LangChain, LlamaIndex, or custom AI agent frameworks Knowledge of digital twins, condition monitoring, or telemetry Knowledge of AI safety, model robustness, and explainability techniques for production-grade systems Familiarity with edge-cloud orchestration and event-driven systems Qualifications Master’s degree in relevant or related field of AI/ML from Tier-I/II institute with 3-4 years of experience If you're passionate about building intelligent systems that power the next generation of smart home devices — from real-time perception to autonomous decision-making — we want to hear from you. Join the AI team at Mantra Softech and help redefine the smart home experience through cutting-edge artificial intelligence. How to Apply Send your resume and portfolio to jatin.prakash@mantratec.com Please include "Senior - AI Engineer" in the subject line.

Posted 1 month ago

Apply

5.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

About Us Yubi stands for ubiquitous. But Yubi will also stand for transparency, collaboration, and the power of possibility. From being a disruptor in India’s debt market to marching towards global corporate markets from one product to one holistic product suite with seven products Yubi is the place to unleash potential. Freedom, not fear. Avenues, not roadblocks. Opportunity, not obstacles. About Yubi Yubi, formerly known as CredAvenue, is re-defining global debt markets by freeing the flow of finance between borrowers, lenders, and investors. We are the world's possibility platform for the discovery, investment, fulfillment, and collection of any debt solution. At Yubi, opportunities are plenty and we equip you with tools to seize it. In March 2022, we became India's fastest fintech and most impactful startup to join the unicorn club with a Series B fundraising round of $137 million. In 2020, we began our journey with a vision of transforming and deepening the global institutional debt market through technology. Our two-sided debt marketplace helps institutional and HNI investors find the widest network of corporate borrowers and debt products on one side and helps corporates to discover investors and access debt capital efficiently on the other side. Switching between platforms is easy, which means investors can lend, invest and trade bonds - all in one place. All of our platforms shake up the traditional debt ecosystem and offer new ways of digital finance. Yubi Credit Marketplace - With the largest selection of lenders on one platform, our credit marketplace helps enterprises partner with lenders of their choice for any and all capital requirements. Yubi Invest - Fixed income securities platform for wealth managers & financial advisors to channel client investments in fixed income Financial Services Platform - Designed for financial institutions to manage co-lending partnerships & asset based securitization Spocto - Debt recovery & risk mitigation platform Corpository - Dedicated SaaS solutions platform powered by Decision-grade data, Analytics, Pattern Identifications, Early Warning Signals and Predictions to Lenders, Investors and Business Enterprises So far, we have on-boarded over 17000+ enterprises, 6200+ investors & lenders and have facilitated debt volumes of over INR 1,40,000 crore. Backed by marquee investors like Insight Partners, B Capital Group, Dragoneer, Sequoia Capital, LightSpeed and Lightrock, we are the only-of-its-kind debt platform globally, revolutionizing the segment. At Yubi, People are at the core of the business and our most valuable assets. Yubi is constantly growing, with 1000+ like-minded individuals today, who are changing the way people perceive debt. We are a fun bunch who are highly motivated and driven to create a purposeful impact. Come, join the club to be a part of our epic growth story. Responsibilities This particular role is within our Yubi Invest vertical, and you would get to work on building our bonds platform, called Aspero, for retail users. Be able to operate in ambiguous situations and define clear objectives by breaking down the narratives independently. Work closely with business, research, data and engineering teams to understand the user goals, market dynamics and ship products. Aligning product strategy, proposition and roadmap with measurable metrics with all stakeholders. Drive PRDs, product planning, and product design of new features and enhancements. Clearly communicate product and platform benefits to our users and internal stakeholders About The Role- We’re looking for a highly skilled, results-driven AI engineer who thrives in fast-paced, high-impact environments. If you are passionate about pushing the boundaries of Computer Vision, OCR, and Large Language Models (LLMs) and have a strong foundation in building and deploying AI solutions, this role is for you. As a Senior Data Scientist, you will take ownership of designing and implementing state-of-the-art OCR and Computer Vision systems. This role demands deep technical expertise, the ability to work autonomously, and a mindset that embraces complex challenges head-on. Here, you won’t just fine-tune pre-trained models—you’ll be architecting, optimizing, and scaling AI solutions that power real-world applications. Key Responsibilities- Architect, develop, and deploy high-performance Computer Vision and OCR models for real-world applications. Implement and optimize state-of-the-art OCR models such as Donut, TrOCR, LayoutLM, and DocFormer for document processing and information extraction. Fine-tune and integrate LLMs (GPT, LLaMA, Mistral, etc.) to enhance text understanding and automation. Develop custom deep learning models for large-scale image and document processing. Build and optimize end-to-end AI pipelines, ensuring efficient data processing and model deployment. Work closely with engineers to operationalize AI models in production (Docker, FastAPI, TensorRT, ONNX). Enhance GPU performance and model inference efficiency, applying techniques such as quantization and pruning. Stay ahead of industry advancements, continuously experimenting with new AI architectures and training techniques. Work in a highly dynamic, startup-like environment, balancing rapid experimentation with production-grade robustness. Requirements 5-10 years experience p roven technical expertise – Strong programming skills in Python, PyTorch, TensorFlow with deep experience in Computer Vision and OCR. Hands-on experience in developing, training, and deploying OCR and document AI models. Deep understanding of Transformer-based architectures for vision and text processing. Experience working with Hugging Face, OpenCV, TensorRT, and NVIDIA GPUs for model acceleration. Autonomous problem solver – You take initiative, work independently, and drive projects from research to production. Strong experience in scaling AI solutions, including model optimization and deployment on cloud platforms (AWS/GCP/Azure). Thrives in fast-paced environments – You embrace challenges, pivot quickly, and execute effectively. Familiarity with MLOps tools (Docker, FastAPI, Kubernetes) for seamless model deployment. Experience in multi-modal models (Vision + Text). Nice to Have- Strong background in vector databases, RAG pipelines, and fine-tuning LLMs for document intelligence. Contributions to open-source AI projects.

Posted 1 month ago

Apply

4.0 years

0 Lacs

Hyderābād

On-site

Company: Qualcomm India Private Limited Job Area: Engineering Group, Engineering Group > Software Engineering General Summary: More details below: Join the exciting Generative AI team at Qualcomm focused on integrating cutting edge GenAI models on Qualcomm chipsets. The team uses Qualcomm chips’ extensive heterogeneous computing capabilities to allow inference of GenAI models on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. Utilize this power efficient hardware and Software stack to run Large Language Models (LLMs) and Large Vision Models (LVM) at near GPU speeds! Responsibilities: In this role, you will spearhead the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK on Qualcomm SoCs. As an AI inferencing expert, you'll push the limits of performance from large models. Your mastery in deploying large C/C++ software stacks using best practices will be essential. You'll stay on the cutting edge of GenAI advancements, understanding LLMs/Transformers and the nuances of edge-based GenAI deployment. Most importantly, your passion for the role of edge in AI's evolution will be your driving force. Requirements: Master’s/Bachelor’s degree in computer science or equivalent. 4+ years of relevant work experience in software development. Strong understanding of Generative AI models – LLM, LVM and LLMs and building blocks Floating-point, Fixed-point representations and Quantization concepts. Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU). Strong development skills in C/C++ Excellent analytical and debugging skills. Good communication skills (verbal, presentation, written). Ability to collaborate across a globally diverse team and multiple interests. Preferred Qualifications Strong understanding of SIMD processor architecture and system design. Proficiency in object-oriented software development. Familiarity with Linux and Windows environment Strong background in kernel development for SIMD architectures. Familiarity with frameworks like llama.cpp, MLX, and MLC is a plus. Good knowledge of PyTorch, TFLite, and ONNX Runtime is preferred. Experience with parallel computing systems and Assembly is a plus. Minimum Qualifications: Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience. OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 1+ year of Software Engineering or related work experience. OR PhD in Engineering, Information Systems, Computer Science, or related field. 2+ years of academic or work experience with Programming Language such as C, C++, Java, Python, etc. Applicants : Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries). Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law. To all Staffing and Recruiting Agencies : Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications. If you would like more information about this role, please contact Qualcomm Careers.

Posted 1 month ago

Apply

4.0 years

1 - 2 Lacs

Hyderābād

On-site

Company: Qualcomm India Private Limited Job Area: Engineering Group, Engineering Group > Software Engineering General Summary: As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Software Engineer, you will design, develop, create, modify, and validate embedded and cloud edge software, applications, and/or specialized utility programs that launch cutting-edge, world class products that meet and exceed customer needs. Qualcomm Software Engineers collaborate with systems, hardware, architecture, test engineers, and other teams to design system-level software solutions and obtain information on performance requirements and interfaces. Minimum Qualifications: Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience. OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience. OR PhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience. 2+ years of work experience with Programming Language such as C, C++, Java, Python, etc. Machine Learning Engineer Job Location: Hyderabad More details below: Join a new and growing team at Qualcomm focused on advancing state-of-the-art in Machine Learning. The team uses Qualcomm chips’ extensive heterogeneous computing capabilities. See your work directly impact billions of mobile devices around the world. In this position, you will be responsible for the development and commercialization of ML solutions like Snapdragon Neural Processing Engine (SNPE) and AI Model Efficiency Toolkit (AIMET) on Qualcomm SoCs. You will have expert knowledge of design, improvement, and maintenance of large AI software stacks using best practices. Work Experience: 1. 8-12 years of relevant work experience in software development 2. Live and breathe quality software development with excellent analytical and debugging skills. Strong understanding of Deep Learning and Machine learning theory and practice. 3. Experience with Deep learning model development. Data transformations, model training, model design, model optimization. 4. Familiarity with various deep learning architectures and problem domains like Computer Vision, Speech recognition, NLP etc. 5. Strong development skills in Python and C++. Experience with at least one machine learning framework like TensorFlow, ONNX, Pytorch, etc. 6. Understanding of software development and debugging in embedded environments. 7. Excellent communication skills (verbal, presentation, written) 8. Ability to collaborate across a globally diverse team and multiple interests. Preferred Qualifications 1. Familiarity with neural network operators and model formats including PyTorch, ONNX, and Tensorflow. 2. Familiarity with neural network optimization techniques like graph optimization, quantization, pruning, knowledge distillation, network architecture search etc. 3. Strong understanding about embedded systems, system design fundamentals. 4. Well versed in version control tools like git 5. Experience with machine learning accelerators, optimizing algorithms for hardware acceleration cores, working with heterogeneous or parallel computing systems. Educational Requirements Bachelor's/Master’s/PhD in Computer Science, Computer Engineering, or Electrical Engineering Applicants : Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries). Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law. To all Staffing and Recruiting Agencies : Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications. If you would like more information about this role, please contact Qualcomm Careers.

Posted 1 month ago

Apply

10.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Exp : 15yrs to 23yrs Primary skills :- Vision AI Solution, Nvidia, Computer Vision, Media, Open Stack. Key Responsibilities Define and lead the end-to-end technical architecture for vision-based AI systems across edge and cloud. Design and optimize large-scale video analytics pipelines using NVIDIA DeepStream, TensorRT, and Triton Inference Server. Architect distributed AI systems, including model training, deployment, inferencing, monitoring, and continuous learning. Collaborate with product, research, and engineering teams to translate business requirements into scalable AI solutions. Lead efforts in model optimization (quantization, pruning, distillation) for real-time performance on devices like Jetson Orin/Xavier. Drive the integration of multi-modal AI (vision + language, 3D, audio) where applicable. Guide platform choices (e.g., edge AI vs cloud AI trade-offs), ensuring cost-performance balance. Mentor senior engineers and promote best practices in MLOps, system reliability, and AI observability. Stay current with emerging technologies (e.g., NeRF, Diffusion Models, Vision Transformers, synthetic data). Contribute to internal innovation strategy, including IP generation, publications, and external presentations. ________________________________________ 🛠️ Required Technical Skills Deep expertise in computer vision, deep learning, and multi-modal AI. Proven hands-on experience with: NVIDIA Jetson, DeepStream SDK, TensorRT, Triton Inference Server TAO Toolkit, Isaac SDK, CUDA, cuDNN Strong in PyTorch, TensorFlow, OpenCV, GStreamer, and GPU-accelerated pipelines. Experience deploying vision AI models at large scale (e.g., 1000+ cameras/devices or multi-GPU clusters). Skilled in cloud-native ML infrastructure: Docker, Kubernetes, CI/CD, MLflow, Seldon, Airflow Proficiency in Python, C++, CUDA (or PyCUDA), and scripting. Familiar with 3D vision, synthetic data pipelines, and generative models (e.g., SAM, NeRF, Diffusion). Experience in multi modal (LVM/VLM), SLMs, small LVM/ VLM, Time series Gen AI models, Agentic AI, LLMOps/Edge LLMOps, Guardrails, Security in Gen AI, YOLO/Vision Transformers ________________________________________ 🤝 Soft Skills & Leadership 10+ years in AI/ML/Computer Vision, with 8+ years in technical leadership or architect roles Strong leadership skills with experience mentoring technical teams and driving innovation. Excellent communicator with the ability to engage stakeholders across engineering, product, and business. Strategic thinker with a practical mindset—able to balance innovation with production-readiness. Experience interfacing with enterprise customers, researchers, and hardware partners. ________________________________________ 🧩 Preferred Qualifications MS or PhD in Computer Vision, Machine Learning, Robotics, or a related technical field ( Added Advantage ) Experience with NVIDIA Omniverse, Clara, or MONAI for healthcare or simulation environments. Experience in domains like smart cities, robotics, retail analytics, or medical imaging. Contributions to open-source projects or technical publications. Certifications: NVIDIA Jetson Developer, AWS/GCP AI/ML Certifications.

Posted 1 month ago

Apply

5.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Primary skill :- NVIDIA Solution Architect, GEN / AI Architect, Azure or AWS cloud. Relevant Exp :- NVIDIA ( 2 to 3 yrs ) Location :- Chennai / Noida. As an NVIDIA Generative AI Solution Architect at , you will lead the design, development, and deployment of AI solutions leveraging NVIDIA’s Edge AI, Computer Vision, Generative AI, and Metropolis technologies . You will collaborate with cross-functional teams and customers to architect scalable, high-performance AI systems integrating real-time computer vision, generative AI workflows, and industrial digital twins on edge, cloud, and metaverse platforms. Key Responsibilities Architect and deliver end-to-end AI solutions using NVIDIA’s AI Enterprise software, NeMo framework, Triton Inference Server, and GPU-accelerated platforms. Design and implement AI pipelines optimized for edge devices (NVIDIA Jetson, Clara), cloud infrastructure (AWS, Azure, GCP), and data centers (NVIDIA DGX). Develop and showcase proof-of-concept solutions using large language models (LLMs), retrieval-augmented generation (RAG), and advanced computer vision models for object detection, segmentation, and video analytics. Utilize NVIDIA Metropolis platform capabilities to architect AI-powered video analytics and smart city solutions, leveraging edge-to-cloud pipelines for real-time insights and automation. Optimize AI inference workloads using CUDA, TensorRT, mixed precision, and model quantization to meet stringent latency and throughput SLAs. Collaborate with company engineering, product, and client teams to embed NVIDIA AI technologies into enterprise workflows and industrial applications. Provide technical leadership, training, and mentorship on NVIDIA SDKs, AI best practices, and solution deployment strategies. Stay abreast of NVIDIA’s product roadmap, AI research trends, and industrial AI innovations to drive continuous solution improvement. Support customer engagements including technical workshops, solution demonstrations, and architectural reviews. Ensure adherence to data privacy, security, and ethical AI standards throughout the solution lifecycle. Required Qualifications Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related technical field. 5+ years of experience architecting and deploying AI/ML solutions with strong expertise in NVIDIA AI platforms (NeMo, Triton, CUDA, TensorRT). Proven experience with generative AI technologies including large language models, prompt engineering, and RAG workflows. Strong background in computer vision applications, including object detection, segmentation, and video analytics frameworks. Hands-on experience deploying AI solutions on edge devices (NVIDIA Jetson, Clara), cloud platforms (Azure, AWS, GCP), and data center GPU infrastructure. Familiarity with NVIDIA Metropolis platform for AI-powered video analytics and smart infrastructure solutions. Proficiency in Python, C++, and deep learning frameworks such as PyTorch or TensorFlow. Experience with container orchestration (Kubernetes, Docker) and MLOps practices including CI/CD pipelines for AI workloads. Excellent communication skills for engaging technical teams and business stakeholders. Willingness to travel up to 15% for client and NVIDIA events. Preferred Skills Experience optimizing AI inference with TensorRT, mixed precision, and model quantization. Knowledge of AI ethics, bias mitigation, and responsible AI principles. Prior experience in industrial, manufacturing, smart cities, or healthcare domains. Certifications related to NVIDIA AI technologies or cloud platforms (AWS, Azure, GCP). Experience working in global, cross-cultural teams.

Posted 1 month ago

Apply

0.0 - 7.0 years

0 Lacs

Hyderabad, Telangana

On-site

It's fun to work in a company where people truly BELIEVE in what they're doing! We're committed to bringing passion and customer focus to the business. Job Description Machine Learning Engineer – RAG & Fine-Tuning This role requires working from our local Hyderabad office 2-3x a week. Location: Hyderabad, Telangana, India ABOUT ABC FITNESS ABC Fitness (ABC) is the global market leader in providing technology solutions to the fitness industry. Built on a 40+ year reputation of excellence, ABC helps fitness providers of all sizes and backgrounds to turn their visions into seamless reality. Founded in 1981, ABC serves 40 million+ members globally, processing over $11B+ in payments annually for 31,000 clubs across 92+ countries. Our integrated suite includes best-of-breed platforms: Evo, Glofox, Ignite, and Trainerize. As a Thoma Bravo portfolio company, ABC is backed by the leading private equity firm focused on enterprise software. Learn more at abcfitness.com . ABOUT THE TEAM The AI Platform Engineering team at ABC builds scalable, high-performance AI systems that power next-generation fitness technology. We specialize in retrieval-augmented generation (RAG) architectures and fine-tuning methodologies to deliver context-aware, cost-efficient AI solutions. As our Machine Learning Engineer, you will be responsible for all retrieval and intelligence behind the LLM, delivering performant, low-cost, high-context AI features. At ABC, we love entrepreneurs because we are entrepreneurs. We roll our sleeves up, we act fast, and we learn together. WHAT YOU’LL DO Handle embeddings and chunking strategies to optimize document and data retrieval for GenAI-powered features. Manage vector stores and retrieval workflows using leading vector databases (Pinecone, FAISS, Weaviate, Azure AI Search) to ensure efficient, scalable access to unstructured and structured data. Fine-tune small and large language models using frameworks such as HuggingFace and OpenAI APIs, tailoring models to domain-specific requirements and improving performance on targeted tasks. Optimize cost and reduce latency by implementing best practices for token management, model evaluation, and cloud resource utilization. Collaborate with engineering, product, and data teams to integrate RAG pipelines into production systems, ensuring reliability, scalability, and security. Stay up-to-date with the latest advancements in retrieval-augmented generation, vector search, and LLM fine-tuning, applying new techniques to improve system performance and user experience. WHAT YOU’LL NEED 4–7 years of experience in machine learning or AI engineering, with a proven track record in RAG, vector search, and LLM fine-tuning. Deep expertise with vector databases such as Pinecone, FAISS, Weaviate, or Azure AI Search, including experience designing retrieval workflows and managing embeddings. Familiarity with HuggingFace and OpenAI fine-tuning APIs, and strong understanding of chunking strategies for optimizing retrieval. Proficiency in Python and experience with ML frameworks (PyTorch, TensorFlow) and cloud platforms (AWS, Azure). Understanding of token management, evaluation tuning, and cost optimization for large-scale AI deployments. Strong problem-solving skills, a collaborative mindset, and the ability to communicate complex technical concepts to both technical and non-technical stakeholders. AND IT’S GREAT TO HAVE Experience with NLP, NLU, and NLG techniques for conversational AI or information retrieval. Exposure to ML Ops tools for model monitoring, evaluation, and deployment (ML flow, Weights & Biases). Experience with model compression, quantization, or other efficiency techniques. Certifications in AWS Machine Learning Specialty or Microsoft AI Engineer. WHAT’S IN IT FOR YOU: Purpose led company with a Values focused culture – Best Life, One Team, Growth Mindset Time Off – competitive PTO plans with 15 Earned accrued leave, 12 days Sick leave, and 12 days Casual leave per year 11 Holidays plus 4 Days of Disconnect – once a quarter, we take a collective breather and enjoy a day off together around the globe. #oneteam Group Mediclaim insurance coverage of INR 500,000 for employee + spouse, 2 kids, and parents or parent-in-laws, and including EAP counseling Life Insurance and Personal Accident Insurance Best Life Perk – we are committed to meeting you wherever you are in your fitness journey with a quarterly reimbursement Premium Calm App – enjoy tranquility with a Calm App subscription for you and up to 4 dependents over the age of 16 Support for working women with financial aid towards crèche facility, ensuring a safe and nurturing environment for their little ones while they focus on their careers. We’re committed to diversity and passion, and encourage you to apply, even if you don’t demonstrate all the listed skillsets! ABC’S COMMITMENT TO DIVERSITY, EQUALITY, BELONGING AND INCLUSION: ABC is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We are intentional about creating an environment where employees, our clients and other stakeholders feel valued and inspired to reach their full potential and make authentic connections. We foster a workplace culture that embraces each person’s diversity, including the extent to which they are similar or different. ABC leaders believe that an equitable and inclusive culture is not only the right thing to do, it is a business imperative. Read more about our commitment to diversity, equality, belonging and inclusion at abcfitness.com ABOUT ABC: ABC Fitness (abcfitness.com) is the premier provider of software and related services for the fitness industry and has built a reputation for excellence in support for clubs and their members. ABC is the trusted provider to boost performance and create a total fitness experience for over 41 million members of clubs of all sizes whether a multi-location chain, franchise or an independent gym. Founded in 1981, ABC helps over 31,000 gyms and health clubs globally perform better and more profitably offering a comprehensive SaaS club management solution that enables club operators to achieve optimal performance. ABC Fitness is a Thoma Bravo portfolio company, a private equity firm focused on investing in software and technology companies (thomabravo.com). #LI-HYBRID If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Posted 1 month ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies