Jobs
Interviews

12 Vision Transformers Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 7.0 years

0 Lacs

karnataka

On-site

As a Machine Learning Engineer at Apple, you will have the opportunity to impact the future of Manufacturing through cutting-edge ML techniques. You will be working on groundbreaking applications of machine learning, research, and implementation that have the potential to change lives for the better. Your work will directly contribute to the optimization of processes, automation of quality control, and improvement of operational efficiency within Apple's supply chain for globally recognized products. In this role, you will collaborate with various engineering and operations teams to develop ML solutions for vision and language-based tasks. Your responsibilities will include crafting, designing, and implementing machine learning strategies for Apple's iPhone, Mac, and iPad supply chains. You will play a key role in fine-tuning Large Language Models (LLMs) and applying them in combination with Computer Vision (CV) techniques to address challenges in manufacturing environments. You will work alongside a vibrant team of machine learning engineers and data scientists to conceive, code, and deploy machine learning models at scale using industry-leading tools. Your role will involve developing scalable Computer Vision and Machine Learning algorithms, rapid prototyping for real-world manufacturing problems, and leveraging LLMs to automate document analysis and process optimization. Key Responsibilities: - Develop and deploy scalable Computer Vision and Machine Learning algorithms on local and cloud-based inferencing platforms - Design algorithms for real-world manufacturing problems in Intelligent Visual Inspection - Automate document analysis, knowledge extraction, and process optimization using Large Language Models - Collaborate with cross-functional teams to integrate ML applications and improve operational efficiency Minimum Qualifications: - Experience with deep learning models such as CNNs, Vision Transformers, or YOLO for image-based tasks in production systems - Proven research and practical experience in developing algorithms for image processing, object detection, segmentation, and tracking - Masters in computer science, Machine Learning, or related field with 3+ years of industry experience - Experience deploying ML models in cloud environments for scalable production use Preferred Qualifications: - Strong grasp of deep learning principles in Computer Vision and Natural Language Processing - Familiarity with LLM architectures like BERT, GPT, and experience fine-tuning these models - Knowledge of machine learning and Deep Learning libraries such as PyTorch, OpenCV, Hugging Face - Experience with version control systems like Git and handling large datasets If you are passionate about influencing the quality, speed, and efficiency of ML algorithms and want to contribute to creating refined products, join us at Apple as a key contributor in developing machine learning solutions for diverse tasks and projects. Your expertise will help drive innovation and excellence in a fast-paced environment.,

Posted 5 days ago

Apply

3.0 - 7.0 years

0 Lacs

hyderabad, telangana

On-site

As a Data Scientist specializing in Deep Learning, Natural Language Processing (NLP), Generative AI, and Computer Vision, you will play a crucial role in our team based in Hyderabad, India. Your primary responsibility will be to create, develop, and implement advanced AI models that address real-world challenges and foster innovation across our product offerings. The ideal candidate for this role should hold a Bachelor's or Master's degree in Computer Science, Data Science, AI, or a related discipline (a Ph.D. is preferred). You should have a minimum of 3 years of practical experience in roles involving applied machine learning or data science. Your expertise should encompass deep learning frameworks like TensorFlow, PyTorch, or Keras, as well as hands-on familiarity with NLP libraries such as Hugging Face Transformers, spaCy, and NLTK. Additionally, you should possess proven skills in working with computer vision tools like OpenCV, YOLO, CNNs, and Vision Transformers. Exposure to Generative AI models such as GPT, DALLE, or Stable Diffusion will be beneficial. Proficiency in Python programming language along with knowledge of essential libraries like NumPy, Pandas, and Scikit-learn is required. Experience with cloud platforms like AWS, GCP, Azure, and MLOps tools will be an advantage in this role. The ability to solve complex problems efficiently and function effectively in a dynamic work environment are essential attributes we are seeking in potential candidates. If you are enthusiastic about leveraging your skills and experience to drive AI innovation and contribute to solving real-world problems, we encourage you to share your resume with us at giribabu@pranathiss.com.,

Posted 1 week ago

Apply

10.0 - 14.0 years

0 Lacs

haryana

On-site

As a Lead Data Scientist/AI Product Owner at a reputed IT MNC in Gurgaon, your primary responsibility will be AI Model Development & Deployment, focusing on Computer Vision & Deep Learning. You will be leading the design and implementation of computer vision models for various tasks such as object detection, tracking, segmentation, and action recognition. Your expertise will include architectures like YOLO (v4, v5, v8), Vision Transformers, Mask R-CNN, Faster R-CNN, LSTMs, and Spatio-Temporal Models for image and video analysis. Additionally, you will contextualize the model for challenging situations such as poor detection through specific training on different scenarios. Another key aspect of your role will involve Reinforcement Learning & Model Explainability, where you will develop and integrate reinforcement learning models to optimize decision-making in dynamic AI environments. You will work with techniques such as Deep Q-Networks (DQN), Proximal Policy Optimization (PPO), A3C, SAC, and other RL methods. Furthermore, you will lead the development of a Scalable Data Model for AI Model Training Pipelines, overseeing the end-to-end data preparation process, including data gathering, annotation, and quality review before training. You will focus on enhancing the quality and volume of training data to continually improve model performance and have the ability to think through the data model for future enhancements. In terms of technical skills, you are expected to have a proven track record of leading and delivering AI products, hands-on expertise with advanced AI models, experience in managing and mentoring a small team of data scientists, proficiency in AI/ML frameworks like TensorFlow, PyTorch, and Keras, effective collaboration with cross-functional teams, and strong problem-solving skills in image and video processing. Nice-to-have skills include Experimentation, Iterative Improvement and Testing, Collaboration with Engineering and Product Teams, and Team Leadership. To qualify for this role, you should have at least 10 years of hands-on experience in AI with a solid background in object detection, image and video processing, and computer vision.,

Posted 1 month ago

Apply

1.0 - 3.0 years

8 - 12 Lacs

Bengaluru

Work from Office

computer vision or deep learning roles industrial/safety inspection datasets (e.g., PPE detection, visual defect classification). Familiarity with MLOps tools like MLflow, DVC, or ClearML. ONNX, TensorRT, OpenVINO

Posted 1 month ago

Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

As a Senior AI Engineer at Avathon, you will be part of a cutting-edge team revolutionizing industrial AI by developing groundbreaking solutions that shape the future. Your role will involve designing, training, and deploying computer vision models using frameworks like TensorFlow, PyTorch, or ONNX to harness the full potential of operational data. You will utilize your expertise in model optimization techniques such as quantization, pruning, distillation, and structured sparsity to enhance performance on edge devices and low-power hardware. Hands-on experience with state-of-the-art architectures like YOLO, Faster R-CNN, and Vision Transformers will be essential for optimizing models for deployment in industrial environments. Your strong understanding of image preprocessing, feature extraction, traditional computer vision techniques, and end-to-end model pipelines will enable you to create real-time virtual replicas of physical assets for predictive maintenance, performance simulation, and operational optimization. Proficiency in Python and C++ for developing AI solutions, along with experience in parallel processing and hardware-aware optimizations, will be key in driving AI-driven projects that have a meaningful impact across industries. Furthermore, your expertise in profiling and optimizing model inference speed, memory usage, and throughput for resource-constrained environments, as well as practical experience in deploying AI models on embedded systems and low-power hardware, will be crucial for anomaly detection, performance forecasting, and asset lifetime extension in industrial settings. Familiarity with MLOps practices, version control with Git, and collaborative workflows will ensure efficient management of AI workflows and seamless collaboration within cross-functional teams. Join Avathon in Bengaluru and thrive in a high-growth environment where agility, collaboration, and rapid professional growth are the norm. Make a difference by working on AI-driven projects that drive real change across industries and improve lives. If you are a forward-thinking AI Engineer with a passion for innovation and a drive to create scalable solutions in industrial AI, we invite you to be a part of our team and contribute to the revolutionizing of industrial AI.,

Posted 1 month ago

Apply

3.0 - 6.0 years

3 - 8 Lacs

Bengaluru

Remote

Immediate opening for "Computer Vision Engineer" Location: Remote Experience Required: 35years Role Snapshot Quantaleap is looking for a handson Computer Vision Engineer who can design, train, and deploy deeplearning models that power realtime image and video analytics. You’ll collaborate with product, data, and backend teams—so clear, proactive communication is essential. Key Responsibilities Research & prototype Computer Vision algorithms (anomaly detection, object detection, tracking, segmentation). Create end to end image processing and computer vision pipelines. Train, finetune, and optimise CNN/Transformer models in PyTorch or TensorFlow. Package and deploy models to cloud or edge environments with Docker/Kubernetes. Automate data pipelines for image/video collection, augmentation, and annotation. Monitor model performance in production; drive continuous improvement. Document findings and present results to technical and nontechnical stakeholders. MustHave Skills 3–5yrs building computervision solutions in production. Strong grasp of Computer Vision and Deep Learning concepts (CNNs, Vision Transformers like YOLO). Proficiency with PyTorch/TensorFlow , OpenCV , and Python . Experience deploying models via REST/GRPC APIs or ondevice. Excellent written and verbal communication ; able to explain complex ideas simply. NicetoHave Knowledge of CUDA , TensorRT, or other inferencetime optimisers. Experience with AWS/GCP ML services, streaming data (Kafka, Kinesis). Experience with Agentic AI implementation Familiarity with MLOps best practices (CI/CD, model versioning).

Posted 1 month ago

Apply

3.0 - 7.0 years

0 Lacs

karnataka

On-site

You will be joining a pioneering AI team where you will be responsible for designing and deploying cutting-edge deep learning solutions for computer vision and audio analysis. Your main tasks will include designing, developing, and optimizing deep learning models for image/video analysis (object detection, segmentation) and audio classification tasks. You will work with CNN architectures, Vision Transformers (ViT, Swin), and attention mechanisms (SE, CBAM, self/cross-attention) to address complex real-world challenges. In this role, you will process multi-modal data including video and audio. For video, you will apply spatiotemporal modeling (3D CNNs, temporal attention), while for audio, you will extract features (spectrograms, MFCCs) and build classification pipelines. You will also utilize pretrained models through transfer learning and multi-task learning frameworks, and optimize models for accuracy, speed, and robustness using PyTorch/TensorFlow. Collaboration with MLOps teams to deploy solutions into production is a key aspect of this role. Your required skills include advanced programming in Python (PyTorch/TensorFlow), expertise in computer vision concepts such as Vision Transformers, object detection techniques (YOLO, SSD, Faster R-CNN, DETR), and video analysis methods including temporal modeling. Additionally, you should have experience in audio processing, attention mechanisms, transfer learning, and training strategies. Experience in handling large-scale datasets and building data pipelines is also essential. Preferred qualifications for this role include exposure to multi-modal learning, familiarity with R for statistical analysis, and a background in publications or projects related to computer vision or machine learning conferences such as CVPR, NeurIPS, ICML. Please note that this position is for a client of Hubnex Labs, and selected candidates will work directly with the client's AI team while representing Hubnex.,

Posted 1 month ago

Apply

2.0 - 6.0 years

0 Lacs

karnataka

On-site

We are seeking a highly motivated Deep Learning Engineer with specialized expertise in computer vision and audio analysis. As a part of our team developing AI-driven solutions utilizing multi-modal deep learning, you will play a crucial role in designing and implementing deep learning models for various tasks such as image, video, object detection, and audio classification. Your responsibilities will include integrating attention mechanisms into model architectures, utilizing pretrained models for transfer learning, and working with video data using spatiotemporal modeling techniques. Additionally, you will be responsible for extracting and processing features from audio and evaluating and optimizing models for speed, accuracy, and robustness. Collaboration across teams to deploy models into production will also be a key aspect of this role. The ideal candidate should have strong programming skills in Python, proficiency in PyTorch or TensorFlow, and hands-on experience with CNNs, pretrained networks, and attention modules. A solid understanding of Vision Transformers, recent architectures, and attention mechanisms is essential. Experience in implementing and training object detection models, video analysis, temporal modeling, and audio classification workflows is highly desired. Moreover, familiarity with handling large-scale datasets, designing data pipelines, and training strategies for deep models will be beneficial for this position. If you are passionate about deep learning, possess the required skills and experience, and are eager to contribute to cutting-edge AI solutions, we encourage you to apply for this position. Join us at CureBay and be a part of our dynamic team dedicated to pushing the boundaries of technology and innovation.,

Posted 1 month ago

Apply

0.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Ready to shape the future of work At Genpact, we don&rsquot just adapt to change&mdashwe drive it. AI and digital innovation are redefining industries, and we&rsquore leading the charge. Genpact&rsquos , our industry-first accelerator, is an example of how we&rsquore scaling advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to , our breakthrough solutions tackle companies most complex challenges. If you thrive in a fast-moving, tech-driven environment, love solving real-world problems, and want to be part of a team that&rsquos shaping the future, this is your moment. Genpact (NYSE: G) is an advanced technology services and solutions company that delivers lasting value for leading enterprises globally. Through our deep business knowledge, operational excellence, and cutting-edge solutions - we help companies across industries get ahead and stay ahead. Powered by curiosity, courage, and innovation , our teams implement data, technology, and AI to create tomorrow, today. Get to know us at and on , , , and . Inviting applications for the role of Senior Principal Consultant - Data Scientists with Computer vision experience! We are seeking a tenured and highly skilled Data Scientist with deep expertise in Computer Vision and a strong foundation in AI/ML modeling. The ideal candidate will not only lead the development of intelligent vision systems but will also serve as a technical mentor, providing guidance to junior data scientists on model selection, optimization, and deployment strategies. Experience in domains such as energy, power generation, industrial equipment, or manufacturing will be considered a strong advantage, as the role involves solving real-world visual AI/ML problems in industrial environments. Key Responsibilities: Lead CV Projects: Design and deliver Computer Vision models across a range of use cases (e.g., anomaly detection, visual inspections, OCR, predictive maintenance). Model Development: Develop, evaluate, and optimize state-of-the-art AI/ML models (e.g., CNNs, Vision Transformers, YOLO, Faster R-CNN, etc.). Mentorship: Guide junior and mid-level data scientists on best practices in feature engineering, model selection, evaluation metrics, and problem-solving strategies. Domain Translation: Translate complex industrial problems into AI-driven CV solutions that can scale in production environments. Collaboration: Work closely with software engineers, MLOps , and business teams to ensure model integration and operational success. Code Quality & Experimentation: Drive code modularity, reproducibility, and experimentation through use of ML pipelines, version control, and testing. Innovation & Research: Stay current with latest CV and AI/ML advancements and apply them appropriately to business problems. Stakeholder Communication: Present insights, models, and outcomes in a clear and impactful way to both technical and non-technical stakeholders. Qualifications we seek in you! Minimum Qualifications Master&rsquos or PhD in Computer Science, Machine Learning, AI, Electrical Engineering, or a related field. industry experience in building and deploying machine learning models, with a strong portfolio in Computer Vision. Deep expertise in ML frameworks and CV libraries such as PyTorch , TensorFlow, OpenCV, Detectron2, MMDetection , etc. Solid understanding of core AI/ML algorithms - classification, regression, segmentation, object detection, time-series, clustering, etc. Experience with MLOps tools (e.g., MLflow , DVC, Kubeflow) and cloud platforms (AWS/GCP/Azure). Strong communication , leadership, and team collaboration skills . Preferred Qualifications: Prior experience in domains such as energy, utilities, power generation, or industrial systems is highly preferred. Experience deploying CV models within real-time environments. Contributions to open-source CV projects or published research. Proficient in statistical modelling, machine learning techniques, AI algorithms, and generative model development using large language models such as GPT-3, BERT, or similar frameworks like RAG, Knowledge Graphs etc. Lead the development of CI/CD pipelines and standardize deployment frameworks. Strong Python programming skills. Why join Genpact Be a transformation leader - Work at the cutting edge of AI, automation, and digital innovation Make an impact - Drive change for global enterprises and solve business challenges that matter Accelerate your career - Get hands-on experience, mentorship, and continuous learning opportunities Work with the best - Join 140,000+ bold thinkers and problem-solvers who push boundaries every day Thrive in a values-driven culture - Our courage, curiosity, and incisiveness - built on a foundation of integrity and inclusion - allow your ideas to fuel progress Come join the tech shapers and growth makers at Genpact and take your career in the only direction that matters: Up. Let&rsquos build tomorrow together. Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color , religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation. Furthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a %27starter kit,%27 paying to apply, or purchasing equipment or training.

Posted 1 month ago

Apply

3.0 - 7.0 years

0 Lacs

hyderabad, telangana

On-site

You will be joining our innovative EdTech team as an experienced AI/ML Content Developer. Your primary responsibility will be to create cutting-edge educational content in the field of artificial intelligence and machine learning. You will play a key role in developing comprehensive courses that cover a wide range of AI/ML domains, starting from foundational machine learning concepts to advanced generative AI and agentic AI systems. This position presents an exciting opportunity for you to influence how the next generation of AI practitioners learn and apply these transformative technologies. In this role, your tasks will include designing and developing comprehensive courses across AI/ML domains such as machine learning, natural language processing, computer vision, generative AI, and agentic AI systems. You will create diverse content formats, including hands-on labs, interactive exercises, assessments, and project-based learning modules. Additionally, you will structure learning paths that logically progress from foundational concepts to advanced AI applications. You will also develop practical coding exercises and real-world implementation scenarios using industry-standard tools and frameworks. Ensuring the quality and industry relevance of the content will be a crucial aspect of your role. You will need to ensure that all content reflects current AI/ML best practices and stays aligned with rapidly evolving industry trends. Creating practical, real-world scenarios that simulate actual AI development workflows and challenges will be part of your responsibilities. You will also collaborate with our instructional design team to optimize learning outcomes for complex technical concepts. Furthermore, you will work closely with our platform team to ensure the seamless delivery of AI/ML content and interactive experiences. Continuous iteration and improvement of existing content based on learner feedback and performance metrics will be essential. Staying updated with the latest AI research, tools, and educational methodologies, as well as integrating emerging AI technologies and frameworks into the curriculum as they become industry-relevant, will also be part of your role. For this position, we are looking for candidates with 3+ years of hands-on industry experience in machine learning, AI development, or related technical roles. Essential qualifications include domain expertise in machine learning algorithms, deep learning architectures, and AI system design. Proficiency in Python and core data science libraries, experience with machine learning frameworks like PyTorch and TensorFlow, and expertise in natural language processing techniques are also required. Additionally, strong communication skills, the ability to explain complex AI concepts clearly, and experience in creating technical AI/ML content are essential for this role. Preferred qualifications include an advanced degree (Master's/PhD) in Computer Science, Machine Learning, Data Science, or a related field, published research in AI/ML conferences or journals, experience with reinforcement learning and multi-agent systems, familiarity with AI safety, ethics, and responsible AI development practices, and knowledge of emerging AI architectures and research trends. Background in technical training, workshops, or mentoring in AI/ML domains and an understanding of accessibility standards in educational content are also desirable.,

Posted 1 month ago

Apply

3.0 - 7.0 years

0 Lacs

hyderabad, telangana

On-site

You are an experienced AI/ML Content Developer who will be joining an innovative EdTech team. Your primary responsibility will involve creating cutting-edge educational content in artificial intelligence and machine learning. This includes developing comprehensive courses covering various AI/ML domains such as machine learning, natural language processing, computer vision, generative AI, and agentic AI systems. Your role will be pivotal in shaping how the next generation of AI practitioners learn and apply these transformative technologies. Your key responsibilities will include designing and developing comprehensive courses, content creation in various formats like hands-on labs, interactive exercises, assessments, and project-based learning modules. You will also be structuring learning paths logically from foundational concepts to advanced AI applications. Practical coding exercises and real-world implementation scenarios using industry-standard tools and frameworks will be a significant part of your role. Ensuring that all content reflects current AI/ML best practices and industry trends will be crucial. You will be creating practical, real-world scenarios that replicate AI development workflows and challenges. Moreover, developing hands-on projects involving model training, fine-tuning, and deployment to provide learners with portfolio-worthy experience will be essential. Collaboration with the instructional design team to optimize learning outcomes for complex technical concepts is also expected. You will work closely with the platform team to ensure the seamless delivery of AI/ML content and interactive experiences. Continuous iteration and improvement of existing content based on learner feedback and performance metrics will be part of your responsibilities. Staying current with the latest AI research, tools, and educational methodologies is necessary. Additionally, integrating emerging AI technologies and frameworks into the curriculum as they become industry-relevant is also expected. To qualify for this role, you should have at least 3+ years of hands-on industry experience in machine learning, AI development, or related technical roles. Domain expertise in machine learning algorithms, deep learning architectures, and AI system design is required. Proficiency in Python and core data science libraries, experience with machine learning frameworks like PyTorch and/or TensorFlow, and expertise in natural language processing techniques are necessary technical skills. Strong communication skills, the ability to explain complex AI concepts clearly, and experience in creating compelling narratives around AI applications and use cases are also essential. Preferred qualifications include an advanced degree (Master's/PhD) in Computer Science, Machine Learning, Data Science, or related fields, published research in AI/ML conferences or journals, experience with reinforcement learning and multi-agent systems, familiarity with AI safety, ethics, and responsible AI development practices, and knowledge of emerging AI architectures and research trends. Background in technical training, workshops, or mentoring in AI/ML domains and understanding of accessibility standards in educational content are also desirable.,

Posted 1 month ago

Apply

0.0 years

5 Lacs

Bengaluru, Karnataka, India

On-site

Job Description: Computer Vision modeler who will develop real time computer vision models like Python Pytorch tensorflow CV ML model development Object recognition Detection NVIDIA technologies MLOps Key Responsibilities: Should have the hands on experience to train computer vision and deep learning models to solve computer vision use cases Should be able to create and optimize algorithms for image and video analysis for tasks such as object detection image recognition segmentation and video analytics Should have expertise on different object tracking algorithms and track multi objects in multi cameras Should be able to articulate camera calibration and region of interest extraction Should have experience working on a large scale implementation with 100 stream in real time Fine tune deep learning models like CNNs vision transformers using frameworks such as Tensorflow PyTorch onnx etc Should have used NVIDIA frameworks preferably like NGC models TAO toolkit metropolis Deep stream triton server and triton server to create inferencing pipeline and its deployment Should have knowledge of optimizing inferencing pipelines to work on edge Should be able to perform necessary pre and post processing to optimize the computer vision models and evaluate it Should have experience working on action recognition with temporal analysis Should have knowledge on pruning the model for deploying on edge ARM devices Collaborate with Business and IT teams Preferred Skills: Technology->Artificial Intelligence->Computer Vision,Technology->Machine Learning->Python

Posted 1 month ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies