Posted:6 days ago|
Platform:
Work from Office
Full Time
Job Area: Engineering Group, Engineering Group > Software Engineering General Summary: JD for Cloud Machine Learning LLM Serving engineer Job Overview: The Qualcomm Cloud Computing team is developing hardware and software for Machine Learning solutions spanning the data center, edge, infrastructure, automotive market. We are seeking ambitious, bright, and innovative engineers with experience in machine learning framework development. Job activities span the whole product life cycle from early design to commercial deployment. The environment is fast-paced and requires cross-functional interaction daily so good communication, planning and execution skills are a must. Key Responsibilities Improve and optimize key Deep Learning models on Qualcomm AI 100. Build deep learning framework extensions for Qualcomm AI 100 in upstream open-source repositories. Implement Kernels for AI workloads Collaborate and interact with internal teams to analyze and optimize training and inference for deep learning. Build software tools and ecosystem around AI SW Stack. Work on vLLM, Triton, ExecuTorch, Inductor, TorchDynamo to build abstraction layers for inference accelerator. Optimize workloads for both scale-up (multi-SoC) and scale-out (multi-card) systems. Optimize the entire deep learning pipeline including graph compiler integration. Apply knowledge of software engineering best practices. Desirable Skills and Aptitudes Deep Learning experience or knowledge- LLMs, Natural Language Processing, Vision, Audio, Recommendation systems. Knowledge of the structure and function of different components of Pytorch, TensorFlow software stacks. Excellent C/C++/Python programming and software design skills, including debugging, performance analysis, and test design. Ability to work independently, define requirements and scope, and lead your own development effort. Well versed with open-source development practices. Strong developer with a research mindset- strives to innovate. Avid problem solver- should be able to find solutions to key engineering and domain problems. Knowledge of tiling and scheduling a Machine learning operator is a plus. Experience in using C++ 14 (advanced features) Experience of profiling software and optimization techniques Hands on experience writing SIMD and/or multi-threaded high-performance code is a plus. Experience of ML compiler, Auto-code generation (using MLIR) is a plus. Experiences to run workloads on large scale heterogeneous clusters is a plus. Hands-on experience with CUDA, CUDNN is a plus. Qualifications : Bachelor's / Masters/ PHD degree in Engineering, Machine learning/ AI, Information Systems, Computer Science, or related field. 2+ years Software Engineering or related work experience. 2+ years experience with Programming Language such as C++, Python. Minimum Qualifications: Bachelor's degree in Engineering, Information Systems, Computer Science, or related field.
Qualcomm
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
My Connections Qualcomm
13.0 - 18.0 Lacs P.A.
Bengaluru / Bangalore, Karnataka, India
Experience: Not specified
2.0 - 5.0 Lacs P.A.
13.0 - 18.0 Lacs P.A.
4.0 - 8.0 Lacs P.A.
Bengaluru
22.5 - 27.5 Lacs P.A.
Gurugram
7.0 - 11.0 Lacs P.A.
Gurugram
5.0 - 9.0 Lacs P.A.
6.0 - 10.0 Lacs P.A.
8.0 - 9.0 Lacs P.A.
Pune, Chennai, Bengaluru
10.0 - 14.0 Lacs P.A.