On-site
Part Time
Design, implement, and maintain operators and kernels within the Blaize SDK and perlib.
Optimize operator performance and improve execution efficiency across workloads.
Enable and support new features and performance improvements for next-generation Blaize chips.
Collaborate with cross-functional teams including hardware, compiler, and ML engineers.
Analyze performance bottlenecks and implement optimizations at the graph and kernel levels.
Education & Experience
Bachelor’s or Master's degree in computer science or a related field.
5–8 years of hands-on software engineering experience, preferably in performance-critical systems.
Strong proficiency in C and C++, including extensive use of STL libraries.
Solid experience with ONNX and/or PyTorch operators, including graph and node-level optimization.
Experience writing parallel kernels for GPUs or similar accelerator architectures.
Understanding of performance optimization techniques for compute-intensive workloads.
Basic knowledge of machine learning networks and large language models (LLMs) is a plus.
C, C++
Data Structures and Algorithms
STL Libraries
ONNX / PyTorch Operators
Graph and Node Optimization
BLAIZE
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
bengaluru
10.0 - 20.0 Lacs P.A.
bengaluru
12.0 - 22.0 Lacs P.A.
chennai
5.0 - 9.0 Lacs P.A.
gurugram
6.0 - 10.0 Lacs P.A.
7.0 - 12.0 Lacs P.A.
gurugram
5.0 - 9.0 Lacs P.A.
8.0 - 13.0 Lacs P.A.
bengaluru
7.0 - 12.0 Lacs P.A.
hyderabad
6.96 - 7.92 Lacs P.A.
6.0 - 10.0 Lacs P.A.