Cambricon’s MLU chips aren’t just processors — they’re China’s answer to scalable, efficient AI acceleration across cloud, edge, and data center environments.
This instructor-led training guides engineers and AI developers through the Cambricon stack: from deep learning model deployment to performance optimization on MLU hardware.
Courses are delivered either as online live training via interactive remote desktop, or onsite in Graz, where hands-on labs mirror the AI challenges Cambricon is built to solve.
Whether you're scaling up an AI lab or future-proofing a data center team, onsite sessions can take place at your facility in Graz or in a NobleProg training center designed for immersive technical learning.
Also referred to as Cambricon AI, MLU accelerator, or Machine Learning Unit, this training supports teams building AI infrastructure beyond the conventional GPU path.
NobleProg – Your Local Training Provider
NobleProg Graz
Waagner-Biro-Strasse 47, Graz, Austria, 8020
Overview
Our training facilities are located at Waagner-Biro-Strasse 47 in Graz. Our spacious training rooms are located directly in the old town and offer optimal training conditions for your needs.
Directions
The NobleProg training facilities are best reached via the A9 motorway and the federal highway 67.
Parking spaces
There are parking spaces in the streets around our training rooms as well as the ContiPark multi-storey car park.
Local infrastructure
There are numerous restaurants in the downtown area and hotels are also within walking distance.
Ascend, Biren, and Cambricon are leading AI hardware platforms in China, each offering unique acceleration and profiling tools for production-scale AI workloads.
This instructor-led, live training (online or onsite) is aimed at advanced-level AI infrastructure and performance engineers who wish to optimize model inference and training workflows across multiple Chinese AI chip platforms.
By the end of this training, participants will be able to:
Benchmark models on Ascend, Biren, and Cambricon platforms.
Identify system bottlenecks and memory/compute inefficiencies.
Apply graph-level, kernel-level, and operator-level optimizations.
Tune deployment pipelines to improve throughput and latency.
Format of the Course
Interactive lecture and discussion.
Hands-on use of profiling and optimization tools on each platform.
Guided exercises focused on practical tuning scenarios.
Course Customization Options
To request a customized training for this course based on your performance environment or model type, please contact us to arrange.
Chinese GPU architectures such as Huawei Ascend, Biren, and Cambricon MLUs offer CUDA alternatives tailored for local AI and HPC markets.
This instructor-led, live training (online or onsite) is aimed at advanced-level GPU programmers and infrastructure specialists who wish to migrate and optimize existing CUDA applications for deployment on Chinese hardware platforms.
By the end of this training, participants will be able to:
Evaluate compatibility of existing CUDA workloads with Chinese chip alternatives.
Port CUDA codebases to Huawei CANN, Biren SDK, and Cambricon BANGPy environments.
Compare performance and identify optimization points across platforms.
Address practical challenges in cross-architecture support and deployment.
Format of the Course
Interactive lecture and discussion.
Hands-on code translation and performance comparison labs.
Guided exercises focused on multi-GPU adaptation strategies.
Course Customization Options
To request a customized training for this course based on your platform or CUDA project, please contact us to arrange.
Cambricon MLUs (Machine Learning Units) are specialized AI chips optimized for inference and training in edge and datacenter scenarios.
This instructor-led, live training (online or onsite) is aimed at intermediate-level developers who wish to build and deploy AI models using the BANGPy framework and Neuware SDK on Cambricon MLU hardware.
By the end of this training, participants will be able to:
Set up and configure the BANGPy and Neuware development environments.
Develop and optimize Python- and C++-based models for Cambricon MLUs.
Deploy models to edge and data center devices running Neuware runtime.
Integrate ML workflows with MLU-specific acceleration features.
Format of the Course
Interactive lecture and discussion.
Hands-on use of BANGPy and Neuware for development and deployment.
Guided exercises focused on optimization, integration, and testing.
Course Customization Options
To request a customized training for this course based on your Cambricon device model or use case, please contact us to arrange.
Online Cambricon AI training in Graz, Cambricon (MLU) training courses in Graz, Weekend Machine Learning Unit courses in Graz, Evening Cambricon AI training in Graz, MLU accelerator instructor-led in Graz, Cambricon AI one on one training in Graz, Weekend MLU accelerator training in Graz, Machine Learning Unit private courses in Graz, Machine Learning Unit boot camp in Graz, Cambricon (MLU) trainer in Graz, MLU accelerator classes in Graz, Machine Learning Unit instructor in Graz, Machine Learning Unit coaching in Graz, Evening Machine Learning Unit courses in Graz, Cambricon AI on-site in Graz, Online MLU accelerator training in Graz, MLU accelerator instructor-led in Graz