<aside> ❗ Headings and text with “▶” can be toggled to expand contained info. You can also see the Table of Contents as a collapsible sidebar on the right →

</aside>

Table of Contents

Introduction

In this module, students will be introduced to the steps of designing ML accelerators with GPUs. It will provide an opportunity for understanding the relationship between ML accelerators and GPUs, uncovering how GPUs play a vital role in accelerating machine learning tasks.

Objectives

Be able to understand the basic kernels of ML accelerators for GPUs.

Readings

Required

Optional

Notes / Things that stand out

GPU and ML

Why are GPUs Good for ML?

ML workloads have a high number of floating point operations.
They are also high data parallel operations.
GPUs provide dense floating point operations.
GPUs have high memory bandwidth (e.g. GDDR, HBM), and can make better use of them.
GPUs can easily employ flexible data format standards (e.g. TF, BF, INT4 etc).
ML workloads are statistical computing based computations, which works well with GPU’s underlying execution models.

DNN Operation Categories

Element-wise operations.
- e.g. Activation operations.