הטכניון - מכון טכנולוגי לישראל Technion - Israel Institute of Technology Технион - израильский технологический институт ألتخنيون - معهد تكنولوجي لإسرائيل

02360509 - נושאים מתקדמים במבנה מחשבים 02360509 - Advanced topics in hardware accelerators for deep learning 02360509 - Advanced topics in hardware accelerators for deep learning 02360509 - Advanced topics in hardware accelerators for deep learning

אביב 2023-2024Spring 2023-2024Весна 2023-2024ربيع 2023-2024

מידע כללי General Information Общая Информация معلومات عامة

When: Tuesday, lecture 16:30 – 18:30, Tutorial 18:30 – 19:30
Where: Taub 4 + zoom (*)
Credit: 3 academic points
Format: Frontal presentations + students’ presentations
Grade: Based on assignments and final project (see below)
Prerequisites: Deep learning course 236781 or a similar one, Computer Architecture course 236267 or a similar one

Attendance: Students are required to attend, however zoom attendance and recordings will be provided in special cases.

Syllabus:

This graduate-level course aims to allow students to experience the relationship between
deep-learning algorithms and the hardware on which they are deployed. The course
discusses theoretical, practical, and research aspects, and aims to encourage students to
pursue future research and projects in these directions.

Description:
The course will focus on the relationship between deep-learning algorithms and the hardware on
which they are deployed. We will study how different types of accelerators impact performance
efficiency and power. Students will gain hands-on experience with tools and algorithms on current
hardware solutions. Some classes may be given by industrial experts.
In the scope of this course, we will discuss various approaches to acceleration of deep-learning
algorithms and their implementations. Among other topics, we will discuss:

CPU-based accelerators
GPU-based accelerators
FPGA and ASIC accelerators
Support for low power
CUDA kernels
Dataflow and memory management
Binarization and quantization
parallelization on multiple accelerators

Course assignments and Grade distribution:

Practical Homework assignments (40% total): submitting 2-3 code-based homework
assignments, in the scopes of CUDA kernels, quantization of DNNs and FPGA accelerator.
Final project (60% total): Each student will freely choose a topic for the final project based on
a paper relevant to the course, and correspondingly fulfill the following assignments under the
guidance of the course staff:

Presenting the original paper and the project proposal (15%) - During class
Final Project presentation (15%) - At a date TBD, after the end of the semester
Submission of project’s report and code implementation (30%) – Will be submitted
along with the final project presentations
Submission of project’s poster (optional)