MatAdd Autotiler

This example computes a Matrix Addition with different data types (float32, Fixed point 16 bits, float16). The code is generated by the autotiler to maximize the performance.

The Matrixes are initially stored in L2 and the code generated tiles and double buffers the data movement between L2 and L1. To check the results a sequential, non-optimized Matrix Add is computed by the FC core in the main code.

How to run

To run the code with Mat1 [64x64] and Mat2 [64x64] –> Out [64x64]:

mkdir build && cd build && cmake ../
make run -j

To change the size of the matrixes, you can modify the following lines in the CMakeLists.txt file:

list(APPEND MATADD_OPT_FLAG -DMAT_W=128 -DMAT_H=128)