Optimize GEMM

My gemm optimization on RPi (ARM) achieved a 170x performance boost, showing speeds faster than Eigen and close to OpenBLAS.

- READ MORE -

Visualize web log with GoAccess

GoAccess is an open-source web log analyzer and interactive viewer that runs in a terminal or through your browser. It provides fast and valuable HTTP statistics for system administrators that require a visual server report on the fly.

- READ MORE -

Handcrafted CNN from scratch - convolutional layer

This repo is inspired by my undergrad homework. The training speed was improved by 60x after refactoring the code. This article aims to illustrate how I used dot product instead of loops to perform forward/backward computations in convolutional layers.

- READ MORE -