Optimize GEMM

My GEMM optimization on a Raspberry Pi (ARM) achieved a 170x speedup, running faster than Eigen and close to OpenBLAS.

Visualize web logs with GoAccess

GoAccess is an open-source web log analyzer and interactive viewer that runs in a terminal or through your browser. It provides fast and valuable HTTP statistics for system administrators who need a visual server report on the fly.

Handcrafted CNN from scratch - convolutional layer

This project was inspired by an assignment in my university course. After numerous optimizations, runtime performance improved by nearly an order of magnitude, cutting training time from several hours to just a few minutes. This article shows how I used tensor dot products instead of for or while loops to perform the forward and backward computations in convolutional layers.
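
As a rough illustration of the idea (a minimal sketch, not the article's actual code), the forward pass of a convolutional layer can be written as a single tensor dot product. The sketch assumes NumPy 1.20 or later, stride 1, no padding, and inputs laid out as (batch, channels, height, width). The function names conv_forward_loops and conv_forward_tensordot are hypothetical and chosen only for this example.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def conv_forward_loops(x, w, b):
    """Reference version with explicit Python loops (slow).

    x: (N, C, H, W) input, w: (F, C, KH, KW) filters, b: (F,) biases.
    Assumes stride 1 and no padding.
    """
    n, c, h, wd = x.shape
    f, _, kh, kw = w.shape
    oh, ow = h - kh + 1, wd - kw + 1
    out = np.zeros((n, f, oh, ow))
    for i in range(oh):
        for j in range(ow):
            patch = x[:, :, i:i + kh, j:j + kw]  # (N, C, KH, KW)
            for k in range(f):
                # Multiply the patch by filter k and sum over C, KH, KW.
                out[:, k, i, j] = np.sum(patch * w[k], axis=(1, 2, 3)) + b[k]
    return out


def conv_forward_tensordot(x, w, b):
    """Same computation, with the loops replaced by one tensordot call."""
    kh, kw = w.shape[2:]
    # View of every KH x KW window: shape (N, C, OH, OW, KH, KW), no copy.
    windows = sliding_window_view(x, (kh, kw), axis=(2, 3))
    # Contract channels and both kernel axes: result is (N, OH, OW, F).
    out = np.tensordot(windows, w, axes=([1, 4, 5], [1, 2, 3]))
    # Move the filter axis next to the batch axis and add the bias.
    return out.transpose(0, 3, 1, 2) + b[None, :, None, None]


# Small check that both versions agree on random data.
rng = np.random.default_rng(0)
x = rng.standard_normal((2, 3, 8, 8))
w = rng.standard_normal((4, 3, 3, 3))
b = rng.standard_normal(4)
assert np.allclose(conv_forward_loops(x, w, b), conv_forward_tensordot(x, w, b))
```

The tensordot version hands the whole contraction to NumPy's compiled routines, which is where this kind of rewrite usually gets its speedup. The backward pass can be expressed with similar contractions over the same window view.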

Implement hash table in C with uthash library

This article briefly introduces how to create a hash table with uthash and how to perform common operations on it: adding, deleting, and looking up items, printing the table, and clearing it.
