My Blog

December 10, 2024

How to make Matrix Multiplication fast on CPU (Part 2 with SIMD)

December 9, 2024

How to make Matrix Multiplication fast on CPU (Part 1)

November 20, 2024

How I setup a fresh machine for development

November 15, 2024

How to build your own PyTorch Compiler

October 1, 2024

The rabbit hole of C++ templates