Home About Blogs Projects Moisture Meter
Teaching Roles Workshops Talks
Contact
Complete

Matmul Deepdive on ESP32-S3

Does the $3 AI chip actually do AI?

A ground-up benchmarking study of matrix multiplication on the ESP32-S3. Four implementations built from scratch — naive, compiler-optimized, cache-blocked, and PIE SIMD — measured with cycle-accurate timing. The short answer: yes, but only up to a point, and the reason why is more interesting than the headline number.

ESP32-S3 C SIMD Cache Benchmarking

More coming

Next experiment in progress

Something is being built. Check back soon.