Embedded platforms, TinyML prototypes, and hardware experiments. Each one started with a question I couldn't answer without building something first.
Does the $3 AI chip actually do AI?
A ground-up benchmarking study of matrix multiplication on the ESP32-S3. Four implementations built from scratch — naive, compiler-optimized, cache-blocked, and PIE SIMD — measured with cycle-accurate timing. The short answer: yes, but only up to a point, and the reason why is more interesting than the headline number.
Next experiment in progress
Something is being built. Check back soon.