idle intelligence
SmolLM2-360M-Instruct (Q4_K_M, ~271 MB) running client-side via WebAssembly using wllama
github.com/idle-intelligence/llm-web
Everything runs on this device.