Recent Posts

Optimal Formats and the Cube Root of the PDF

9 minute read

Your boss emails you a point in 128-billion-dimensional space. “Llama 3.1 8B,” the message reads. “A not-so-large language model in bfloat16. But it’s too bi...