Optimal Formats and the Cube Root of the PDF
Your boss emails you a point in 128-billion-dimensional space. “Llama 3.1 8B,” the message reads. “A not-so-large language model in bfloat16. But it’s too bi...
Your boss emails you a point in 128-billion-dimensional space. “Llama 3.1 8B,” the message reads. “A not-so-large language model in bfloat16. But it’s too bi...
Hurtling past the NeurIPS submission deadline into the summer months, we switch from huddling around server rooms to keep warm to babysitting experiments whi...
April has been a busy month for the AI research community, with ICLR (the first of the “big three” AI conferences of the year) taking place in Singapore. We’...
We’ve enjoyed March, bringing improving weather and many excellent ML papers to keep us busy. As usual, we’re here to share summaries of four of our favourit...
Welcome to Papers of the Month! This time around, our monthly selection of ML papers revolves around the central theme of scale – and learning how to scale e...