Graphcore Research Blog

Graphcore Research

Our mission is to advance AI research and characterise the computational requirements of machine intelligence.

April Papers: TriForce, QuaRot & Mixture-of-Depths

13 minute read

For our April selection of AI research papers, there is a clear common thread: efficient LLM inference. But as it happens, ML researchers are showing there a...

A transformer walk-through, with Gemma

36 minute read

Transformer-based LLMs seem mysterious, but they don’t need to. In this post, we’ll walk through a modern transformer LLM, Google’s Gemma, providing bare-bon...

March Papers: Low-Rank Galore & 1.58-Bit Weights

17 minute read

March was a fruitful month for AI research, with plenty of papers for us to choose from. A trend in the work we’ve selected is the pushing of previously publ...

February Papers: Longer RoPEs & Better Quantisation

17 minute read

Improving LLM inference is a key research topic at the moment, and something we’re particularly interested in at Graphcore because of its hardware implicatio...

January Papers: Great Teachers & Beyond Chinchilla

15 minute read

For the research community, 2023 was dominated by large transformers and the associated challenges with training, tuning and deploying them. This trend has c...