Recent Posts

Llama 3.2 Vision — A Deep Dive

13 minute read

Vision-Language Models (VLMs) allow LLMs to “see”, but how do they work? In this post, we’ll walk through the model changes needed to turn an LLM into a VLM ...

November Papers: An LLM Feast

18 minute read

This month we’ve got an all-LLM menu of papers for you, with summaries of four great works exploring many different aspects of crafting systems for LLM train...

Graphcore Research is hiring!

2 minute read

We are pleased to have announce we have open positions for Research Scientists and Engineers to join our team.

September Papers: Proper Conditioning

15 minute read

We’re pleased to share four papers from different domains: LLM self-correction, FP8 training, generative crystals and optimisation. They are united, somewhat...