Let's build GPT: from scratch, in code, spelled out

Andrej Karpathy · 2023 · 1 hr 57 min
Karpathy builds and trains a decoder-only Transformer from first principles, following the paper "Attention Is All You Need", and ends with the core of nanoGPT.

ai-ml, languages