#260: Digital Archeology: The Primitive Power of GPT-1
Revisit the 2018 model that started it all. Herman and Corn dive into GPT-1's romance-novel roots and its 117-million-parameter legacy.
gpt-1absolute-positional-embeddingsunsupervised-pre-training