Training
OPT: Open Pre-trained Transformer Language Models
Meta·May 2, 2022
Susan Zhang, Stephen Roller, Naman Goyal
View on arXivTL;DR
An open replication of GPT-3 scale, released with weights and a candid training logbook.
Why it matters
An early, transparent push to open up frontier-scale model research.