RoBERTa

Name: RoBERTa
Author: Meta

Specifications

Type: LLM
Access: Open weights
Released: July 26, 2019
Parameters: 355M
Context window: 512 tokens
Input: Text
Output: Text

Our take

RoBERTa showed that BERT was undertrained: with more data, longer training, and careful hyperparameter tuning, the same architecture performed substantially better. A lesson in the value of the training recipe over architectural novelty.

Related research

RoBERTa: A Robustly Optimized BERT Pretraining Approach (Model paper)

Compare RoBERTa with

Sonar Reasoning Pro, R1 1776, Qwen3.7 Max, Gemini 3.5 Flash