AI
Hub
Search…
⌘K
Leaderboard
Models
Benchmarks
Research
Subscribe
Research Papers
The research that matters, distilled — search, filter by topic, sort, and group.
Topic
Organization
Newest
Group
107 results
Table
Cards
Title
Source
Topics
Date
gpt-oss-120b & gpt-oss-20b Model Card
OpenAI
Architecture
Training
Safety
Aug 11, 2025
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Z.ai (Zhipu AI)
Architecture
Training
Reinforcement Learning
Aug 8, 2025
Kimi K2: Open Agentic Intelligence
Moonshot AI
Architecture
Training
Reinforcement Learning
Jul 28, 2025
Group Sequence Policy Optimization
Alibaba (Qwen Team)
Reinforcement Learning
Training
Jul 24, 2025
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation
KAIST AI / Google DeepMind / Mila
Architecture
Jul 14, 2025
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
Google DeepMind
Architecture
Training
Evaluation
Jul 8, 2025
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
MiniMax AI
Architecture
Reinforcement Learning
Evaluation
Jun 16, 2025
Qwen3 Technical Report
Alibaba (Qwen Team)
Architecture
Training
Reinforcement Learning
May 14, 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek
Reinforcement Learning
Training
Jan 22, 2025
DeepSeek-V3 Technical Report
DeepSeek
Architecture
Training
Dec 27, 2024
The Llama 3 Herd of Models
Meta
Architecture
Training
Jul 31, 2024
Qwen2 Technical Report
Alibaba
Training
Jul 15, 2024
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Model
DeepSeek
Architecture
Training
May 7, 2024
Phi-3 Technical Report
Microsoft
Training
Apr 22, 2024
Gemma: Open Models Based on Gemini Research and Technology
Google
Training
Mar 13, 2024
Gemini 1.5: Unlocking Multimodal Understanding Across Millions of Tokens of Context
Google
Architecture
Evaluation
Mar 8, 2024
Mixtral of Experts
Mistral AI
Architecture
Jan 8, 2024
Gemini: A Family of Highly Capable Multimodal Models
Google
Architecture
Evaluation
Dec 19, 2023
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Carnegie Mellon
Architecture
Dec 1, 2023
Mistral 7B
Mistral AI
Architecture
Oct 10, 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Meta
Training
Safety
Jul 18, 2023
Textbooks Are All You Need
Microsoft
Training
Jun 20, 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Stanford
Reinforcement Learning
Training
May 29, 2023
QLoRA: Efficient Finetuning of Quantized LLMs
University of Washington
Training
May 23, 2023
1
2
3
…
5