AI Hub

Timeline

A chronological history of AI: model releases and the research behind them, newest first.

767 entries

2026 · 126 models · 20 papers

  1. MiniCPM5-1B
    LLMOpenBMB26.9
  2. Tokenisation via Convex Relaxations
    Paper
  3. Integrable Elasticity via Neural Demand Potentials
    Paper
  4. Vector Policy Optimization: Training for Diversity Improves Test-Time Search
    Paper
  5. Remember to be Curious: Episodic Context and Persistent Worlds for 3D Exploration
    Paper
  6. The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation Learning
    Paper
  7. Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting Models
    Paper
  8. MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems
    Paper
  9. Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention
    Paper
  10. LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems
    Paper
  11. Evaluating Commercial AI Chatbots as News Intermediaries
    Paper
  12. DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback
    Paper
  13. FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection
    Paper
  14. SDPM: Survival Diffusion Probabilistic Model for Continuous-Time Survival Analysis
    Paper
  15. MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking Data
    Paper
  16. CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead Adaptation
    Paper
  17. Deep Reinforcement Learning for Flexible Job Shop Scheduling with Random Job Arrivals
    Paper
  18. Reducing Political Manipulation with Consistency Training
    Paper
  19. Understanding Data Temporality Impact on Large Language Models Pre-training
    Paper
  20. Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation
    Paper
  21. Advancing Mathematics Research with AI-Driven Formal Proof Search
    Paper
  22. Qwen3.7 Max
    LLMAlibaba92.3
  23. Grok Build 0.1
    MultimodalxAI
  24. Gemini 3.5 Flash
    MultimodalGoogle92.2
  25. JT-35B-Flash
    LLMChina Mobile82.9
  26. Perceptron Mk1
    MultimodalPerceptron
  27. MiniCPM-V 4.6 1.3B
    LLMOpenBMB30.5
  28. Ring-2.6-1T
    LLMInclusionAI85.7
  29. Gemini 3.1 Flash Lite
    MultimodalGoogle82.2
  30. Grok 4.3
    LLMxAI90.1
  31. GPT-5.5 Instant
    LLMOpenAI
  32. GPT Chat
    MultimodalOpenAI
  33. Mistral Medium 3.5
    MultimodalMistral AI74.8
  34. Granite 4.1 8B
    LLMIBM43.3
  35. Granite 4.1 30B
    LLMIBM48.1
  36. Granite 4.1 3B
    LLMIBM31.4
  37. Nemotron 3 Nano Omni 30B A3B Reasoning
    LLMNVIDIA46.9
  38. OpenAI GPT
    MultimodalOpenAI
  39. Anthropic Claude Sonnet
    MultimodalAnthropic
  40. Google Gemini Flash
    MultimodalGoogle
  41. MoonshotAI Kimi
    MultimodalMoonshot AI
  42. Google Gemini Pro
    MultimodalGoogle
  43. OpenAI GPT Mini
    MultimodalOpenAI
  44. Anthropic Claude Haiku
    MultimodalAnthropic
  45. Qwen3.6 27B
    MultimodalAlibaba84.2
  46. Qwen3.6 Max
    LLMAlibaba88.8
  47. Qwen3.6 35B A3B
    MultimodalAlibaba84.1
  48. Qwen3.6 Flash
    MultimodalAlibaba
  49. GPT-5.5 Pro
    MultimodalOpenAI
  50. DeepSeek-V4-Flash
    LLMDeepSeek89.4
  51. DeepSeek-V4-Pro
    LLMDeepSeek88.2
  52. Ling-2.6-1T
    LLMInclusionAI75.2
  53. GPT-5.5
    LLMOpenAI76.1
  54. MiMo-V2.5
    MultimodalXiaomi84.9
  55. MiMo-V2.5-Pro
    LLMXiaomi86.6
  56. Hy3
    LLMTencent86.7
  57. Claude Opus
    MultimodalAnthropic
  58. Ling-2.6-flash
    LLMInclusionAI59.3
  59. GPT-5.4 Image 2
    MultimodalOpenAI
  60. Qianfan-OCR-Fast
    MultimodalBaidu
  61. Kimi K2.6
    LLMMoonshot AI74.9
  62. Claude Opus 4.7
    LLMAnthropic90.9
  63. JT-MINI
    LLMChina Mobile67.6
  64. EXAONE 4.5 33B
    LLMLG AI Research79.4
  65. Muse Spark
    MultimodalMeta88.4
  66. Grok 4.20 0309 v2
    LLMxAI91.1
  67. GLM 5.1
    LLMZhipu AI86.8
  68. Gemma 4 E4B
    LLMGoogle57.6
  69. Gemma 4 26B A4B
    MultimodalGoogle79.2
  70. Step 3.5 Flash 2603
    LLMStepFun82.6
  71. Gemma 4 E2B
    LLMGoogle43.3
  72. Qwen3.6 Plus
    MultimodalAlibaba88.2
  73. Gemma 4 31B
    MultimodalGoogle85.7
  74. Trinity Large Thinking
    LLMArcee AI75.2
  75. GLM 5V Turbo
    MultimodalZhipu AI80.9
  76. Grok 4.20 Multi-Agent
    MultimodalxAI
  77. Qwen3.5 Omni Flash
    LLMAlibaba74.2
  78. Qwen3.5 Omni Plus
    LLMAlibaba82.6
  79. Lyria 3 Clip
    MultimodalGoogle
  80. Lyria 3 Pro
    MultimodalGoogle
  81. MiMo-V2-Omni-0327
    LLMXiaomi85.5
  82. KAT-Coder-Pro V2
    LLMKuaishou85.5
  83. Reka Edge
    MultimodalReka AI
  84. Nemotron Cascade 2 30B A3B
    LLMNVIDIA75.8
  85. MiMo-V2-Pro
    LLMXiaomi87
  86. MiMo-V2-Omni
    MultimodalXiaomi82.8
  87. MiniMax M2.7
    LLMMiniMax87.4
  88. GPT-5.4 nano
    LLMOpenAI81.7
  89. GPT-5.4 mini
    LLMOpenAI87.5
  90. NVIDIA Nemotron 3 Nano 4B
    LLMNVIDIA51.3
  91. Mistral Small 4
    MultimodalMistral AI76.9
  92. GLM 5 Turbo
    LLMZhipu AI84.7
  93. NVIDIA Nemotron 3 Super 120B A12B
    LLMNVIDIA80
  94. Nemotron 3 Super
    LLMNVIDIA
  95. Grok 4.20 0309
    LLMxAI88.5
  96. Seed-2.0-Lite
    MultimodalByteDance
  97. Qwen3.5-9B
    MultimodalAlibaba80.6
  98. Sarvam 30B
    LLMSarvam63.3
  99. Sarvam 105B
    LLMSarvam73.8
  100. GPT-5.4 Pro
    MultimodalOpenAI
  101. GPT-5.4
    LLMOpenAI74.9
  102. Mercury 2
    LLMInception77
  103. GPT-5.3 Chat
    MultimodalOpenAI
  104. Qwen3.5 2B
    LLMAlibaba45.6
  105. Qwen3.5 4B
    LLMAlibaba77.1
  106. Qwen3.5 0.8B
    LLMAlibaba23.6
  107. Seed-2.0-Mini
    MultimodalByteDance
  108. Nano Banana 2
    MultimodalGoogle
  109. Gemini 3.1 Pro Custom Tools
    MultimodalGoogle
  110. LFM2-24B-A2B
    LLMLiquid AI47.4
  111. Qwen3.5-Flash
    MultimodalAlibaba
  112. Qwen3.5-122B-A10B
    MultimodalAlibaba85.7
  113. Qwen3.5-27B
    MultimodalAlibaba85.8
  114. Qwen3.5-35B-A3B
    MultimodalAlibaba84.5
  115. GPT-5.3-Codex
    MultimodalOpenAI91.5
  116. Aion-2.0
    LLMAion Labs
  117. Gemini 3.1 Pro
    MultimodalGoogle83.2
  118. Tiny Aya Global
    LLMCohere30.5
  119. Grok 4.20
    LLMxAI
  120. Claude Sonnet 4.6
    LLMAnthropic76.3
  121. Qwen3.5 397B A17B
    MultimodalAlibaba89.3
  122. Qwen3.5 Plus
    MultimodalAlibaba
  123. Qwen3.5
    MultimodalAlibaba
  124. MiniMax M2.5
    LLMMiniMax84.8
  125. Nanbeige4.1-3B
    LLMNanbeige84.9
  126. GLM-5
    LLMZhipu AI81.9
  127. Tri-21B-Think
    LLMTrillion Labs60.1
  128. Qwen3 Max Thinking
    LLMAlibaba76.1
  129. Claude Opus 4.6
    LLMAnthropic79.4
  130. Qwen3 Coder Next
    LLMAlibaba73.7
  131. Step 3.5 Flash
    LLMStepFun83.1
  132. LongCat Flash Lite
    LLMLongCat63.6
  133. Solar Pro 3
    LLMUpstage72.4
  134. Kimi K2.5
    MultimodalMoonshot AI87.9
  135. MiniMax M2-her
    LLMMiniMax
  136. Palmyra X5
    LLMWriter
  137. Step3 VL 10B
    LLMStepFun69
  138. LFM2.5-1.2B-Thinking
    LLMLiquid AI33.9
  139. GPT Audio Mini
    MultimodalOpenAI
  140. GPT Audio
    MultimodalOpenAI
  141. GLM 4.7 Flash
    LLMZhipu AI58.1
  142. GPT-5.2-Codex
    MultimodalOpenAI89.9
  143. Olmo 3.1 32B Instruct
    LLMAllen Institute for AI53.9
  144. LFM2.5-VL-1.6B
    LLMLiquid AI28.9
  145. LFM2.5-1.2B-Instruct
    LLMLiquid AI32.6
  146. Falcon-H1R-7B
    LLMTII UAE72.8

2025 · 362 models · 13 papers

  1. K-EXAONE
    LLMLG AI Research82.3
  2. HyperCLOVA X SEED Think
    LLMNaver65.5
  3. Seed 1.6
    MultimodalByteDance
  4. Seed 1.6 Flash
    MultimodalByteDance
  5. MiniMax M2.1
    LLMMiniMax83.6
  6. GLM 4.7
    LLMZhipu AI89
  7. Solar Open 100B
    LLMUpstage65.7
  8. Gemini 3 Flash
    MultimodalGoogle90.2
  9. K2 Think V2
    LLMMBZUAI Institute of Foundation Models71.3
  10. NVIDIA Nemotron 3 Nano 30B A3B
    LLMNVIDIA80.1
  11. Nemotron 3 Nano
    LLMNVIDIA
  12. MiMo-V2-Flash
    LLMXiaomi88
  13. Nemotron 3 Nano 30B A3B
    LLMNVIDIA
  14. Olmo 3.1 32B Think
    LLMAllen Institute for AI70.6
  15. Mi:dm K 2.5 Pro
    LLMKorea Telecom74.4
  16. Molmo2-8B
    LLMAllen Institute for AI42.5
  17. GPT-5.2
    LLMOpenAI86.2
  18. GPT-5.2 Pro
    MultimodalOpenAI
  19. GPT-5.2 Chat
    MultimodalOpenAI
  20. Devstral Small 2
    LLMMistral AI47.5
  21. Devstral 2
    LLMMistral AI54.3
  22. DeepSeek V3.1 Nex N1
    LLMNex Agi
  23. Relace Search
    LLMRelace
  24. GLM 4.6V
    MultimodalZhipu AI69.6
  25. Rnj 1 Instruct
    LLMEssential AI
  26. K2-V2
    LLMMBZUAI Institute of Foundation Models73.6
  27. Motif-2-12.7B-Reasoning
    LLMMotif Technologies73.6
  28. GPT-5.1-Codex-Max
    MultimodalOpenAI
  29. Ministral 3 3B
    MultimodalMistral AI33.7
  30. Ministral 3 8B
    MultimodalMistral AI43.3
  31. Ministral 3 14B
    MultimodalMistral AI47.9
  32. Nova 2 Lite
    MultimodalAmazon82.1
  33. Mistral Large 3
    LLMMistral AI58.3
  34. DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
    PaperDeepSeek AI
  35. Trinity Mini
    LLMArcee AI
  36. DeepSeek V3.2 Speciale
    LLMDeepSeek89.9
  37. Seedream 4.5
    ImageByteDance
  38. Seedance 1.5 Pro
    VideoByteDance
  39. Kling O1
    VideoKuaishou
  40. Runway Gen-4.5
    VideoRunway
  41. DeepSeek-V3.2
    LLMDeepSeek87.1
  42. Nova 2.0 Pro
    LLMAmazon80.9
  43. INTELLECT-3
    LLMPrime Intellect81
  44. Nova 2.0 Omni
    LLMAmazon78.2
  45. Apriel-v1.6-15B-Thinker
    LLMServiceNow80.3
  46. FLUX.2
    ImageBlack Forest Labs
  47. Claude Opus 4.5
    LLMAnthropic88
  48. Natural Emergent Misalignment from Reward Hacking in Production RL
    PaperAnthropic
  49. Olmo 3 32B Think
    LLMAllen Institute for AI69.5
  50. HunyuanVideo 1.5
    VideoTencent
  51. Olmo 3 7B Think
    LLMAllen Institute for AI62.4
  52. Olmo 3 7B Instruct
    LLMAllen Institute for AI40
  53. Nano Banana Pro
    MultimodalGoogle
  54. Gemini 3 Pro Image
    ImageGoogle
  55. OLMo 3
    LLMAllen Institute for AI
  56. Grok 4.1 Fast
    LLMxAI85.6
  57. Cogito v2.1
    LLMDeep Cogito75.8
  58. Gemini 3 Deep Think
    MultimodalGoogle69.5
  59. Gemini 3 Pro
    MultimodalGoogle82.8
  60. Grok 4.1
    LLMxAI
  61. ERNIE 5.0 Thinking
    LLMBaidu81.7
  62. Cogito v2.1 671B
    LLMDeep Cogito
  63. GPT-5.1-Codex-Mini
    MultimodalOpenAI84.7
  64. GPT-5.1-Codex
    MultimodalOpenAI88.2
  65. GPT-5.1 Chat
    MultimodalOpenAI
  66. GPT-5.1
    LLMOpenAI89
  67. Olympiad-level Formal Mathematical Reasoning with Reinforcement Learning (AlphaProof)
    PaperGoogle DeepMind
  68. Doubao Seed Code
    LLMByteDance79.4
  69. KAT-Coder-Pro V1
    LLMKuaishou81.8
  70. Kimi K2 Thinking
    LLMMoonshot AI85.6
  71. Nova Premier 1.0
    MultimodalAmazon
  72. Kimi Linear 48B A3B Instruct
    LLMMoonshot AI43.5
  73. Sonar Pro Search
    MultimodalPerplexity
  74. Voxtral Small 24B
    MultimodalMistral AI
  75. gpt-oss-safeguard-20b
    LLMOpenAI
  76. Granite 4.0 H 1B
    LLMIBM18
  77. Granite 4.0 1B
    LLMIBM17.9
  78. Granite 4.0 350M
    LLMIBM10.2
  79. Granite 4.0 H 350M
    LLMIBM10.4
  80. NVIDIA Nemotron Nano 12B v2 VL
    LLMNVIDIA69.4
  81. MiniMax-M2
    LLMMiniMax76
  82. Qwen3 VL 32B Instruct
    MultimodalAlibaba66.5
  83. Qwen3 VL 32B
    LLMAlibaba78.4
  84. Granite 4.0 Micro
    LLMIBM25.6
  85. Phi 4 Mini Instruct
    LLMMicrosoft32.6
  86. GPT-5 Image Mini
    MultimodalOpenAI
  87. Veo 3.1
    VideoGoogle
  88. Claude Haiku 4.5
    LLMAnthropic75.3
  89. Qwen3 VL 8B
    LLMAlibaba49.7
  90. Qwen3 VL 4B Instruct
    LLMAlibaba41.6
  91. Qwen3 VL 4B
    LLMAlibaba44.3
  92. GPT-5 Image
    MultimodalOpenAI
  93. Qwen3 VL 8B Instruct
    MultimodalAlibaba43
  94. Qwen3 VL 8B Thinking
    MultimodalAlibaba
  95. Ring-1T
    LLMInclusionAI77.9
  96. Llama 3.3 Nemotron Super 49B V1.5
    LLMNVIDIA
  97. o4 Mini Deep Research
    MultimodalOpenAI
  98. o3 Deep Research
    MultimodalOpenAI
  99. ERNIE 4.5 21B A3B Thinking
    LLMBaidu
  100. Ling-1T
    LLMInclusionAI73.3
  101. Jamba Reasoning 3B
    LLMAI21 Labs30.7
  102. LFM2 8B A1B
    LLMLiquid AI31.3
  103. Nano Banana
    MultimodalGoogle
  104. Qwen3 VL 30B A3B Instruct
    MultimodalAlibaba66.5
  105. Qwen3 VL 30B A3B Thinking
    MultimodalAlibaba
  106. GPT-5 Pro
    LLMOpenAI88.4
  107. Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
    PaperStanford / SambaNova / UC Berkeley
  108. Qwen3 VL 30B A3B
    LLMAlibaba76.2
  109. IBM Granite 4.0
    LLMIBM
  110. Apriel-v1.5-15B-Thinker
    LLMServiceNow77.2
  111. Sora 2
    VideoOpenAI
  112. GLM-4.6
    LLMZhipu AI72.4
  113. DeepSeek V3.2 Exp
    LLMDeepSeek72.2
  114. Claude Sonnet 4.5
    LLMAnthropic80.4
  115. HunyuanImage 3.0
    ImageTencent
  116. Relace Apply 3
    LLMRelace
  117. Gemini 2.5 Flash
    LLMGoogle78.3
  118. Gemini 2.5 Flash Lite 09-
    MultimodalGoogle
  119. Qwen3 VL 235B A22B
    LLMAlibaba78.4
  120. LFM2 2.6B
    LLMLiquid AI19.2
  121. GPT-5 Codex
    MultimodalOpenAI87.1
  122. Qwen3 Coder Plus
    LLMAlibaba
  123. Qwen3 Max
    LLMAlibaba79.5
  124. Qwen3 VL 235B A22B Instruct
    MultimodalAlibaba70.9
  125. Qwen3 VL 235B A22B Thinking
    MultimodalAlibaba
  126. Kling 2.5 Turbo
    VideoKuaishou
  127. Qwen3 Omni 30B A3B Instruct
    LLMAlibaba57.3
  128. Qwen3 Omni 30B A3B
    LLMAlibaba73.4
  129. Granite 4.0 H Small
    LLMIBM35.7
  130. DeepSeek V3.1 Terminus
    LLMDeepSeek83.5
  131. Ring-flash-2.0
    LLMInclusionAI74.6
  132. Grok 4 Fast
    LLMxAI78.7
  133. Magistral Medium 1.2
    LLMMistral AI78.1
  134. Tongyi DeepResearch 30B A3B
    LLMAlibaba
  135. Luma Ray3
    VideoLuma AI
  136. Ling-flash-2.0
    LLMInclusionAI66.9
  137. Magistral Small 1.2
    LLMMistral AI73.9
  138. Qwen3 Coder Flash
    LLMAlibaba
  139. Qwen3 Next 80B A3B Instruct
    LLMAlibaba68.9
  140. Qwen3 Next 80B A3B Thinking
    LLMAlibaba77.5
  141. Stable Audio 2.5
    AudioStability AI
  142. Qwen3-Next-80B-A3B
    LLMAlibaba80.3
  143. Ling-mini-2.0
    LLMInclusionAI53.9
  144. Gemini 2.5 Flash-Lite
    LLMGoogle72.3
  145. Qwen Plus
    LLMAlibaba
  146. Kimi K2-Instruct-0905
    LLMMoonshot AI66
  147. Kimi K2 0905
    LLMMoonshot AI71
  148. Nemotron Nano 9B V2
    LLMNVIDIA77.6
  149. Seedream 4.0
    ImageByteDance
  150. Apertus 70B Instruct
    LLMSwiss AI Initiative27.2
  151. Apertus 8B Instruct
    LLMSwiss AI Initiative25.6
  152. Suno v5
    AudioSuno
  153. Grok Code Fast 1
    LLMxAI65.3
  154. Qwen3 30B A3B Thinking
    LLMAlibaba
  155. Hermes 4 - Llama-3.1 405B
    LLMNous Research73.5
  156. Hermes 4 - Llama-3.1 70B
    LLMNous Research71.3
  157. Hermes 4 405B
    LLMNous Research
  158. Hermes 4 70B
    LLMNous Research
  159. Gemini 2.5 Flash Image
    ImageGoogle
  160. Command A Reasoning
    LLMCohere
  161. DeepSeek-V3.1
    LLMDeepSeek59.8
  162. Seed-OSS-36B-Instruct
    LLMByteDance78.8
  163. NVIDIA Nemotron Nano 9B V2
    LLMNVIDIA68.3
  164. GPT-4o Audio
    MultimodalOpenAI
  165. Imagen 4
    ImageGoogle
  166. Gemma 3 270M
    LLMGoogle7.6
  167. Mistral Medium 3.1
    MultimodalMistral AI51.5
  168. ERNIE 4.5 VL 28B A3B
    MultimodalBaidu
  169. ERNIE 4.5 21B A3B
    LLMBaidu
  170. GLM 4.5V
    MultimodalZhipu AI70.1
  171. gpt-oss-120b & gpt-oss-20b Model Card
    PaperOpenAI
  172. Jamba Large 1.7
    LLMAI21 Labs36.5
  173. GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
    PaperZ.ai (Zhipu AI)
  174. GPT-5 Chat
    MultimodalOpenAI
  175. GPT-5 nano
    LLMOpenAI71.2
  176. GPT-5 mini
    LLMOpenAI79.2
  177. GPT-5
    LLMOpenAI80.5
  178. Qwen3 4B 2507
    LLMAlibaba71.9
  179. Qwen3 4B 2507 Instruct
    LLMAlibaba52.2
  180. Genie 3
    VideoGoogle
  181. gpt-oss-20b
    LLMOpenAI73.6
  182. gpt-oss-120b
    LLMOpenAI79.6
  183. Claude Opus 4.1
    LLMAnthropic75.4
  184. Codestral
    LLMMistral AI
  185. Qwen3 Coder 30B A3B Instruct
    LLMAlibaba55.4
  186. Qwen3 30B A3B 2507
    LLMAlibaba74.7
  187. Qwen3 30B A3B 2507 Instruct
    LLMAlibaba69.3
  188. Qwen3 30B A3B Instruct
    LLMAlibaba
  189. Wan 2.2
    VideoAlibaba
  190. GLM-4.5
    LLMZhipu AI73
  191. Kimi K2: Open Agentic Intelligence
    PaperMoonshot AI
  192. Qwen3 235B A22B 2507
    LLMAlibaba84.2
  193. Llama Nemotron Super 49B v1.5
    LLMNVIDIA79.4
  194. Qwen3-235B-A22B-Thinking-2507
    LLMAlibaba79.6
  195. Qwen3 235B A22B Thinking
    LLMAlibaba
  196. GLM 4.5 Air
    LLMZhipu AI70.4
  197. GLM 4 32B
    LLMZhipu AI
  198. Group Sequence Policy Optimization
    PaperAlibaba (Qwen Team)
  199. Qwen3 Coder 480B A35B
    LLMAlibaba
  200. Qwen3 Coder 480B A35B Instruct
    LLMAlibaba66.5
  201. UI-TARS 7B
    MultimodalByteDance
  202. Qwen3-235B-A22B-Instruct-2507
    LLMAlibaba72.2
  203. Gemini 2.5 Flash Lite
    MultimodalGoogle57
  204. Qwen3-Coder
    LLMAlibaba55.4
  205. Qwen3 235B A22B Instruct
    LLMAlibaba
  206. EXAONE 4.0 32B
    LLMLG AI Research79.8
  207. Exaone 4.0 1.2B
    LLMLG AI Research53.1
  208. Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation
    PaperKAIST AI / Google DeepMind / Mila
  209. Switchpoint Router
    LLMSwitchpoint
  210. Kimi K2 Instruct
    LLMMoonshot AI66.5
  211. Kimi K2 Base
    LLMMoonshot AI50.2
  212. Kimi K2
    LLMMoonshot AI73.6
  213. LFM2 1.2B
    LLMLiquid AI13.5
  214. Devstral Small 1.1
    LLMMistral AI
  215. Devstral Medium
    LLMMistral AI47.8
  216. Grok-4 Heavy
    MultimodalxAI89.3
  217. Grok 4
    LLMxAI78.2
  218. Hunyuan A13B Instruct
    LLMTencent
  219. Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
    PaperGoogle DeepMind
  220. Jamba 1.7 Mini
    LLMAI21 Labs22.5
  221. Morph V3 Fast
    LLMMorph
  222. Morph V3 Large
    LLMMorph
  223. ERNIE 4.5 300B A47B
    LLMBaidu68.2
  224. ERNIE 4.5 VL 424B A47B
    MultimodalBaidu
  225. ERNIE 4.5
    LLMBaidu
  226. FLUX.1 Kontext [dev]
    ImageBlack Forest Labs
  227. Hunyuan-A13B
    LLMTencent
  228. Gemma 3n E2B Instruct
    LLMGoogle27.5
  229. Gemma 3n E4B Instructed
    MultimodalGoogle24.8
  230. Gemma 3n E4B
    MultimodalGoogle56.9
  231. Gemma 3n E2B Instructed
    MultimodalGoogle21.3
  232. Gemma 3n E2B
    MultimodalGoogle49.1
  233. Mistral Small 3.2
    LLMMistral AI51
  234. Mistral Small 3.2 24B Instruct
    MultimodalMistral AI56.2
  235. Mistral Small 3.2 24B
    MultimodalMistral AI
  236. MiniMax M1 40k
    LLMMiniMax67.6
  237. MiniMax M1 80k
    LLMMiniMax75.5
  238. MiniMax-M1
    LLMMiniMax61.5
  239. MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
    PaperMiniMax AI
  240. Seedance 1.0
    VideoByteDance
  241. Magistral Small 1
    LLMMistral AI64.7
  242. Magistral Medium 1
    LLMMistral AI65.5
  243. Magistral Small 2506
    LLMMistral AI62.1
  244. Magistral Medium
    MultimodalMistral AI62.9
  245. o3 Pro
    MultimodalOpenAI84.5
  246. Magistral Small
    LLMMistral AI
  247. Gemini 2.5 Pro Preview 06-05
    MultimodalGoogle76.6
  248. ElevenLabs v3
    AudioElevenLabs
  249. DeepSeek R1 0528 Qwen3 8B
    LLMDeepSeek66.2
  250. DeepSeek-R1-0528
    LLMDeepSeek63.3
  251. Sarvam M
    LLMSarvam56.4
  252. Claude Sonnet 4
    LLMAnthropic74.5
  253. Claude Opus 4
    LLMAnthropic69.4
  254. Devstral Small
    LLMMistral AI45.3
  255. Gemma 3n E4B Instruct
    LLMGoogle34.7
  256. Llama 3.1 Nemotron Nano 4B v1.1
    LLMNVIDIA54.5
  257. Solar Pro 2
    LLMUpstage72.5
  258. MedGemma 4B IT
    MultimodalGoogle
  259. Gemma 3n E4B Instructed LiteRT Preview
    MultimodalGoogle30.3
  260. Gemma 3n E2B Instructed LiteRT (Preview)
    MultimodalGoogle25.4
  261. Gemini Diffusion
    LLMGoogle30.2
  262. Gemma 3n 4B
    LLMGoogle
  263. Lyria 2
    AudioGoogle
  264. Veo 3
    VideoGoogle
  265. Qwen3 Technical Report
    PaperAlibaba (Qwen Team)
  266. Mistral Medium 3
    MultimodalMistral AI58.6
  267. Coder Large
    LLMArcee AI
  268. Virtuoso Large
    LLMArcee AI
  269. Maestro Reasoning
    LLMArcee AI
  270. Spotlight
    MultimodalArcee AI
  271. IBM Granite 4.0 Tiny Preview
    LLMIBM48
  272. Nova Premier
    LLMAmazon53.1
  273. Phi 4 Reasoning Plus
    LLMMicrosoft70.4
  274. Phi 4 Reasoning
    LLMMicrosoft66.4
  275. Phi 4 Mini Reasoning
    LLMMicrosoft73.3
  276. Llama Guard 4 12B
    MultimodalMeta
  277. Qwen3 0.6B
    LLMAlibaba29.3
  278. Qwen3 4B
    LLMAlibaba56.5
  279. Qwen3 1.7B
    LLMAlibaba46.9
  280. Qwen3 235B A22B
    LLMAlibaba74.9
  281. Qwen3 32B
    LLMAlibaba73.8
  282. Qwen3 14B
    LLMAlibaba66.8
  283. Qwen3 8B
    LLMAlibaba57.8
  284. Qwen3 30B A3B
    LLMAlibaba71.7
  285. Qwen3
    LLMAlibaba75.8
  286. Gemini 2.5 Flash
    MultimodalGoogle73.1
  287. Granite 3.3 8B
    LLMIBM32.5
  288. Granite 3.3 8B Instruct
    MultimodalIBM68.5
  289. Granite 3.3 8B Base
    MultimodalIBM64.6
  290. o4 Mini High
    MultimodalOpenAI
  291. o4-mini
    MultimodalOpenAI78.4
  292. o3
    LLMOpenAI71.6
  293. GPT-4.1 Nano
    MultimodalOpenAI42.1
  294. GPT-4.1 Mini
    MultimodalOpenAI58.2
  295. GPT-4.1
    MultimodalOpenAI63.8
  296. Llama 3.1 Nemotron Ultra 253B v1
    LLMNVIDIA78.3
  297. Llama 4 Scout
    MultimodalMeta58.9
  298. Llama 4 Maverick
    MultimodalMeta63.9
  299. Runway Gen-4
    VideoRunway
  300. Qwen2.5-Omni-7B
    MultimodalAlibaba51.5
  301. DeepSeek-V3 0324
    LLMDeepSeek65.9
  302. Gemini 2.5 Pro
    MultimodalGoogle71.6
  303. o1-pro
    MultimodalOpenAI82.5
  304. Llama-3.3 Nemotron Super 49B v1
    LLMNVIDIA63.9
  305. Llama 3.1 Nemotron Nano 8B V1
    LLMNVIDIA68.2
  306. Mistral Small 3.1
    LLMMistral AI42.4
  307. Mistral Small 3.1 24B Instruct
    MultimodalMistral AI48
  308. Mistral Small 3.1 24B Base
    MultimodalMistral AI50.9
  309. Mistral Small 3.1 24B
    MultimodalMistral AI
  310. OLMo 2 32B
    LLMAllen Institute for AI23.5
  311. Gemma 3 1B Instruct
    LLMGoogle16.2
  312. DeepHermes 3 - Mistral 24B
    LLMNous Research43.8
  313. Command A
    LLMCohere55.9
  314. Gemma 3 12B
    MultimodalGoogle55.5
  315. Gemma 3 4B
    MultimodalGoogle45.8
  316. Gemma 3 12B Instruct
    LLMGoogle40
  317. Gemma 3 4B Instruct
    LLMGoogle31.7
  318. Gemma 3 27B Instruct
    LLMGoogle44.5
  319. Reka Flash 3
    LLMReka AI56.2
  320. Gemma 3 1B
    LLMGoogle21.2
  321. Gemma 3 27B
    MultimodalGoogle58.4
  322. GPT-4o Search
    LLMOpenAI
  323. GPT-4o-mini Search
    LLMOpenAI
  324. Sonar Deep Research
    LLMPerplexity
  325. Sonar Pro
    MultimodalPerplexity58.8
  326. Sonar Reasoning Pro
    MultimodalPerplexity95.7
  327. Jamba 1.6 Large
    LLMAI21 Labs42.6
  328. Jamba 1.6 Mini
    LLMAI21 Labs24.9
  329. QwQ-32B
    LLMAlibaba67.8
  330. Qwen2.5 VL 32B Instruct
    MultimodalAlibaba62.1
  331. GPT-4.5
    MultimodalOpenAI59.4
  332. Gemini 2.0 Flash Lite
    MultimodalGoogle54.4
  333. Claude 3.7 Sonnet
    LLMAnthropic74.7
  334. Grok 3 Reasoning
    LLMxAI
  335. Grok 3 mini Reasoning
    LLMxAI80.9
  336. R1 1776
    LLMPerplexity95.4
  337. Mistral Saba
    LLMMistral AI57.1
  338. Saba
    LLMMistral AI
  339. Grok-3 Mini
    MultimodalxAI85.9
  340. Grok-3
    MultimodalxAI82.6
  341. DeepHermes 3 - Llama-3.1 8B
    LLMNous Research23.5
  342. o3 Mini High
    LLMOpenAI
  343. Llama Guard 3 8B
    LLMMeta
  344. Gemini 2.0 Pro
    LLMGoogle67.4
  345. Aion-RP 1.0
    LLMAion Labs
  346. Aion-1.0-Mini
    LLMAion Labs
  347. Aion-1.0
    LLMAion Labs
  348. Phi-4-multimodal-instruct
    MultimodalMicrosoft46.2
  349. Phi 4 Mini
    LLMMicrosoft45.3
  350. Qwen2.5 VL 72B Instruct
    MultimodalAlibaba79.1
  351. o3-mini
    LLMOpenAI64.1
  352. Llama 3.1 Tulu3 405B
    LLMAllen Institute for AI57.5
  353. Mistral Small 3 24B Instruct
    LLMMistral AI62.1
  354. Mistral Small 3 24B Base
    MultimodalMistral AI44.4
  355. Mistral Small 3
    LLMMistral AI43.6
  356. R1 Distill Qwen 32B
    LLMDeepSeek
  357. Qwen2.5 Max
    LLMAlibaba63.6
  358. Sonar Reasoning
    LLMPerplexity77.2
  359. Sonar
    MultimodalPerplexity56.8
  360. Qwen2.5 VL 7B Instruct
    MultimodalAlibaba70
  361. R1 Distill Llama 70B
    LLMDeepSeek
  362. DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
    PaperDeepSeek
  363. Gemini 2.0 Flash Thinking
    MultimodalGoogle69.1
  364. Kimi-k1.5
    MultimodalMoonshot AI82.2
  365. DeepSeek R1 Zero
    LLMDeepSeek71.5
  366. DeepSeek R1 Distill Qwen 7B
    LLMDeepSeek58.3
  367. DeepSeek R1 Distill Qwen 32B
    LLMDeepSeek68.4
  368. DeepSeek R1 Distill Qwen 14B
    LLMDeepSeek65.7
  369. DeepSeek R1 Distill Qwen 1.5B
    LLMDeepSeek32.6
  370. DeepSeek R1 Distill Llama 8B
    LLMDeepSeek53.3
  371. DeepSeek R1 Distill Llama 70B
    LLMDeepSeek70.1
  372. R1
    LLMDeepSeek
  373. DeepSeek-R1
    LLMDeepSeek75
  374. MiniMax-01
    MultimodalMiniMax
  375. Phi 4
    LLMMicrosoft47.6

2024 · 112 models · 8 papers

  1. DeepSeek-V3 Technical Report
    PaperDeepSeek
  2. DeepSeek-V3
    LLMDeepSeek58.1
  3. QvQ-72B-Preview
    MultimodalAlibaba70.9
  4. GPT-4o Realtime
    LLMOpenAI
  5. GPT-4o mini Realtime
    LLMOpenAI
  6. Veo 2
    VideoGoogle
  7. Command R7B
    LLMCohere
  8. DeepSeek VL2 Tiny
    MultimodalDeepSeek67.2
  9. DeepSeek VL2 Small
    MultimodalDeepSeek73.1
  10. DeepSeek VL2
    MultimodalDeepSeek74.9
  11. Gemini 2.0 Flash
    MultimodalGoogle60.3
  12. Sora
    VideoOpenAI
  13. Llama 3.3 70B Instruct
    LLMMeta50.6
  14. Llama 3.3 70B
    LLMMeta
  15. Nova Pro 1.0
    MultimodalAmazon
  16. Nova Micro 1.0
    LLMAmazon
  17. Nova Lite 1.0
    MultimodalAmazon
  18. o1
    LLMOpenAI65.4
  19. QwQ-32B-Preview
    LLMAlibaba62.6
  20. OLMo 2 7B
    LLMAllen Institute for AI15.5
  21. Nova Pro
    MultimodalAmazon61.6
  22. Nova Micro
    LLMAmazon49
  23. Nova Lite
    MultimodalAmazon57.7
  24. Pixtral Large
    MultimodalMistral AI53.1
  25. Mistral Large
    LLMMistral AI39.3
  26. Qwen2.5 Turbo
    LLMAlibaba50.3
  27. Qwen2.5 Coder 32B Instruct
    LLMAlibaba50.1
  28. Claude 3.5 Haiku
    LLMAnthropic54.5
  29. Ministral 8B Instruct
    LLMMistral AI70.9
  30. Qwen2.5 7B Instruct
    LLMAlibaba46.6
  31. Inflection 3 Pi
    LLMInflection
  32. Inflection 3 Productivity
    LLMInflection
  33. Reka Flash
    LLMReka AI52.9
  34. Llama 3.1 Nemotron 70B Instruct
    LLMNVIDIA43.7
  35. LFM 40B
    LLMLiquid AI33.2
  36. Molmo 7B-D
    LLMAllen Institute for AI16.3
  37. Llama 3.2 90B Instruct
    MultimodalMeta54
  38. Llama 3.2 11B Instruct
    MultimodalMeta36.7
  39. Llama 3.2 3B Instruct
    LLMMeta30.8
  40. Llama 3.2 1B Instruct
    LLMMeta12.1
  41. Llama 3.2 11B Vision Instruct
    MultimodalMeta
  42. Qwen2.5-Coder 7B Instruct
    LLMAlibaba39.6
  43. Qwen2.5 32B Instruct
    LLMAlibaba66.7
  44. Qwen2.5 14B Instruct
    LLMAlibaba66.1
  45. Qwen2.5 72B Instruct
    LLMAlibaba59.1
  46. Qwen2.5 72B
    LLMAlibaba
  47. Pixtral-12B
    MultimodalMistral AI66.1
  48. o1-preview
    LLMOpenAI57.3
  49. o1-mini
    LLMOpenAI70.5
  50. Qwen2-VL-72B-Instruct
    MultimodalAlibaba67.3
  51. Phi-3.5-vision-instruct
    MultimodalMicrosoft61.7
  52. Phi-3.5-MoE-instruct
    LLMMicrosoft49.8
  53. Phi-3.5-mini-instruct
    LLMMicrosoft46
  54. Jamba 1.5 Mini
    LLMAI21 Labs29.6
  55. Jamba 1.5 Large
    LLMAI21 Labs42.8
  56. Hermes 3 70B Instruct
    LLMNous Research
  57. Hermes 3 405B Instruct
    LLMNous Research
  58. Hermes 3 - Llama-3.1 70B
    LLMNous Research42.5
  59. Grok
    LLMxAI53.8
  60. Grok-2 mini
    MultimodalxAI65.9
  61. Grok-2
    MultimodalxAI62.4
  62. FLUX.1
    ImageBlack Forest Labs
  63. The Llama 3 Herd of Models
    PaperMeta
  64. Mistral Large 2
    LLMMistral AI47.9
  65. Qwen2 7B Instruct
    LLMAlibaba37.4
  66. Qwen2 72B Instruct
    LLMAlibaba59.9
  67. Llama 3.1 405B Instruct
    LLMMeta60.9
  68. Llama 3.1 8B Instruct
    LLMMeta45
  69. Llama 3.1 70B Instruct
    LLMMeta56
  70. Llama 3.1 405B
    LLMMeta
  71. Mistral Nemo
    LLMMistral AI
  72. Mistral NeMo Instruct
    LLMMistral AI
  73. GPT-4o-mini
    MultimodalOpenAI49.1
  74. Qwen2 Technical Report
    PaperAlibaba
  75. Gemma 2 27B
    LLMGoogle
  76. Runway Gen-3 Alpha
    VideoRunway
  77. Gemma 2 9B
    LLMGoogle
  78. Gemma 2
    LLMGoogle
  79. Claude 3.5 Sonnet
    LLMAnthropic70.3
  80. DeepSeek Coder V2 Lite Instruct
    LLMDeepSeek30.2
  81. DeepSeek-Coder-V2
    LLMDeepSeek74.3
  82. Stable Diffusion 3
    ImageStability AI
  83. Qwen2 72B
    LLMAlibaba
  84. Codestral-22B
    LLMMistral AI
  85. Hermes 2 Pro - Llama-3 8B
    LLMNous Research
  86. GPT-4o
    MultimodalOpenAI56.4
  87. DeepSeek-V2.5
    LLMDeepSeek63.4
  88. DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Model
    PaperDeepSeek
  89. DeepSeek-V2-Chat
    LLMDeepSeek
  90. DeepSeek-V2
    LLMDeepSeek
  91. Gemini 1.5 Flash
    MultimodalGoogle61.9
  92. Qwen1.5 Chat 110B
    LLMAlibaba28.9
  93. Arctic Instruct
    LLMSnowflake
  94. Phi-3 Mini Instruct 3.8B
    LLMMicrosoft27.5
  95. Phi-3
    LLMMicrosoft
  96. Phi-3 Technical Report
    PaperMicrosoft
  97. Llama 3 8B Instruct
    LLMMeta32.4
  98. Llama 3 70B Instruct
    LLMMeta40.8
  99. Llama 3 70B
    LLMMeta
  100. Mixtral 8x22B Instruct
    LLMMistral AI39.1
  101. WizardLM-2 8x22B
    LLMMicrosoft
  102. Grok-1.5V
    MultimodalxAI71.3
  103. Mixtral 8x22B
    LLMMistral AI
  104. Command R+
    LLMCohere28.9
  105. Grok-1.5
    LLMxAI50.3
  106. DBRX Instruct
    LLMDatabricks27.5
  107. DBRX
    LLMDatabricks
  108. Gemini 1.5 Flash 8B
    MultimodalGoogle48.4
  109. Claude 3 Haiku
    MultimodalAnthropic38.9
  110. Gemma: Open Models Based on Gemini Research and Technology
    PaperGoogle
  111. Gemini 1.5: Unlocking Multimodal Understanding Across Millions of Tokens of Context
    PaperGoogle
  112. Claude 3 Opus
    LLMAnthropic58.5
  113. Claude 3 Sonnet
    MultimodalAnthropic45.8
  114. Mistral Small
    LLMMistral AI40.3
  115. Gemma
    LLMGoogle
  116. Gemini 1.0 Pro
    LLMGoogle34
  117. Gemini 1.5 Pro
    MultimodalGoogle67.3
  118. Solar Mini
    LLMUpstage33.1
  119. text-embedding-3-large
    EmbeddingOpenAI
  120. Mixtral of Experts
    PaperMistral AI

2023 · 34 models · 12 papers

  1. Gemini: A Family of Highly Capable Multimodal Models
    PaperGoogle
  2. OpenChat 3.5
    LLMOpenChat24.1
  3. Phi-2
    LLMMicrosoft
  4. Mistral Medium
    LLMMistral AI33.6
  5. Mixtral 8x7B Instruct
    LLMMistral AI26.1
  6. Mixtral 8x7B
    LLMMistral AI
  7. Gemini 1.0 Ultra
    LLMGoogle
  8. Gemini 1.0
    MultimodalGoogle
  9. Mamba: Linear-Time Sequence Modeling with Selective State Spaces
    PaperCarnegie Mellon
  10. Qwen Chat 72B
    LLMAlibaba
  11. DeepSeek LLM 67B Chat
    LLMDeepSeek
  12. Claude 2.1
    LLMAnthropic34.6
  13. Stable Video Diffusion
    VideoStability AI
  14. GPT-4 Turbo
    MultimodalOpenAI59.8
  15. Grok-1
    LLMxAI
  16. Mistral 7B
    PaperMistral AI
  17. DALL·E 3
    ImageOpenAI
  18. GPT-3.5 Turbo Instruct
    LLMOpenAI
  19. Mistral 7B Instruct v0.1
    LLMMistral AI
  20. Mistral 7B Instruct
    LLMMistral AI14.7
  21. Mistral 7B
    LLMMistral AI
  22. Qwen Chat 14B
    LLMAlibaba
  23. GPT-3.5 Turbo 16k
    LLMOpenAI
  24. Stable Diffusion XL
    Image · Stability AI
  25. Llama 2 Chat 7B
    LLM · Meta · 11.3
  26. Llama 2 Chat 13B
    LLM · Meta · 28.9
  27. Llama 2 Chat 70B
    LLM · Meta · 28.9
  28. Llama 2 70B
    LLM · Meta
  29. Llama 2: Open Foundation and Fine-Tuned Chat Models
    Paper · Meta
  30. Claude 2
    LLM · Anthropic · 33.4
  31. Textbooks Are All You Need
    Paper · Microsoft
  32. MusicGen
    Audio · Meta
  33. Direct Preference Optimization: Your Language Model is Secretly a Reward Model
    Paper · Stanford
  34. QLoRA: Efficient Finetuning of Quantized LLMs
    Paper · University of Washington
  35. Tree of Thoughts: Deliberate Problem Solving with Large Language Models
    Paper · Princeton
  36. PaLM 2
    LLM · Google
  37. Segment Anything
    Paper · Meta
  38. GPT-4 Technical Report
    Paper · OpenAI
  39. Claude Instant
    LLM · Anthropic · 28.4
  40. Claude 1
    LLM · Anthropic
  41. GPT-4
    Multimodal · OpenAI · 58.3
  42. GPT-3.5 Turbo
    LLM · OpenAI · 35.2
  43. LLaMA: Open and Efficient Foundation Language Models
    Paper · Meta
  44. Llama 65B
    LLM · Meta
  45. LLaMA
    LLM · Meta
  46. Toolformer: Language Models Can Teach Themselves to Use Tools
    Paper · Meta

2022 · 8 models · 13 papers

  1. Constitutional AI: Harmlessness from AI Feedback
    Paper · Anthropic
  2. Robust Speech Recognition via Large-Scale Weak Supervision
    Paper · OpenAI
  3. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
    Paper · BigScience
  4. Scaling Instruction-Finetuned Language Models
    Paper · Google
  5. ReAct: Synergizing Reasoning and Acting in Language Models
    Paper · Princeton
  6. Whisper
    Audio · OpenAI
  7. Stable Diffusion
    Image · Stability AI
  8. Midjourney
    Image · Midjourney
  9. BLOOM
    LLM · BigScience
  10. Emergent Abilities of Large Language Models
    Paper · Google
  11. FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
    Paper · Stanford
  12. Imagen
    Image · Google
  13. OPT-175B
    LLM · Meta
  14. OPT: Open Pre-trained Transformer Language Models
    Paper · Meta
  15. Flamingo: a Visual Language Model for Few-Shot Learning
    Paper · DeepMind
  16. DALL·E 2
    Image · OpenAI
  17. PaLM: Scaling Language Modeling with Pathways
    Paper · Google
  18. PaLM
    LLM · Google
  19. Training Compute-Optimal Large Language Models
    Paper · DeepMind
  20. Training language models to follow instructions with human feedback
    Paper · OpenAI
  21. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
    Paper · Google

2021 · 3 models · 8 papers

  1. High-Resolution Image Synthesis with Latent Diffusion Models
    Paper · LMU Munich
  2. Codex
    LLM · OpenAI
  3. AlphaFold 2
    Model · DeepMind
  4. Highly Accurate Protein Structure Prediction with AlphaFold
    Paper · DeepMind
  5. Evaluating Large Language Models Trained on Code
    Paper · OpenAI
  6. LoRA: Low-Rank Adaptation of Large Language Models
    Paper · Microsoft
  7. RoFormer: Enhanced Transformer with Rotary Position Embedding
    Paper · Zhuiyi Technology
  8. Learning Transferable Visual Models From Natural Language Supervision
    Paper · OpenAI
  9. Zero-Shot Text-to-Image Generation
    Paper · OpenAI
  10. Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
    Paper · Google
  11. DALL·E
    Image · OpenAI

2020 · 1 model · 5 papers

  1. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
    Paper · Google
  2. Denoising Diffusion Probabilistic Models
    Paper · UC Berkeley
  3. GPT-3
    LLM · OpenAI
  4. Language Models are Few-Shot Learners
    Paper · OpenAI
  5. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
    Paper · Meta
  6. Scaling Laws for Neural Language Models
    Paper · OpenAI

2019 · 4 models · 3 papers

  1. T5
    LLM · Google
  2. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
    Paper · Google
  3. Sentence-BERT
    Embedding · UKP Lab
  4. RoBERTa
    LLM · Meta
  5. RoBERTa: A Robustly Optimized BERT Pretraining Approach
    Paper · Meta
  6. GPT-2
    LLM · OpenAI
  7. Language Models are Unsupervised Multitask Learners
    Paper · OpenAI

2018 · 2 models · 2 papers

  1. BERT
    LLM · Google
  2. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
    Paper · Google
  3. GPT-1
    LLM · OpenAI
  4. Improving Language Understanding by Generative Pre-Training
    Paper · OpenAI

2017 · 1 model · 3 papers

  1. AlphaZero
    Model · DeepMind
  2. Mastering Chess and Shogi by Self-Play with a General RL Algorithm
    Paper · DeepMind
  3. Proximal Policy Optimization Algorithms
    Paper · OpenAI
  4. Attention Is All You Need
    Paper · Google

2016 · 1 model · 1 paper

  1. AlphaGo
    Model · DeepMind
  2. Mastering the Game of Go with Deep Neural Networks and Tree Search
    Paper · DeepMind

2015 · 0 models · 4 papers

  1. Deep Residual Learning for Image Recognition
    Paper · Microsoft
  2. Deep Learning
    Paper · Nature
  3. Distilling the Knowledge in a Neural Network
    Paper · Google
  4. Batch Normalization: Accelerating Deep Network Training
    Paper · Google

2014 · 1 model · 5 papers

  1. Adam: A Method for Stochastic Optimization
    Paper · University of Toronto
  2. GloVe: Global Vectors for Word Representation
    Paper · Stanford
  3. Sequence to Sequence Learning with Neural Networks
    Paper · Google
  4. GloVe
    Embedding · Stanford
  5. Dropout: A Simple Way to Prevent Neural Networks from Overfitting
    Paper · University of Toronto
  6. Generative Adversarial Networks
    Paper · University of Montreal

2013 · 1 model · 3 papers

  1. Auto-Encoding Variational Bayes
    Paper · University of Amsterdam
  2. Playing Atari with Deep Reinforcement Learning
    Paper · DeepMind
  3. word2vec
    Embedding · Google
  4. Efficient Estimation of Word Representations in Vector Space
    Paper · Google

2012 · 1 model · 1 paper

  1. AlexNet
    Model · University of Toronto
  2. ImageNet Classification with Deep Convolutional Neural Networks
    Paper · University of Toronto

2011 · 1 model · 0 papers

  1. IBM Watson
    Model · IBM

1998 · 0 models · 1 paper

  1. Gradient-Based Learning Applied to Document Recognition
    Paper · AT&T Labs

1997 · 1 model · 1 paper

  1. Long Short-Term Memory
    Paper · TU Munich
  2. Deep Blue
    Model · IBM

1986 · 0 models · 1 paper

  1. Learning Representations by Back-Propagating Errors
    Paper · UC San Diego

1966 · 1 model · 0 papers

  1. ELIZA
    Model · MIT

1958 · 0 models · 1 paper

  1. The Perceptron: A Probabilistic Model for Information Storage and Organization
    Paper · Cornell Aeronautical Laboratory

1950 · 0 models · 1 paper

  1. Computing Machinery and Intelligence
    Paper · University of Manchester

1943 · 0 models · 1 paper

  1. A Logical Calculus of the Ideas Immanent in Nervous Activity
    Paper · University of Chicago