Timeline
A chronological history of AI: model releases and the research behind them, newest first.
767 entries
2026 · 126 models · 20 papers
- MiniCPM5-1B · LLM · OpenBMB · 26.9
- Tokenisation via Convex Relaxations · Paper
- Integrable Elasticity via Neural Demand Potentials · Paper
- Vector Policy Optimization: Training for Diversity Improves Test-Time Search · Paper
- Remember to be Curious: Episodic Context and Persistent Worlds for 3D Exploration · Paper
- The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation Learning · Paper
- Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting Models · Paper
- MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems · Paper
- Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention · Paper
- LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems · Paper
- Evaluating Commercial AI Chatbots as News Intermediaries · Paper
- DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback · Paper
- FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection · Paper
- SDPM: Survival Diffusion Probabilistic Model for Continuous-Time Survival Analysis · Paper
- MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking Data · Paper
- CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead Adaptation · Paper
- Deep Reinforcement Learning for Flexible Job Shop Scheduling with Random Job Arrivals · Paper
- Reducing Political Manipulation with Consistency Training · Paper
- Understanding Data Temporality Impact on Large Language Models Pre-training · Paper
- Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation · Paper
- Advancing Mathematics Research with AI-Driven Formal Proof Search · Paper
- Qwen3.7 Max · LLM · Alibaba · 92.3
- Grok Build 0.1 · Multimodal · xAI
- Gemini 3.5 Flash · Multimodal · Google · 92.2
- JT-35B-Flash · LLM · China Mobile · 82.9
- Perceptron Mk1 · Multimodal · Perceptron
- MiniCPM-V 4.6 1.3B · LLM · OpenBMB · 30.5
- Ring-2.6-1T · LLM · InclusionAI · 85.7
- Gemini 3.1 Flash Lite · Multimodal · Google · 82.2
- Grok 4.3 · LLM · xAI · 90.1
- GPT-5.5 Instant · LLM · OpenAI
- GPT Chat · Multimodal · OpenAI
- Mistral Medium 3.5 · Multimodal · Mistral AI · 74.8
- Granite 4.1 8B · LLM · IBM · 43.3
- Granite 4.1 30B · LLM · IBM · 48.1
- Granite 4.1 3B · LLM · IBM · 31.4
- Nemotron 3 Nano Omni 30B A3B Reasoning · LLM · NVIDIA · 46.9
- OpenAI GPT · Multimodal · OpenAI
- Anthropic Claude Sonnet · Multimodal · Anthropic
- Google Gemini Flash · Multimodal · Google
- MoonshotAI Kimi · Multimodal · Moonshot AI
- Google Gemini Pro · Multimodal · Google
- OpenAI GPT Mini · Multimodal · OpenAI
- Anthropic Claude Haiku · Multimodal · Anthropic
- Qwen3.6 27B · Multimodal · Alibaba · 84.2
- Qwen3.6 Max · LLM · Alibaba · 88.8
- Qwen3.6 35B A3B · Multimodal · Alibaba · 84.1
- Qwen3.6 Flash · Multimodal · Alibaba
- GPT-5.5 Pro · Multimodal · OpenAI
- DeepSeek-V4-Flash · LLM · DeepSeek · 89.4
- DeepSeek-V4-Pro · LLM · DeepSeek · 88.2
- Ling-2.6-1T · LLM · InclusionAI · 75.2
- GPT-5.5 · LLM · OpenAI · 76.1
- MiMo-V2.5 · Multimodal · Xiaomi · 84.9
- MiMo-V2.5-Pro · LLM · Xiaomi · 86.6
- Hy3 · LLM · Tencent · 86.7
- Claude Opus · Multimodal · Anthropic
- Ling-2.6-flash · LLM · InclusionAI · 59.3
- GPT-5.4 Image 2 · Multimodal · OpenAI
- Qianfan-OCR-Fast · Multimodal · Baidu
- Kimi K2.6 · LLM · Moonshot AI · 74.9
- Claude Opus 4.7 · LLM · Anthropic · 90.9
- JT-MINI · LLM · China Mobile · 67.6
- EXAONE 4.5 33B · LLM · LG AI Research · 79.4
- Muse Spark · Multimodal · Meta · 88.4
- Grok 4.20 0309 v2 · LLM · xAI · 91.1
- GLM 5.1 · LLM · Zhipu AI · 86.8
- Gemma 4 E4B · LLM · Google · 57.6
- Gemma 4 26B A4B · Multimodal · Google · 79.2
- Step 3.5 Flash 2603 · LLM · StepFun · 82.6
- Gemma 4 E2B · LLM · Google · 43.3
- Qwen3.6 Plus · Multimodal · Alibaba · 88.2
- Gemma 4 31B · Multimodal · Google · 85.7
- Trinity Large Thinking · LLM · Arcee AI · 75.2
- GLM 5V Turbo · Multimodal · Zhipu AI · 80.9
- Grok 4.20 Multi-Agent · Multimodal · xAI
- Qwen3.5 Omni Flash · LLM · Alibaba · 74.2
- Qwen3.5 Omni Plus · LLM · Alibaba · 82.6
- Lyria 3 Clip · Multimodal · Google
- Lyria 3 Pro · Multimodal · Google
- MiMo-V2-Omni-0327 · LLM · Xiaomi · 85.5
- KAT-Coder-Pro V2 · LLM · Kuaishou · 85.5
- Reka Edge · Multimodal · Reka AI
- Nemotron Cascade 2 30B A3B · LLM · NVIDIA · 75.8
- MiMo-V2-Pro · LLM · Xiaomi · 87
- MiMo-V2-Omni · Multimodal · Xiaomi · 82.8
- MiniMax M2.7 · LLM · MiniMax · 87.4
- GPT-5.4 nano · LLM · OpenAI · 81.7
- GPT-5.4 mini · LLM · OpenAI · 87.5
- NVIDIA Nemotron 3 Nano 4B · LLM · NVIDIA · 51.3
- Mistral Small 4 · Multimodal · Mistral AI · 76.9
- GLM 5 Turbo · LLM · Zhipu AI · 84.7
- NVIDIA Nemotron 3 Super 120B A12B · LLM · NVIDIA · 80
- Nemotron 3 Super · LLM · NVIDIA
- Grok 4.20 0309 · LLM · xAI · 88.5
- Seed-2.0-Lite · Multimodal · ByteDance
- Qwen3.5-9B · Multimodal · Alibaba · 80.6
- Sarvam 30B · LLM · Sarvam · 63.3
- Sarvam 105B · LLM · Sarvam · 73.8
- GPT-5.4 Pro · Multimodal · OpenAI
- GPT-5.4 · LLM · OpenAI · 74.9
- Mercury 2 · LLM · Inception · 77
- GPT-5.3 Chat · Multimodal · OpenAI
- Qwen3.5 2B · LLM · Alibaba · 45.6
- Qwen3.5 4B · LLM · Alibaba · 77.1
- Qwen3.5 0.8B · LLM · Alibaba · 23.6
- Seed-2.0-Mini · Multimodal · ByteDance
- Nano Banana 2 · Multimodal · Google
- Gemini 3.1 Pro Custom Tools · Multimodal · Google
- LFM2-24B-A2B · LLM · Liquid AI · 47.4
- Qwen3.5-Flash · Multimodal · Alibaba
- Qwen3.5-122B-A10B · Multimodal · Alibaba · 85.7
- Qwen3.5-27B · Multimodal · Alibaba · 85.8
- Qwen3.5-35B-A3B · Multimodal · Alibaba · 84.5
- GPT-5.3-Codex · Multimodal · OpenAI · 91.5
- Aion-2.0 · LLM · Aion Labs
- Gemini 3.1 Pro · Multimodal · Google · 83.2
- Tiny Aya Global · LLM · Cohere · 30.5
- Grok 4.20 · LLM · xAI
- Claude Sonnet 4.6 · LLM · Anthropic · 76.3
- Qwen3.5 397B A17B · Multimodal · Alibaba · 89.3
- Qwen3.5 Plus · Multimodal · Alibaba
- Qwen3.5 · Multimodal · Alibaba
- MiniMax M2.5 · LLM · MiniMax · 84.8
- Nanbeige4.1-3B · LLM · Nanbeige · 84.9
- GLM-5 · LLM · Zhipu AI · 81.9
- Tri-21B-Think · LLM · Trillion Labs · 60.1
- Qwen3 Max Thinking · LLM · Alibaba · 76.1
- Claude Opus 4.6 · LLM · Anthropic · 79.4
- Qwen3 Coder Next · LLM · Alibaba · 73.7
- Step 3.5 Flash · LLM · StepFun · 83.1
- LongCat Flash Lite · LLM · LongCat · 63.6
- Solar Pro 3 · LLM · Upstage · 72.4
- Kimi K2.5 · Multimodal · Moonshot AI · 87.9
- MiniMax M2-her · LLM · MiniMax
- Palmyra X5 · LLM · Writer
- Step3 VL 10B · LLM · StepFun · 69
- LFM2.5-1.2B-Thinking · LLM · Liquid AI · 33.9
- GPT Audio Mini · Multimodal · OpenAI
- GPT Audio · Multimodal · OpenAI
- GLM 4.7 Flash · LLM · Zhipu AI · 58.1
- GPT-5.2-Codex · Multimodal · OpenAI · 89.9
- Olmo 3.1 32B Instruct · LLM · Allen Institute for AI · 53.9
- LFM2.5-VL-1.6B · LLM · Liquid AI · 28.9
- LFM2.5-1.2B-Instruct · LLM · Liquid AI · 32.6
- Falcon-H1R-7B · LLM · TII UAE · 72.8
2025 · 362 models · 13 papers
- K-EXAONE · LLM · LG AI Research · 82.3
- HyperCLOVA X SEED Think · LLM · Naver · 65.5
- Seed 1.6 · Multimodal · ByteDance
- Seed 1.6 Flash · Multimodal · ByteDance
- MiniMax M2.1 · LLM · MiniMax · 83.6
- GLM 4.7 · LLM · Zhipu AI · 89
- Solar Open 100B · LLM · Upstage · 65.7
- Gemini 3 Flash · Multimodal · Google · 90.2
- K2 Think V2 · LLM · MBZUAI Institute of Foundation Models · 71.3
- NVIDIA Nemotron 3 Nano 30B A3B · LLM · NVIDIA · 80.1
- Nemotron 3 Nano · LLM · NVIDIA
- MiMo-V2-Flash · LLM · Xiaomi · 88
- Nemotron 3 Nano 30B A3B · LLM · NVIDIA
- Olmo 3.1 32B Think · LLM · Allen Institute for AI · 70.6
- Mi:dm K 2.5 Pro · LLM · Korea Telecom · 74.4
- Molmo2-8B · LLM · Allen Institute for AI · 42.5
- GPT-5.2 · LLM · OpenAI · 86.2
- GPT-5.2 Pro · Multimodal · OpenAI
- GPT-5.2 Chat · Multimodal · OpenAI
- Devstral Small 2 · LLM · Mistral AI · 47.5
- Devstral 2 · LLM · Mistral AI · 54.3
- DeepSeek V3.1 Nex N1 · LLM · Nex Agi
- Relace Search · LLM · Relace
- GLM 4.6V · Multimodal · Zhipu AI · 69.6
- Rnj 1 Instruct · LLM · Essential AI
- K2-V2 · LLM · MBZUAI Institute of Foundation Models · 73.6
- Motif-2-12.7B-Reasoning · LLM · Motif Technologies · 73.6
- GPT-5.1-Codex-Max · Multimodal · OpenAI
- Ministral 3 3B · Multimodal · Mistral AI · 33.7
- Ministral 3 8B · Multimodal · Mistral AI · 43.3
- Ministral 3 14B · Multimodal · Mistral AI · 47.9
- Nova 2 Lite · Multimodal · Amazon · 82.1
- Mistral Large 3 · LLM · Mistral AI · 58.3
- DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models · Paper · DeepSeek AI
- Trinity Mini · LLM · Arcee AI
- DeepSeek V3.2 Speciale · LLM · DeepSeek · 89.9
- Seedream 4.5 · Image · ByteDance
- Seedance 1.5 Pro · Video · ByteDance
- Kling O1 · Video · Kuaishou
- Runway Gen-4.5 · Video · Runway
- DeepSeek-V3.2 · LLM · DeepSeek · 87.1
- Nova 2.0 Pro · LLM · Amazon · 80.9
- INTELLECT-3 · LLM · Prime Intellect · 81
- Nova 2.0 Omni · LLM · Amazon · 78.2
- Apriel-v1.6-15B-Thinker · LLM · ServiceNow · 80.3
- FLUX.2 · Image · Black Forest Labs
- Claude Opus 4.5 · LLM · Anthropic · 88
- Natural Emergent Misalignment from Reward Hacking in Production RL · Paper · Anthropic
- Olmo 3 32B Think · LLM · Allen Institute for AI · 69.5
- HunyuanVideo 1.5 · Video · Tencent
- Olmo 3 7B Think · LLM · Allen Institute for AI · 62.4
- Olmo 3 7B Instruct · LLM · Allen Institute for AI · 40
- Nano Banana Pro · Multimodal · Google
- Gemini 3 Pro Image · Image · Google
- OLMo 3 · LLM · Allen Institute for AI
- Grok 4.1 Fast · LLM · xAI · 85.6
- Cogito v2.1 · LLM · Deep Cogito · 75.8
- Gemini 3 Deep Think · Multimodal · Google · 69.5
- Gemini 3 Pro · Multimodal · Google · 82.8
- Grok 4.1 · LLM · xAI
- ERNIE 5.0 Thinking · LLM · Baidu · 81.7
- Cogito v2.1 671B · LLM · Deep Cogito
- GPT-5.1-Codex-Mini · Multimodal · OpenAI · 84.7
- GPT-5.1-Codex · Multimodal · OpenAI · 88.2
- GPT-5.1 Chat · Multimodal · OpenAI
- GPT-5.1 · LLM · OpenAI · 89
- Olympiad-level Formal Mathematical Reasoning with Reinforcement Learning (AlphaProof) · Paper · Google DeepMind
- Doubao Seed Code · LLM · ByteDance · 79.4
- KAT-Coder-Pro V1 · LLM · Kuaishou · 81.8
- Kimi K2 Thinking · LLM · Moonshot AI · 85.6
- Nova Premier 1.0 · Multimodal · Amazon
- Kimi Linear 48B A3B Instruct · LLM · Moonshot AI · 43.5
- Sonar Pro Search · Multimodal · Perplexity
- Voxtral Small 24B · Multimodal · Mistral AI
- gpt-oss-safeguard-20b · LLM · OpenAI
- Granite 4.0 H 1B · LLM · IBM · 18
- Granite 4.0 1B · LLM · IBM · 17.9
- Granite 4.0 350M · LLM · IBM · 10.2
- Granite 4.0 H 350M · LLM · IBM · 10.4
- NVIDIA Nemotron Nano 12B v2 VL · LLM · NVIDIA · 69.4
- MiniMax-M2 · LLM · MiniMax · 76
- Qwen3 VL 32B Instruct · Multimodal · Alibaba · 66.5
- Qwen3 VL 32B · LLM · Alibaba · 78.4
- Granite 4.0 Micro · LLM · IBM · 25.6
- Phi 4 Mini Instruct · LLM · Microsoft · 32.6
- GPT-5 Image Mini · Multimodal · OpenAI
- Veo 3.1 · Video · Google
- Claude Haiku 4.5 · LLM · Anthropic · 75.3
- Qwen3 VL 8B · LLM · Alibaba · 49.7
- Qwen3 VL 4B Instruct · LLM · Alibaba · 41.6
- Qwen3 VL 4B · LLM · Alibaba · 44.3
- GPT-5 Image · Multimodal · OpenAI
- Qwen3 VL 8B Instruct · Multimodal · Alibaba · 43
- Qwen3 VL 8B Thinking · Multimodal · Alibaba
- Ring-1T · LLM · InclusionAI · 77.9
- Llama 3.3 Nemotron Super 49B V1.5 · LLM · NVIDIA
- o4 Mini Deep Research · Multimodal · OpenAI
- o3 Deep Research · Multimodal · OpenAI
- ERNIE 4.5 21B A3B Thinking · LLM · Baidu
- Ling-1T · LLM · InclusionAI · 73.3
- Jamba Reasoning 3B · LLM · AI21 Labs · 30.7
- LFM2 8B A1B · LLM · Liquid AI · 31.3
- Nano Banana · Multimodal · Google
- Qwen3 VL 30B A3B Instruct · Multimodal · Alibaba · 66.5
- Qwen3 VL 30B A3B Thinking · Multimodal · Alibaba
- GPT-5 Pro · LLM · OpenAI · 88.4
- Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models · Paper · Stanford / SambaNova / UC Berkeley
- Qwen3 VL 30B A3B · LLM · Alibaba · 76.2
- IBM Granite 4.0 · LLM · IBM
- Apriel-v1.5-15B-Thinker · LLM · ServiceNow · 77.2
- Sora 2 · Video · OpenAI
- GLM-4.6 · LLM · Zhipu AI · 72.4
- DeepSeek V3.2 Exp · LLM · DeepSeek · 72.2
- Claude Sonnet 4.5 · LLM · Anthropic · 80.4
- HunyuanImage 3.0 · Image · Tencent
- Relace Apply 3 · LLM · Relace
- Gemini 2.5 Flash · LLM · Google · 78.3
- Gemini 2.5 Flash Lite 09- · Multimodal · Google
- Qwen3 VL 235B A22B · LLM · Alibaba · 78.4
- LFM2 2.6B · LLM · Liquid AI · 19.2
- GPT-5 Codex · Multimodal · OpenAI · 87.1
- Qwen3 Coder Plus · LLM · Alibaba
- Qwen3 Max · LLM · Alibaba · 79.5
- Qwen3 VL 235B A22B Instruct · Multimodal · Alibaba · 70.9
- Qwen3 VL 235B A22B Thinking · Multimodal · Alibaba
- Kling 2.5 Turbo · Video · Kuaishou
- Qwen3 Omni 30B A3B Instruct · LLM · Alibaba · 57.3
- Qwen3 Omni 30B A3B · LLM · Alibaba · 73.4
- Granite 4.0 H Small · LLM · IBM · 35.7
- DeepSeek V3.1 Terminus · LLM · DeepSeek · 83.5
- Ring-flash-2.0 · LLM · InclusionAI · 74.6
- Grok 4 Fast · LLM · xAI · 78.7
- Magistral Medium 1.2 · LLM · Mistral AI · 78.1
- Tongyi DeepResearch 30B A3B · LLM · Alibaba
- Luma Ray3 · Video · Luma AI
- Ling-flash-2.0 · LLM · InclusionAI · 66.9
- Magistral Small 1.2 · LLM · Mistral AI · 73.9
- Qwen3 Coder Flash · LLM · Alibaba
- Qwen3 Next 80B A3B Instruct · LLM · Alibaba · 68.9
- Qwen3 Next 80B A3B Thinking · LLM · Alibaba · 77.5
- Stable Audio 2.5 · Audio · Stability AI
- Qwen3-Next-80B-A3B · LLM · Alibaba · 80.3
- Ling-mini-2.0 · LLM · InclusionAI · 53.9
- Gemini 2.5 Flash-Lite · LLM · Google · 72.3
- Qwen Plus · LLM · Alibaba
- Kimi K2-Instruct-0905 · LLM · Moonshot AI · 66
- Kimi K2 0905 · LLM · Moonshot AI · 71
- Nemotron Nano 9B V2 · LLM · NVIDIA · 77.6
- Seedream 4.0 · Image · ByteDance
- Apertus 70B Instruct · LLM · Swiss AI Initiative · 27.2
- Apertus 8B Instruct · LLM · Swiss AI Initiative · 25.6
- Suno v5 · Audio · Suno
- Grok Code Fast 1 · LLM · xAI · 65.3
- Qwen3 30B A3B Thinking · LLM · Alibaba
- Hermes 4 - Llama-3.1 405B · LLM · Nous Research · 73.5
- Hermes 4 - Llama-3.1 70B · LLM · Nous Research · 71.3
- Hermes 4 405B · LLM · Nous Research
- Hermes 4 70B · LLM · Nous Research
- Gemini 2.5 Flash Image · Image · Google
- Command A Reasoning · LLM · Cohere
- DeepSeek-V3.1 · LLM · DeepSeek · 59.8
- Seed-OSS-36B-Instruct · LLM · ByteDance · 78.8
- NVIDIA Nemotron Nano 9B V2 · LLM · NVIDIA · 68.3
- GPT-4o Audio · Multimodal · OpenAI
- Imagen 4 · Image · Google
- Gemma 3 270M · LLM · Google · 7.6
- Mistral Medium 3.1 · Multimodal · Mistral AI · 51.5
- ERNIE 4.5 VL 28B A3B · Multimodal · Baidu
- ERNIE 4.5 21B A3B · LLM · Baidu
- GLM 4.5V · Multimodal · Zhipu AI · 70.1
- gpt-oss-120b & gpt-oss-20b Model Card · Paper · OpenAI
- Jamba Large 1.7 · LLM · AI21 Labs · 36.5
- GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models · Paper · Z.ai (Zhipu AI)
- GPT-5 Chat · Multimodal · OpenAI
- GPT-5 nano · LLM · OpenAI · 71.2
- GPT-5 mini · LLM · OpenAI · 79.2
- GPT-5 · LLM · OpenAI · 80.5
- Qwen3 4B 2507 · LLM · Alibaba · 71.9
- Qwen3 4B 2507 Instruct · LLM · Alibaba · 52.2
- Genie 3 · Video · Google
- gpt-oss-20b · LLM · OpenAI · 73.6
- gpt-oss-120b · LLM · OpenAI · 79.6
- Claude Opus 4.1 · LLM · Anthropic · 75.4
- Codestral · LLM · Mistral AI
- Qwen3 Coder 30B A3B Instruct · LLM · Alibaba · 55.4
- Qwen3 30B A3B 2507 · LLM · Alibaba · 74.7
- Qwen3 30B A3B 2507 Instruct · LLM · Alibaba · 69.3
- Qwen3 30B A3B Instruct · LLM · Alibaba
- Wan 2.2 · Video · Alibaba
- GLM-4.5 · LLM · Zhipu AI · 73
- Kimi K2: Open Agentic Intelligence · Paper · Moonshot AI
- Qwen3 235B A22B 2507 · LLM · Alibaba · 84.2
- Llama Nemotron Super 49B v1.5 · LLM · NVIDIA · 79.4
- Qwen3-235B-A22B-Thinking-2507 · LLM · Alibaba · 79.6
- Qwen3 235B A22B Thinking · LLM · Alibaba
- GLM 4.5 Air · LLM · Zhipu AI · 70.4
- GLM 4 32B · LLM · Zhipu AI
- Group Sequence Policy Optimization · Paper · Alibaba (Qwen Team)
- Qwen3 Coder 480B A35B · LLM · Alibaba
- Qwen3 Coder 480B A35B Instruct · LLM · Alibaba · 66.5
- UI-TARS 7B · Multimodal · ByteDance
- Qwen3-235B-A22B-Instruct-2507 · LLM · Alibaba · 72.2
- Gemini 2.5 Flash Lite · Multimodal · Google · 57
- Qwen3-Coder · LLM · Alibaba · 55.4
- Qwen3 235B A22B Instruct · LLM · Alibaba
- EXAONE 4.0 32B · LLM · LG AI Research · 79.8
- Exaone 4.0 1.2B · LLM · LG AI Research · 53.1
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation · Paper · KAIST AI / Google DeepMind / Mila
- Switchpoint Router · LLM · Switchpoint
- Kimi K2 Instruct · LLM · Moonshot AI · 66.5
- Kimi K2 Base · LLM · Moonshot AI · 50.2
- Kimi K2 · LLM · Moonshot AI · 73.6
- LFM2 1.2B · LLM · Liquid AI · 13.5
- Devstral Small 1.1 · LLM · Mistral AI
- Devstral Medium · LLM · Mistral AI · 47.8
- Grok-4 Heavy · Multimodal · xAI · 89.3
- Grok 4 · LLM · xAI · 78.2
- Hunyuan A13B Instruct · LLM · Tencent
- Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities · Paper · Google DeepMind
- Jamba 1.7 Mini · LLM · AI21 Labs · 22.5
- Morph V3 Fast · LLM · Morph
- Morph V3 Large · LLM · Morph
- ERNIE 4.5 300B A47B · LLM · Baidu · 68.2
- ERNIE 4.5 VL 424B A47B · Multimodal · Baidu
- ERNIE 4.5 · LLM · Baidu
- FLUX.1 Kontext [dev] · Image · Black Forest Labs
- Hunyuan-A13B · LLM · Tencent
- Gemma 3n E2B Instruct · LLM · Google · 27.5
- Gemma 3n E4B Instructed · Multimodal · Google · 24.8
- Gemma 3n E4B · Multimodal · Google · 56.9
- Gemma 3n E2B Instructed · Multimodal · Google · 21.3
- Gemma 3n E2B · Multimodal · Google · 49.1
- Mistral Small 3.2 · LLM · Mistral AI · 51
- Mistral Small 3.2 24B Instruct · Multimodal · Mistral AI · 56.2
- Mistral Small 3.2 24B · Multimodal · Mistral AI
- MiniMax M1 40k · LLM · MiniMax · 67.6
- MiniMax M1 80k · LLM · MiniMax · 75.5
- MiniMax-M1 · LLM · MiniMax · 61.5
- MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention · Paper · MiniMax AI
- Seedance 1.0 · Video · ByteDance
- Magistral Small 1 · LLM · Mistral AI · 64.7
- Magistral Medium 1 · LLM · Mistral AI · 65.5
- Magistral Small 2506 · LLM · Mistral AI · 62.1
- Magistral Medium · Multimodal · Mistral AI · 62.9
- o3 Pro · Multimodal · OpenAI · 84.5
- Magistral Small · LLM · Mistral AI
- Gemini 2.5 Pro Preview 06-05 · Multimodal · Google · 76.6
- ElevenLabs v3 · Audio · ElevenLabs
- DeepSeek R1 0528 Qwen3 8B · LLM · DeepSeek · 66.2
- DeepSeek-R1-0528 · LLM · DeepSeek · 63.3
- Sarvam M · LLM · Sarvam · 56.4
- Claude Sonnet 4 · LLM · Anthropic · 74.5
- Claude Opus 4 · LLM · Anthropic · 69.4
- Devstral Small · LLM · Mistral AI · 45.3
- Gemma 3n E4B Instruct · LLM · Google · 34.7
- Llama 3.1 Nemotron Nano 4B v1.1 · LLM · NVIDIA · 54.5
- Solar Pro 2 · LLM · Upstage · 72.5
- MedGemma 4B IT · Multimodal · Google
- Gemma 3n E4B Instructed LiteRT Preview · Multimodal · Google · 30.3
- Gemma 3n E2B Instructed LiteRT (Preview) · Multimodal · Google · 25.4
- Gemini Diffusion · LLM · Google · 30.2
- Gemma 3n 4B · LLM · Google
- Lyria 2 · Audio · Google
- Veo 3 · Video · Google
- Qwen3 Technical Report · Paper · Alibaba (Qwen Team)
- Mistral Medium 3 · Multimodal · Mistral AI · 58.6
- Coder Large · LLM · Arcee AI
- Virtuoso Large · LLM · Arcee AI
- Maestro Reasoning · LLM · Arcee AI
- Spotlight · Multimodal · Arcee AI
- IBM Granite 4.0 Tiny Preview · LLM · IBM · 48
- Nova Premier · LLM · Amazon · 53.1
- Phi 4 Reasoning Plus · LLM · Microsoft · 70.4
- Phi 4 Reasoning · LLM · Microsoft · 66.4
- Phi 4 Mini Reasoning · LLM · Microsoft · 73.3
- Llama Guard 4 12B · Multimodal · Meta
- Qwen3 0.6B · LLM · Alibaba · 29.3
- Qwen3 4B · LLM · Alibaba · 56.5
- Qwen3 1.7B · LLM · Alibaba · 46.9
- Qwen3 235B A22B · LLM · Alibaba · 74.9
- Qwen3 32B · LLM · Alibaba · 73.8
- Qwen3 14B · LLM · Alibaba · 66.8
- Qwen3 8B · LLM · Alibaba · 57.8
- Qwen3 30B A3B · LLM · Alibaba · 71.7
- Qwen3 · LLM · Alibaba · 75.8
- Gemini 2.5 Flash · Multimodal · Google · 73.1
- Granite 3.3 8B · LLM · IBM · 32.5
- Granite 3.3 8B Instruct · Multimodal · IBM · 68.5
- Granite 3.3 8B Base · Multimodal · IBM · 64.6
- o4 Mini High · Multimodal · OpenAI
- o4-mini · Multimodal · OpenAI · 78.4
- o3 · LLM · OpenAI · 71.6
- GPT-4.1 Nano · Multimodal · OpenAI · 42.1
- GPT-4.1 Mini · Multimodal · OpenAI · 58.2
- GPT-4.1 · Multimodal · OpenAI · 63.8
- Llama 3.1 Nemotron Ultra 253B v1 · LLM · NVIDIA · 78.3
- Llama 4 Scout · Multimodal · Meta · 58.9
- Llama 4 Maverick · Multimodal · Meta · 63.9
- Runway Gen-4 · Video · Runway
- Qwen2.5-Omni-7B · Multimodal · Alibaba · 51.5
- DeepSeek-V3 0324 · LLM · DeepSeek · 65.9
- Gemini 2.5 Pro · Multimodal · Google · 71.6
- o1-pro · Multimodal · OpenAI · 82.5
- Llama-3.3 Nemotron Super 49B v1 · LLM · NVIDIA · 63.9
- Llama 3.1 Nemotron Nano 8B V1 · LLM · NVIDIA · 68.2
- Mistral Small 3.1 · LLM · Mistral AI · 42.4
- Mistral Small 3.1 24B Instruct · Multimodal · Mistral AI · 48
- Mistral Small 3.1 24B Base · Multimodal · Mistral AI · 50.9
- Mistral Small 3.1 24B · Multimodal · Mistral AI
- OLMo 2 32B · LLM · Allen Institute for AI · 23.5
- Gemma 3 1B Instruct · LLM · Google · 16.2
- DeepHermes 3 - Mistral 24B · LLM · Nous Research · 43.8
- Command A · LLM · Cohere · 55.9
- Gemma 3 12B · Multimodal · Google · 55.5
- Gemma 3 4B · Multimodal · Google · 45.8
- Gemma 3 12B Instruct · LLM · Google · 40
- Gemma 3 4B Instruct · LLM · Google · 31.7
- Gemma 3 27B Instruct · LLM · Google · 44.5
- Reka Flash 3 · LLM · Reka AI · 56.2
- Gemma 3 1B · LLM · Google · 21.2
- Gemma 3 27B · Multimodal · Google · 58.4
- GPT-4o Search · LLM · OpenAI
- GPT-4o-mini Search · LLM · OpenAI
- Sonar Deep Research · LLM · Perplexity
- Sonar Pro · Multimodal · Perplexity · 58.8
- Sonar Reasoning Pro · Multimodal · Perplexity · 95.7
- Jamba 1.6 Large · LLM · AI21 Labs · 42.6
- Jamba 1.6 Mini · LLM · AI21 Labs · 24.9
- QwQ-32B · LLM · Alibaba · 67.8
- Qwen2.5 VL 32B Instruct · Multimodal · Alibaba · 62.1
- GPT-4.5 · Multimodal · OpenAI · 59.4
- Gemini 2.0 Flash Lite · Multimodal · Google · 54.4
- Claude 3.7 Sonnet · LLM · Anthropic · 74.7
- Grok 3 Reasoning · LLM · xAI
- Grok 3 mini Reasoning · LLM · xAI · 80.9
- R1 1776 · LLM · Perplexity · 95.4
- Mistral Saba · LLM · Mistral AI · 57.1
- Saba · LLM · Mistral AI
- Grok-3 Mini · Multimodal · xAI · 85.9
- Grok-3 · Multimodal · xAI · 82.6
- DeepHermes 3 - Llama-3.1 8B · LLM · Nous Research · 23.5
- o3 Mini High · LLM · OpenAI
- Llama Guard 3 8B · LLM · Meta
- Gemini 2.0 Pro · LLM · Google · 67.4
- Aion-RP 1.0 · LLM · Aion Labs
- Aion-1.0-Mini · LLM · Aion Labs
- Aion-1.0 · LLM · Aion Labs
- Phi-4-multimodal-instruct · Multimodal · Microsoft · 46.2
- Phi 4 Mini · LLM · Microsoft · 45.3
- Qwen2.5 VL 72B Instruct · Multimodal · Alibaba · 79.1
- o3-mini · LLM · OpenAI · 64.1
- Llama 3.1 Tulu3 405B · LLM · Allen Institute for AI · 57.5
- Mistral Small 3 24B Instruct · LLM · Mistral AI · 62.1
- Mistral Small 3 24B Base · Multimodal · Mistral AI · 44.4
- Mistral Small 3 · LLM · Mistral AI · 43.6
- R1 Distill Qwen 32B · LLM · DeepSeek
- Qwen2.5 Max · LLM · Alibaba · 63.6
- Sonar Reasoning · LLM · Perplexity · 77.2
- Sonar · Multimodal · Perplexity · 56.8
- Qwen2.5 VL 7B Instruct · Multimodal · Alibaba · 70
- R1 Distill Llama 70B · LLM · DeepSeek
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning · Paper · DeepSeek
- Gemini 2.0 Flash Thinking · Multimodal · Google · 69.1
- Kimi-k1.5 · Multimodal · Moonshot AI · 82.2
- DeepSeek R1 Zero · LLM · DeepSeek · 71.5
- DeepSeek R1 Distill Qwen 7B · LLM · DeepSeek · 58.3
- DeepSeek R1 Distill Qwen 32B · LLM · DeepSeek · 68.4
- DeepSeek R1 Distill Qwen 14B · LLM · DeepSeek · 65.7
- DeepSeek R1 Distill Qwen 1.5B · LLM · DeepSeek · 32.6
- DeepSeek R1 Distill Llama 8B · LLM · DeepSeek · 53.3
- DeepSeek R1 Distill Llama 70B · LLM · DeepSeek · 70.1
- R1 · LLM · DeepSeek
- DeepSeek-R1 · LLM · DeepSeek · 75
- MiniMax-01 · Multimodal · MiniMax
- Phi 4 · LLM · Microsoft · 47.6
2024 · 112 models · 8 papers
- DeepSeek-V3 Technical Report · Paper · DeepSeek
- DeepSeek-V3 · LLM · DeepSeek · 58.1
- QvQ-72B-Preview · Multimodal · Alibaba · 70.9
- GPT-4o Realtime · LLM · OpenAI
- GPT-4o mini Realtime · LLM · OpenAI
- Veo 2 · Video · Google
- Command R7B · LLM · Cohere
- DeepSeek VL2 Tiny · Multimodal · DeepSeek · 67.2
- DeepSeek VL2 Small · Multimodal · DeepSeek · 73.1
- DeepSeek VL2 · Multimodal · DeepSeek · 74.9
- Gemini 2.0 Flash · Multimodal · Google · 60.3
- Sora · Video · OpenAI
- Llama 3.3 70B Instruct · LLM · Meta · 50.6
- Llama 3.3 70B · LLM · Meta
- Nova Pro 1.0 · Multimodal · Amazon
- Nova Micro 1.0 · LLM · Amazon
- Nova Lite 1.0 · Multimodal · Amazon
- o1 · LLM · OpenAI · 65.4
- QwQ-32B-Preview · LLM · Alibaba · 62.6
- OLMo 2 7B · LLM · Allen Institute for AI · 15.5
- Nova Pro · Multimodal · Amazon · 61.6
- Nova Micro · LLM · Amazon · 49
- Nova Lite · Multimodal · Amazon · 57.7
- Pixtral Large · Multimodal · Mistral AI · 53.1
- Mistral Large · LLM · Mistral AI · 39.3
- Qwen2.5 Turbo · LLM · Alibaba · 50.3
- Qwen2.5 Coder 32B Instruct · LLM · Alibaba · 50.1
- Claude 3.5 Haiku · LLM · Anthropic · 54.5
- Ministral 8B Instruct · LLM · Mistral AI · 70.9
- Qwen2.5 7B Instruct · LLM · Alibaba · 46.6
- Inflection 3 Pi · LLM · Inflection
- Inflection 3 Productivity · LLM · Inflection
- Reka Flash · LLM · Reka AI · 52.9
- Llama 3.1 Nemotron 70B Instruct · LLM · NVIDIA · 43.7
- LFM 40B · LLM · Liquid AI · 33.2
- Molmo 7B-D · LLM · Allen Institute for AI · 16.3
- Llama 3.2 90B Instruct · Multimodal · Meta · 54
- Llama 3.2 11B Instruct · Multimodal · Meta · 36.7
- Llama 3.2 3B Instruct · LLM · Meta · 30.8
- Llama 3.2 1B Instruct · LLM · Meta · 12.1
- Llama 3.2 11B Vision Instruct · Multimodal · Meta
- Qwen2.5-Coder 7B Instruct · LLM · Alibaba · 39.6
- Qwen2.5 32B Instruct · LLM · Alibaba · 66.7
- Qwen2.5 14B Instruct · LLM · Alibaba · 66.1
- Qwen2.5 72B Instruct · LLM · Alibaba · 59.1
- Qwen2.5 72B · LLM · Alibaba
- Pixtral-12B · Multimodal · Mistral AI · 66.1
- o1-preview · LLM · OpenAI · 57.3
- o1-mini · LLM · OpenAI · 70.5
- Qwen2-VL-72B-Instruct · Multimodal · Alibaba · 67.3
- Phi-3.5-vision-instruct · Multimodal · Microsoft · 61.7
- Phi-3.5-MoE-instruct · LLM · Microsoft · 49.8
- Phi-3.5-mini-instruct · LLM · Microsoft · 46
- Jamba 1.5 Mini · LLM · AI21 Labs · 29.6
- Jamba 1.5 Large · LLM · AI21 Labs · 42.8
- Hermes 3 70B Instruct · LLM · Nous Research
- Hermes 3 405B Instruct · LLM · Nous Research
- Hermes 3 - Llama-3.1 70B · LLM · Nous Research · 42.5
- Grok · LLM · xAI · 53.8
- Grok-2 mini · Multimodal · xAI · 65.9
- Grok-2 · Multimodal · xAI · 62.4
- FLUX.1 · Image · Black Forest Labs
- The Llama 3 Herd of Models · Paper · Meta
- Mistral Large 2 · LLM · Mistral AI · 47.9
- Qwen2 7B Instruct · LLM · Alibaba · 37.4
- Qwen2 72B Instruct · LLM · Alibaba · 59.9
- Llama 3.1 405B Instruct · LLM · Meta · 60.9
- Llama 3.1 8B Instruct · LLM · Meta · 45
- Llama 3.1 70B Instruct · LLM · Meta · 56
- Llama 3.1 405B · LLM · Meta
- Mistral Nemo · LLM · Mistral AI
- Mistral NeMo Instruct · LLM · Mistral AI
- GPT-4o-mini · Multimodal · OpenAI · 49.1
- Qwen2 Technical Report · Paper · Alibaba
- Gemma 2 27B · LLM · Google
- Runway Gen-3 Alpha · Video · Runway
- Gemma 2 9B · LLM · Google
- Gemma 2 · LLM · Google
- Claude 3.5 Sonnet · LLM · Anthropic · 70.3
- DeepSeek Coder V2 Lite Instruct · LLM · DeepSeek · 30.2
- DeepSeek-Coder-V2 · LLM · DeepSeek · 74.3
- Stable Diffusion 3 · Image · Stability AI
- Qwen2 72B · LLM · Alibaba
- Codestral-22B · LLM · Mistral AI
- Hermes 2 Pro - Llama-3 8B · LLM · Nous Research
- GPT-4o · Multimodal · OpenAI · 56.4
- DeepSeek-V2.5 · LLM · DeepSeek · 63.4
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Model · Paper · DeepSeek
- DeepSeek-V2-Chat · LLM · DeepSeek
- DeepSeek-V2 · LLM · DeepSeek
- Gemini 1.5 Flash · Multimodal · Google · 61.9
- Qwen1.5 Chat 110B · LLM · Alibaba · 28.9
- Arctic Instruct · LLM · Snowflake
- Phi-3 Mini Instruct 3.8B · LLM · Microsoft · 27.5
- Phi-3 · LLM · Microsoft
- Phi-3 Technical Report · Paper · Microsoft
- Llama 3 8B Instruct · LLM · Meta · 32.4
- Llama 3 70B Instruct · LLM · Meta · 40.8
- Llama 3 70B · LLM · Meta
- Mixtral 8x22B Instruct · LLM · Mistral AI · 39.1
- WizardLM-2 8x22B · LLM · Microsoft
- Grok-1.5V · Multimodal · xAI · 71.3
- Mixtral 8x22B · LLM · Mistral AI
- Command R+ · LLM · Cohere · 28.9
- Grok-1.5 · LLM · xAI · 50.3
- DBRX Instruct · LLM · Databricks · 27.5
- DBRX · LLM · Databricks
- Gemini 1.5 Flash 8B · Multimodal · Google · 48.4
- Claude 3 Haiku · Multimodal · Anthropic · 38.9
- Gemma: Open Models Based on Gemini Research and Technology · Paper · Google
- Gemini 1.5: Unlocking Multimodal Understanding Across Millions of Tokens of Context · Paper · Google
- Claude 3 Opus · LLM · Anthropic · 58.5
- Claude 3 Sonnet · Multimodal · Anthropic · 45.8
- Mistral Small · LLM · Mistral AI · 40.3
- Gemma · LLM · Google
- Gemini 1.0 Pro · LLM · Google · 34
- Gemini 1.5 Pro · Multimodal · Google · 67.3
- Solar Mini · LLM · Upstage · 33.1
- text-embedding-3-large · Embedding · OpenAI
- Mixtral of Experts · Paper · Mistral AI
2023 · 34 models · 12 papers
- Gemini: A Family of Highly Capable Multimodal Models · Paper · Google
- OpenChat 3.5 · LLM · OpenChat · 24.1
- Phi-2 · LLM · Microsoft
- Mistral Medium · LLM · Mistral AI · 33.6
- Mixtral 8x7B Instruct · LLM · Mistral AI · 26.1
- Mixtral 8x7B · LLM · Mistral AI
- Gemini 1.0 Ultra · LLM · Google
- Gemini 1.0 · Multimodal · Google
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces · Paper · Carnegie Mellon
- Qwen Chat 72B · LLM · Alibaba
- DeepSeek LLM 67B Chat · LLM · DeepSeek
- Claude 2.1 · LLM · Anthropic · 34.6
- Stable Video Diffusion · Video · Stability AI
- GPT-4 Turbo · Multimodal · OpenAI · 59.8
- Grok-1 · LLM · xAI
- Mistral 7B · Paper · Mistral AI
- DALL·E 3 · Image · OpenAI
- GPT-3.5 Turbo Instruct · LLM · OpenAI
- Mistral 7B Instruct v0.1 · LLM · Mistral AI
- Mistral 7B Instruct · LLM · Mistral AI · 14.7
- Mistral 7B · LLM · Mistral AI
- Qwen Chat 14B · LLM · Alibaba
- GPT-3.5 Turbo 16k · LLM · OpenAI
- Stable Diffusion XL · Image · Stability AI
- Llama 2 Chat 7B · LLM · Meta · 11.3
- Llama 2 Chat 13B · LLM · Meta · 28.9
- Llama 2 Chat 70B · LLM · Meta · 28.9
- Llama 2 70B · LLM · Meta
- Llama 2: Open Foundation and Fine-Tuned Chat Models · Paper · Meta
- Claude 2 · LLM · Anthropic · 33.4
- Textbooks Are All You Need · Paper · Microsoft
- MusicGen · Audio · Meta
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model · Paper · Stanford
- QLoRA: Efficient Finetuning of Quantized LLMs · Paper · University of Washington
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models · Paper · Princeton
- PaLM 2 · LLM · Google
- Segment Anything · Paper · Meta
- GPT-4 Technical Report · Paper · OpenAI
- Claude Instant · LLM · Anthropic · 28.4
- Claude 1 · LLM · Anthropic
- GPT-4 · Multimodal · OpenAI · 58.3
- GPT-3.5 Turbo · LLM · OpenAI · 35.2
- LLaMA: Open and Efficient Foundation Language Models · Paper · Meta
- Llama 65B · LLM · Meta
- LLaMA · LLM · Meta
- Toolformer: Language Models Can Teach Themselves to Use Tools · Paper · Meta
2022 · 8 models · 13 papers
- Constitutional AI: Harmlessness from AI Feedback · Paper · Anthropic
- Robust Speech Recognition via Large-Scale Weak Supervision · Paper · OpenAI
- BLOOM: A 176B-Parameter Open-Access Multilingual Language Model · Paper · BigScience
- Scaling Instruction-Finetuned Language Models · Paper · Google
- ReAct: Synergizing Reasoning and Acting in Language Models · Paper · Princeton
- Whisper · Audio · OpenAI
- Stable Diffusion · Image · Stability AI
- Midjourney · Image · Midjourney
- BLOOM · LLM · BigScience
- Emergent Abilities of Large Language Models · Paper · Google
- FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness · Paper · Stanford
- Imagen · Image · Google
- OPT-175B · LLM · Meta
- OPT: Open Pre-trained Transformer Language Models · Paper · Meta
- Flamingo: a Visual Language Model for Few-Shot Learning · Paper · DeepMind
- DALL·E 2 · Image · OpenAI
- PaLM: Scaling Language Modeling with Pathways · Paper · Google
- PaLM · LLM · Google
- Training Compute-Optimal Large Language Models · Paper · DeepMind
- Training language models to follow instructions with human feedback · Paper · OpenAI
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models · Paper · Google
2021 · 3 models · 8 papers
- High-Resolution Image Synthesis with Latent Diffusion Models · Paper · LMU Munich
- Codex · LLM · OpenAI
- AlphaFold 2 · Model · DeepMind
- Highly Accurate Protein Structure Prediction with AlphaFold · Paper · DeepMind
- Evaluating Large Language Models Trained on Code · Paper · OpenAI
- LoRA: Low-Rank Adaptation of Large Language Models · Paper · Microsoft
- RoFormer: Enhanced Transformer with Rotary Position Embedding · Paper · Zhuiyi Technology
- Learning Transferable Visual Models From Natural Language Supervision · Paper · OpenAI
- Zero-Shot Text-to-Image Generation · Paper · OpenAI
- Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity · Paper · Google
- DALL·E · Image · OpenAI
2020 · 1 model · 5 papers
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale · Paper · Google
- Denoising Diffusion Probabilistic Models · Paper · UC Berkeley
- GPT-3 · LLM · OpenAI
- Language Models are Few-Shot Learners · Paper · OpenAI
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks · Paper · Meta
- Scaling Laws for Neural Language Models · Paper · OpenAI
2019 · 4 models · 3 papers
2018 · 2 models · 2 papers
2017 · 1 model · 3 papers
2016 · 1 model · 1 paper
2015 · 0 models · 4 papers
2014 · 1 model · 5 papers
- Adam: A Method for Stochastic Optimization · Paper · University of Toronto
- GloVe: Global Vectors for Word Representation · Paper · Stanford
- Sequence to Sequence Learning with Neural Networks · Paper · Google
- GloVe · Embedding · Stanford
- Dropout: A Simple Way to Prevent Neural Networks from Overfitting · Paper · University of Toronto
- Generative Adversarial Networks · Paper · University of Montreal