AI Hub

Research Papers

The research that matters, distilled — search, filter by topic, sort, and group.

107 results

Topics
Tree of Thoughts: Deliberate Problem Solving with Large Language ModelsPrinceton
Agents
May 17, 2023
Segment AnythingMeta
Architecture
Apr 5, 2023
GPT-4 Technical ReportOpenAI
ArchitectureEvaluationSafety
Mar 15, 2023
LLaMA: Open and Efficient Foundation Language ModelsMeta
Training
Feb 27, 2023
Toolformer: Language Models Can Teach Themselves to Use ToolsMeta
Agents
Feb 9, 2023
Constitutional AI: Harmlessness from AI FeedbackAnthropic
SafetyReinforcement Learning
Dec 15, 2022
Robust Speech Recognition via Large-Scale Weak SupervisionOpenAI
Architecture
Dec 6, 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language ModelBigScience
Training
Nov 9, 2022
Scaling Instruction-Finetuned Language ModelsGoogle
Training
Oct 20, 2022
ReAct: Synergizing Reasoning and Acting in Language ModelsPrinceton
Agents
Oct 6, 2022
Emergent Abilities of Large Language ModelsGoogle
Evaluation
Jun 15, 2022
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-AwarenessStanford
Architecture
May 27, 2022
OPT: Open Pre-trained Transformer Language ModelsMeta
Training
May 2, 2022
Flamingo: a Visual Language Model for Few-Shot LearningDeepMind
Architecture
Apr 29, 2022
PaLM: Scaling Language Modeling with PathwaysGoogle
Training
Apr 5, 2022
Training Compute-Optimal Large Language ModelsDeepMind
Training
Mar 29, 2022
Training language models to follow instructions with human feedbackOpenAI
TrainingReinforcement LearningSafety
Mar 4, 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsGoogle
EvaluationAgents
Jan 28, 2022
High-Resolution Image Synthesis with Latent Diffusion ModelsLMU Munich
Architecture
Dec 20, 2021
Highly Accurate Protein Structure Prediction with AlphaFoldDeepMind
Architecture
Jul 15, 2021
Evaluating Large Language Models Trained on CodeOpenAI
Evaluation
Jul 7, 2021
LoRA: Low-Rank Adaptation of Large Language ModelsMicrosoft
Training
Jun 17, 2021
RoFormer: Enhanced Transformer with Rotary Position EmbeddingZhuiyi Technology
Architecture
Apr 20, 2021
Learning Transferable Visual Models From Natural Language SupervisionOpenAI
ArchitectureTraining
Feb 26, 2021