AI Hub

Research Papers

The research that matters, distilled — search, filter by topic, sort, and group.

107 results

Topics
Zero-Shot Text-to-Image GenerationOpenAI
Architecture
Feb 24, 2021
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient SparsityGoogle
ArchitectureTraining
Jan 11, 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at ScaleGoogle
Architecture
Oct 22, 2020
Denoising Diffusion Probabilistic ModelsUC Berkeley
Architecture
Jun 19, 2020
Language Models are Few-Shot LearnersOpenAI
TrainingEvaluation
May 28, 2020
Retrieval-Augmented Generation for Knowledge-Intensive NLP TasksMeta
Architecture
May 22, 2020
Scaling Laws for Neural Language ModelsOpenAI
Training
Jan 23, 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text TransformerGoogle
Training
Oct 23, 2019
RoBERTa: A Robustly Optimized BERT Pretraining ApproachMeta
Training
Jul 26, 2019
Language Models are Unsupervised Multitask LearnersOpenAI
TrainingEvaluation
Feb 14, 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingGoogle
ArchitectureTraining
Oct 11, 2018
Improving Language Understanding by Generative Pre-TrainingOpenAI
Training
Jun 11, 2018
Mastering Chess and Shogi by Self-Play with a General RL AlgorithmDeepMind
Reinforcement Learning
Dec 5, 2017
Proximal Policy Optimization AlgorithmsOpenAI
Reinforcement Learning
Jul 20, 2017
Attention Is All You NeedGoogle
Architecture
Jun 12, 2017
Mastering the Game of Go with Deep Neural Networks and Tree SearchDeepMind
Reinforcement Learning
Jan 27, 2016
Deep Residual Learning for Image RecognitionMicrosoft
Architecture
Dec 10, 2015
Deep LearningNature
Architecture
May 28, 2015
Distilling the Knowledge in a Neural NetworkGoogle
Training
Mar 9, 2015
Batch Normalization: Accelerating Deep Network TrainingGoogle
Training
Feb 11, 2015
Adam: A Method for Stochastic OptimizationUniversity of Toronto
Training
Dec 22, 2014
GloVe: Global Vectors for Word RepresentationStanford
Training
Oct 1, 2014
Sequence to Sequence Learning with Neural NetworksGoogle
Architecture
Sep 10, 2014
Dropout: A Simple Way to Prevent Neural Networks from OverfittingUniversity of Toronto
Training
Jun 15, 2014