Research Papers

The research that matters, distilled — search, filter by topic, sort, and group.

107 results

Title	Authors	Topics	Date
Tokenisation via Convex Relaxations	Jan Tempus, Philip Whittington	—	May 21, 2026
Integrable Elasticity via Neural Demand Potentials	Carlos Heredia, Daniel Roncel	—	May 21, 2026
Vector Policy Optimization: Training for Diversity Improves Test-Time Search	Ryan Bahlous-Boldi, Isha Puri	—	May 21, 2026
Remember to be Curious: Episodic Context and Persistent Worlds for 3D Exploration	Lily Goli, Justin Kerr	—	May 21, 2026
The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation Learning	Vishal Rajput	—	May 21, 2026
Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting Models	Krishnakumar Balasubramanian	—	May 21, 2026
MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems	Qianshu Cai, Yonggang Zhang	—	May 21, 2026
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention	Ali Hatamizadeh, Yejin Choi	—	May 21, 2026
LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems	Sadia Asif, Mohammad Mohammadi Amiri	—	May 21, 2026
Evaluating Commercial AI Chatbots as News Intermediaries	Mirac Suzgun, Emily Shen	—	May 21, 2026
DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback	Yunpeng Dong, Jingkai He	—	May 21, 2026
FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection	Huanchi Wang, Zihang Huang	—	May 21, 2026
SDPM: Survival Diffusion Probabilistic Model for Continuous-Time Survival Analysis	Stanislav R. Kirpichenko, Andrei V. Konstantinov	—	May 21, 2026
MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking Data	Amir Mousavi, Mohammad Sadegh Sirjani	—	May 21, 2026
CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead Adaptation	Amir Mousavi, Mohammad Sadegh Sirjani	—	May 21, 2026
Deep Reinforcement Learning for Flexible Job Shop Scheduling with Random Job Arrivals	Yu Tang, Muhammad Zakwan	—	May 21, 2026
Reducing Political Manipulation with Consistency Training	Long Phan, Devin Kim	—	May 21, 2026
Understanding Data Temporality Impact on Large Language Models Pre-training	Pilchen Hippolyte, Fabre Romain	—	May 21, 2026
Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation	Samson Gourevitch, Yazid Janati	—	May 21, 2026
Advancing Mathematics Research with AI-Driven Formal Proof Search	George Tsoukalas, Anton Kovsharov	—	May 21, 2026
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models	DeepSeek AI	Architecture, Reinforcement Learning, Agents	Dec 2, 2025
Natural Emergent Misalignment from Reward Hacking in Production RL	Anthropic	Safety	Nov 23, 2025
Olympiad-level Formal Mathematical Reasoning with Reinforcement Learning (AlphaProof)	Google DeepMind	Reinforcement Learning, Evaluation	Nov 12, 2025
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models	Stanford / SambaNova / UC Berkeley	Agents	Oct 6, 2025