AI Hub

Research Papers

The research that matters, distilled — search, filter by topic, sort, and group.

107 results

Topics
Tokenisation via Convex RelaxationsJan Tempus, Philip WhittingtonMay 21, 2026
Integrable Elasticity via Neural Demand PotentialsCarlos Heredia, Daniel RoncelMay 21, 2026
Vector Policy Optimization: Training for Diversity Improves Test-Time SearchRyan Bahlous-Boldi, Isha PuriMay 21, 2026
Remember to be Curious: Episodic Context and Persistent Worlds for 3D ExplorationLily Goli, Justin KerrMay 21, 2026
The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation LearningVishal RajputMay 21, 2026
Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting ModelsKrishnakumar BalasubramanianMay 21, 2026
MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent SystemsQianshu Cai, Yonggang ZhangMay 21, 2026
Gated DeltaNet-2: Decoupling Erase and Write in Linear AttentionAli Hatamizadeh, Yejin ChoiMay 21, 2026
LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent SystemsSadia Asif, Mohammad Mohammadi AmiriMay 21, 2026
Evaluating Commercial AI Chatbots as News IntermediariesMirac Suzgun, Emily ShenMay 21, 2026
DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/RollbackYunpeng Dong, Jingkai HeMay 21, 2026
FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly DetectionHuanchi Wang, Zihang HuangMay 21, 2026
SDPM: Survival Diffusion Probabilistic Model for Continuous-Time Survival AnalysisStanislav R. Kirpichenko, Andrei V. KonstantinovMay 21, 2026
MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking DataAmir Mousavi, Mohammad Sadegh SirjaniMay 21, 2026
CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead AdaptationAmir Mousavi, Mohammad Sadegh SirjaniMay 21, 2026
Deep Reinforcement Learning for Flexible Job Shop Scheduling with Random Job ArrivalsYu Tang, Muhammad ZakwanMay 21, 2026
Reducing Political Manipulation with Consistency TrainingLong Phan, Devin KimMay 21, 2026
Understanding Data Temporality Impact on Large Language Models Pre-trainingPilchen Hippolyte, Fabre RomainMay 21, 2026
Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State ReformulationSamson Gourevitch, Yazid JanatiMay 21, 2026
Advancing Mathematics Research with AI-Driven Formal Proof SearchGeorge Tsoukalas, Anton KovsharovMay 21, 2026
DeepSeek-V3.2: Pushing the Frontier of Open Large Language ModelsDeepSeek AI
ArchitectureReinforcement LearningAgents
Dec 2, 2025
Natural Emergent Misalignment from Reward Hacking in Production RLAnthropic
Safety
Nov 23, 2025
Olympiad-level Formal Mathematical Reasoning with Reinforcement Learning (AlphaProof)Google DeepMind
Reinforcement LearningEvaluation
Nov 12, 2025
Agentic Context Engineering: Evolving Contexts for Self-Improving Language ModelsStanford / SambaNova / UC Berkeley
Agents
Oct 6, 2025