Daily AI Papers — May 14, 2026


1. MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Authors: Mind Lab, Song Cao, Vic Cao, Andrew Chen, Kaijie Chen, Cleon Cheng et al.

Summary: MinT is a managed infrastructure system for LoRA post-training and online serving designed for settings where many fine-tuned policies are produced over a small number of expensive base-model deployments. Instead of materializing each policy as a merged full checkpoint, MinT keeps the base model resident and dynamically loads exported LoRA adapter revisions, enabling efficient multi-tenant LLM serving at massive scale.

Arxiv: arxiv.org/abs/2605.13779

Sources: HuggingFace Daily Papers (#1, 3 upvotes)

Why trending: Highest-upvoted paper today on HuggingFace; addresses a critical MLOps challenge — serving millions of fine-tuned LoRA models without N×base-model memory overhead, highly relevant to production AI infrastructure teams.
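The core serving idea, keeping one resident base model and applying per-tenant low-rank deltas at request time instead of merged checkpoints, can be sketched in a few lines. This is a minimal illustration of LoRA-style multi-tenant serving in general, not MinT's actual system; all names, shapes, and the adapter registry are assumptions.

```python
import numpy as np

# Illustrative multi-tenant LoRA serving: one resident base weight matrix
# shared by all tenants, with per-tenant low-rank adapters applied at
# request time rather than merged into N full checkpoints.
rng = np.random.default_rng(0)
d, r = 8, 2                            # hidden size, LoRA rank (toy values)
W_base = rng.standard_normal((d, d))   # resident base weights (shared)

# Hypothetical adapter registry: tenant -> (A, B) low-rank factors, standing
# in for dynamically loaded adapter revisions.
adapters = {
    "tenant-a": (rng.standard_normal((r, d)), np.zeros((d, r))),
    "tenant-b": (rng.standard_normal((r, d)), rng.standard_normal((d, r)) * 0.01),
}

def forward(x, tenant, scaling=1.0):
    """Shared base plus the tenant's LoRA delta: y = x W^T + s * (x A^T) B^T."""
    y = x @ W_base.T
    A, B = adapters[tenant]
    # Low-rank update applied on the fly; the merged matrix is never materialized.
    return y + scaling * (x @ A.T) @ B.T

x = rng.standard_normal((1, d))
y_a = forward(x, "tenant-a")   # B is zero here, so this matches the base output
y_b = forward(x, "tenant-b")
```

The memory win is that each tenant costs O(d·r) adapter parameters instead of O(d²) merged weights.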


2. Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion

Authors: Chien Van Nguyen, Chaitra Hegde, Van Cuong Pham, Ryan A. Rossi, Franck Dernoncourt, Thien Huu Nguyen

Summary: Orthrus introduces a dual-view framework that unifies the exact generation fidelity of autoregressive LLMs with the high-speed parallel token generation of diffusion models. Standard autoregressive decoding is sequential by nature, a fundamental throughput bottleneck that Orthrus addresses with diffusion-based parallel decoding while retaining AR-quality outputs.

Arxiv: arxiv.org/abs/2605.12825

Sources: HuggingFace Daily Papers (2 upvotes)

Why trending: Inference speed and memory efficiency are top priorities across the field; combining AR fidelity with diffusion parallelism is a compelling architectural direction.


3. Revisiting DAgger in the Era of LLM-Agents

Authors: Changhao Li, Rushi Qiang, Jiawei Huang, Chenxiao Gao, Chao Zhang, Niao He

Summary: Long-horizon LM agents trained from multi-turn interaction suffer from covariate shift with SFT (off-policy teacher data) and sparse rewards with RL. This paper revisits DAgger — the classic interactive imitation learning algorithm — showing it elegantly bridges both failure modes for LLM agents by providing on-policy teacher corrections at training time.

Arxiv: arxiv.org/abs/2605.12913

Sources: HuggingFace Daily Papers (2 upvotes)

Why trending: Agentic LLM training is one of the hottest research areas; reframing a classical algorithm (DAgger) for modern LLM agent settings gives the community a concrete and principled recipe.
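The classic DAgger recipe the paper revisits can be sketched on a toy problem: roll out the current student policy, relabel the states it actually visits with teacher actions, aggregate, and retrain. The environment, teacher, and "training" step below are illustrative stand-ins, not the paper's LLM-agent setup.

```python
import random
from collections import defaultdict

def expert(state):
    # Teacher policy: always step toward the goal at state 10.
    return +1 if state < 10 else -1

def rollout(policy, steps=5):
    # Collect the states visited under the *current student* policy.
    state, visited = 0, []
    for _ in range(steps):
        visited.append(state)
        state += policy(state)
    return visited

def train(dataset):
    # Stand-in for supervised training: behavior cloning by majority vote.
    votes = defaultdict(list)
    for s, a in dataset:
        votes[s].append(a)
    table = {s: max(set(acts), key=acts.count) for s, acts in votes.items()}
    return lambda s: table.get(s, random.choice([-1, +1]))

random.seed(0)
dataset = []                                  # aggregated (state, teacher action) pairs
policy = lambda s: random.choice([-1, +1])    # untrained initial student
for _ in range(3):                            # DAgger iterations
    for s in rollout(policy):                 # on-policy states
        dataset.append((s, expert(s)))        # teacher corrections on those states
    policy = train(dataset)
```

Because labels come from the teacher on the student's own state distribution, the covariate shift of pure off-policy SFT is avoided, which is the bridge the paper draws to LLM-agent training.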


4. MulTaBench: Benchmarking Multimodal Tabular Learning with Text and Image

Authors: Alan Arazi, Eilam Shapira, Shoham Grunblat, Mor Ventura, Elad Hoffer, Gioia Blayer

Summary: MulTaBench is a comprehensive benchmark for multimodal tabular learning that natively incorporates text and image modalities alongside structured numerical and categorical data. It reveals that tabular foundation models relying on frozen pretrained embeddings for unstructured inputs underperform tuned end-to-end multimodal models on real-world tabular tasks.

Arxiv: arxiv.org/abs/2605.10616

Sources: HuggingFace Daily Papers (2 upvotes)

Why trending: Tabular learning is ubiquitous in industry; adding multimodal support is a natural frontier and this benchmark sets a standard for the field.


5. AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Authors: Yuchao Gu, Guian Fang, Yuxin Jiang, Weijia Mao, Song Han

Summary: AnyFlow addresses a core limitation of consistency-distilled video diffusion models: performance degrades as more sampling steps are allocated at test time. By replacing the off-policy consistency trajectory with on-policy flow map distillation, AnyFlow achieves high-quality generation at any number of inference steps without the typical step-count performance cliff.

Arxiv: arxiv.org/abs/2605.13724

Sources: HuggingFace Daily Papers

Why trending: Video generation is a highly competitive space; flexible step-count inference is a practical requirement for real deployments where compute budgets vary.


6. Qwen-Image-VAE-2.0 Technical Report

Authors: Zekai Zhang, Deqing Li, Kuan Cao, Yujia Wu, Chenfei Wu

Summary: Qwen-Image-VAE-2.0 is a suite of high-compression Variational Autoencoders achieving significant advances in both reconstruction fidelity and diffusability. It adopts an improved architecture featuring Global Skip Connections (GSC) and expanded latent channels, scaled to billions of training images to address reconstruction bottlenecks at high compression ratios.

Arxiv: arxiv.org/abs/2605.13565

Sources: HuggingFace Daily Papers

Why trending: The Qwen series (Alibaba) is one of the most influential open model families; a new VAE release directly improves downstream image generation quality for the entire Qwen ecosystem and compatible models.


7. Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Authors: Zhaowei Wang, Lishu Luo, Haodong Duan, Weiwei Liu, Sijin Wu

Summary: Long-context modeling is becoming core to modern VLMs for document understanding, video analysis, and multi-turn agentic workflows, yet practical training recipes remain underexplored. This work provides a systematic study of designing and balancing long-context data mixtures, yielding models that generalize beyond 128K context with strong performance on long-document and video benchmarks.

Arxiv: arxiv.org/abs/2605.13831

Sources: HuggingFace Daily Papers

Why trending: 128K+ context VLMs unlock entirely new application categories; actionable training recipes for this capability are rare and highly sought after by practitioners.


8. The DAWN of World-Action Interactive Models (WAIMs)

Authors: Hongbo Lu, Liang Yao, Chenghao He, Haoyu Wang, Xiang Gu

Summary: DAWN formalizes World-Action Interactive Models (WAIMs), which address the mutual dependency between scene evolution and maneuver planning — existing World Action Models treat these as isolated parallel branches or rigid predict-then-plan pipelines, missing the critical bidirectional coupling. WAIMs explicitly model this reciprocity, enabling more coherent and physically plausible embodied agent behavior.

Arxiv: arxiv.org/abs/2605.11550

Sources: HuggingFace Daily Papers

Why trending: World models for embodied AI and autonomous driving are a major research frontier; formalizing the world-action interaction loop is a conceptual contribution that will influence future architectures.


9. Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

Authors: Tsz Ting Chung, Lemao Liu, Mo Yu, Dit-Yan Yeung

Summary: Many-shot in-context learning with chain-of-thought (CoT) can match fine-tuning performance using dozens to hundreds of demonstrations, yet the scaling behavior on reasoning tasks is poorly understood. This paper provides a rigorous study of many-shot CoT-ICL, revealing how it differs from few-shot ICL and identifying conditions under which it truly learns vs. merely pattern-matches.

Arxiv: arxiv.org/abs/2605.13511

Sources: HuggingFace Daily Papers

Why trending: Understanding the limits and mechanisms of in-context learning is foundational; as long-context models become standard, many-shot ICL becomes a practical alternative to fine-tuning.
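Mechanically, many-shot CoT-ICL only changes how many (question, chain-of-thought, answer) demonstrations are packed into the context before the test query. A hypothetical prompt-assembly sketch, with toy arithmetic demos and formatting that are our assumptions rather than the paper's protocol:

```python
# 100 toy (question, chain-of-thought, answer) demonstrations.
demos = [
    (f"{a}+{b}", f"{a} plus {b} equals {a + b}.", str(a + b))
    for a in range(10) for b in range(10)
]

def build_prompt(query, demonstrations):
    """Pack many CoT demonstrations into one long context, then append the query."""
    blocks = [f"Q: {q}\nReasoning: {cot}\nA: {ans}" for q, cot, ans in demonstrations]
    return "\n\n".join(blocks) + f"\n\nQ: {query}\nReasoning:"

prompt = build_prompt("7+8", demos)
```

The paper's question is whether scaling this demonstration count drives genuine learning or only stronger pattern matching.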


10. Learning Agentic Policy from Action Guidance

Authors: Yuxiang Ji, Zengbin Wang, Yong Wang, Shidong Yang, Ziyu Ma

Summary: Agentic RL for LLMs critically depends on exploration — when the base policy cannot reach reward states, learning stalls. This paper proposes using explicit action guidance to provide external scaffolding that bootstraps the agent into the reward region, avoiding costly iterative prompting or manual curriculum design.

Arxiv: arxiv.org/abs/2605.12004

Sources: HuggingFace Daily Papers

Why trending: Exploration in sparse-reward LLM agent training is a known bottleneck; principled solutions that don’t require expensive human-in-the-loop guidance are highly practical.
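The general idea, an external guide supplying an action prefix that carries the agent into the sparse-reward region before the policy takes over, can be shown on a toy chain environment. This is our illustration of action guidance in the abstract, not the paper's method; the environment and reward are invented.

```python
import random

GOAL = 5  # sparse reward: +1 only when the agent stands exactly on the goal

def rollout(policy, guidance=(), steps=8):
    """Follow the guidance prefix first, then the policy's own actions."""
    state, total_reward = 0, 0
    for i in range(steps):
        action = guidance[i] if i < len(guidance) else policy(state)
        state += action
        total_reward += 1 if state == GOAL else 0
    return total_reward

random.seed(3)
untrained = lambda s: random.choice([-1, 1])
r_free = rollout(untrained)                               # random walk: rarely sees reward
r_guided = rollout(untrained, guidance=(1, 1, 1, 1, 1))   # prefix guarantees reaching the goal
```

Guided rollouts produce reward signal from which a policy can actually learn, which is the bootstrapping effect the paper targets.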


11. Asymmetric Flow Models

Authors: Hansheng Chen, Jan Ackermann, Minseo Kim, Gordon Wetzstein, Leonidas Guibas

Summary: Flow-based generation in high-dimensional spaces is hampered because velocity prediction requires modeling high-dimensional noise, even when data has strong low-rank structure. AsymFlow restricts noise prediction to a low-rank subspace while keeping data prediction full-dimensional, analytically recovering the full-rank velocity field from this asymmetric parameterization.

Arxiv: arxiv.org/abs/2605.12964

Sources: HuggingFace Daily Papers

Why trending: Flow matching models (used in Stable Diffusion 3, Flux, etc.) are the dominant generative paradigm; a principled efficiency improvement from Guibas’s group at Stanford is likely to have broad impact.
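Our reading of the asymmetric parameterization, sketched under linear flow matching where x_t = (1 - t)·x0 + t·x1 and the target velocity is v = x1 - x0: predict the data endpoint in full dimension, predict the noise endpoint only through coefficients in a fixed rank-r basis, and recover the velocity analytically. The basis, the oracle "predictions", and the toy assumption that noise lies in the subspace are all ours, not the paper's construction.

```python
import numpy as np

rng = np.random.default_rng(1)
d, r = 16, 3
# Fixed orthonormal rank-r basis for the noise subspace.
U, _ = np.linalg.qr(rng.standard_normal((d, r)))

x0 = U @ rng.standard_normal(r)   # noise constrained to the subspace (toy assumption)
x1 = rng.standard_normal(d)       # data sample, full-dimensional

# Oracle stand-ins for the two asymmetric prediction heads:
x1_hat = x1                       # full-dimensional data prediction
z_hat = U.T @ x0                  # only r noise coefficients instead of d values

# Analytic recovery of the full-rank velocity field from the asymmetric parts.
v_hat = x1_hat - U @ z_hat
v_true = x1 - x0
```

The point of the sketch: the network never has to model d-dimensional noise, yet the recovered velocity is exact whenever the noise really lives in the subspace.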


12. FrameSkip: Learning from Fewer but More Informative Frames in VLA Training

Authors: Bin Yu, Shijie Lian, Xiaopeng Lin, Zhaolong Shen, Yuliang Wei

Summary: Vision-Language-Action policies trained from dense robot teleoperation trajectories suffer from temporal supervision imbalance — long low-change segments dominate training while manipulation-critical transitions are underrepresented. FrameSkip selectively skips redundant frames during training, improving both efficiency and policy quality on manipulation tasks.

Arxiv: arxiv.org/abs/2605.13757

Sources: HuggingFace Daily Papers

Why trending: VLA training is a bottleneck for scaling robot foundation models; a simple frame selection strategy that improves sample efficiency is immediately applicable to ongoing robotics research.
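A change-based frame selection rule in the spirit of FrameSkip can be sketched in a few lines; the thresholding criterion below is our illustration, not the paper's exact selection method.

```python
import numpy as np

def select_frames(states, threshold=0.5):
    """Keep frame 0, then every frame whose change from the last kept frame
    exceeds the threshold, dropping redundant low-change frames."""
    kept = [0]
    for i in range(1, len(states)):
        if np.linalg.norm(states[i] - states[kept[-1]]) > threshold:
            kept.append(i)
    return kept

# Toy trajectory: a long near-static segment, then an abrupt manipulation event.
traj = np.array([[0.0], [0.01], [0.02], [0.03], [1.0], [2.0], [2.01]])
kept = select_frames(traj)
```

The static prefix collapses to a single frame while both high-change transitions survive, which is the supervision rebalancing the paper is after.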


13. Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

Authors: Xuehai Bai, Yang Shi, Yi-Fan Zhang, Xuanyu Zhu, Yuran Wang

Summary: Existing image editing benchmarks fail to faithfully reflect human judgment for strong frontier models due to limited task difficulty and coarse evaluation. Edit-Compass provides a harder, finer-grained benchmark while EditReward-Compass evaluates reward models used for alignment, establishing a unified evaluation ecosystem for instruction-based image editing.

Arxiv: arxiv.org/abs/2605.13062

Sources: HuggingFace Daily Papers

Why trending: Image editing is a commercially important capability; rigorous benchmarking is essential as models saturate existing evals, and this fills a clear gap.


14. TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking

Authors: Jisu Nam, Jahyeok Koo, Soowon Son, Jaewoo Jung, Honggyu An

Summary: Dense 3D tracking from monocular video is fundamental to dynamic scene understanding, but existing approaches either rely on synthetic-data-trained iterative paradigms or fine-tune 3D foundation models with limited motion priors. TrackCraft3R repurposes pretrained video diffusion transformers — which have strong motion priors from large-scale video training — for dense 3D point tracking.

Arxiv: arxiv.org/abs/2605.12587

Sources: HuggingFace Daily Papers

Why trending: Repurposing generative video models for perception tasks is a productive research trend; 3D tracking is critical for robotics and AR/VR applications.


15. HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution

Authors: Dongming Jiang, Yi Li, Guanpeng Li, Qiannan Li, Bingzhe Li

Summary: Memory retrieval in agentic LLM systems is typically treated as a static lookup using flat vector search or fixed binary relational graphs. HAGE proposes a weighted multi-relational memory framework where graph edge weights evolve via reinforcement learning, capturing the varying strength, confidence, and query-dependent relevance of relationships between events.

Arxiv: arxiv.org/abs/2605.09942

Sources: HuggingFace Daily Papers

Why trending: Agentic memory is an active area; moving from static vector lookup to dynamically evolving graph-based memory is a promising direction for long-horizon agent reasoning.
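A minimal sketch of reward-weighted graph memory in the spirit of HAGE; the moving-average update rule below is a simple illustration we chose, not the paper's actual RL objective.

```python
class WeightedMemoryGraph:
    """Multi-relational memory where edge weights evolve with reward feedback."""

    def __init__(self, lr=0.5):
        self.edges = {}  # (src, relation, dst) -> learned weight
        self.lr = lr

    def add(self, src, relation, dst, weight=0.5):
        self.edges[(src, relation, dst)] = weight

    def retrieve(self, src, top_k=1):
        """Return the top-k outgoing edges of src by current weight."""
        out = [(w, e) for e, w in self.edges.items() if e[0] == src]
        return [e for _, e in sorted(out, reverse=True)[:top_k]]

    def reinforce(self, edge, reward):
        """Move the edge weight toward the observed reward (illustrative update)."""
        w = self.edges[edge]
        self.edges[edge] = w + self.lr * (reward - w)

g = WeightedMemoryGraph()
g.add("meeting", "mentions", "deadline")
g.add("meeting", "mentions", "lunch")
g.reinforce(("meeting", "mentions", "deadline"), reward=1.0)  # edge helped answer a query
g.reinforce(("meeting", "mentions", "lunch"), reward=0.0)     # edge did not help
best = g.retrieve("meeting")[0]
```

After feedback, retrieval prefers the relation that proved useful, instead of treating all stored links as equally relevant static lookups.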


16. Retrieval is Cheap, Show Me the Code: Executable Multi-Hop Reasoning for RAG

Authors: Jiashuo Sun, Jimeng Shi, Yixuan Xie, Saizhuo Wang, Jash Rajesh Parekh

Summary: Existing RAG systems remain brittle on multi-hop questions because intermediate reasoning states are implicit in free-form natural language, making it hard to validate and chain retrieval steps. This paper proposes representing reasoning as executable code, where each intermediate state is an explicit, verifiable computation that drives the next retrieval step.

Arxiv: arxiv.org/abs/2605.12975

Sources: HuggingFace Daily Papers

Why trending: Multi-hop RAG is a well-known failure mode for production QA systems; using code as a structured reasoning substrate is an elegant solution that connects retrieval with program synthesis.
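The contrast with free-form reasoning can be sketched on a toy two-hop question: each hop is an explicit, checkable value that parameterizes the next retrieval. The corpus, retriever, and validation here are stand-ins for the paper's pipeline, not its implementation.

```python
# Tiny key-value corpus standing in for a real retrieval index.
corpus = {
    "capital of France": "Paris",
    "population of Paris": "2.1 million",
}

def retrieve(query):
    """Stand-in retriever: exact lookup over the toy corpus."""
    return corpus[query]

# Question: "What is the population of the capital of France?"
# Hop 1: the intermediate state is an explicit value we can validate.
city = retrieve("capital of France")
assert isinstance(city, str) and city  # verifiable before chaining

# Hop 2: the validated state drives the next retrieval query.
answer = retrieve(f"population of {city}")
```

Because `city` is a concrete program value rather than a phrase buried in free text, a bad hop fails loudly instead of silently corrupting the chain.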


17. PresentAgent-2: Towards Generalist Multimodal Presentation Agents

Authors: Wei Wu, Ziyang Xu, Zeyu Zhang, Yang Zhao, Hao Tang

Summary: PresentAgent-2 is an agentic framework that generates full presentation videos from open-ended user queries — it summarizes the query into a focused topic, performs deep research, assembles multimodal slides with text and visuals, and delivers them in an interactive presentation format. It extends beyond static slide generation to end-to-end presentation video production with grounded research.

Arxiv: arxiv.org/abs/2605.11363

Sources: HuggingFace Daily Papers

Why trending: Automated presentation generation is a high-value practical application; multi-step agentic pipelines that bridge research, content creation, and media production are gaining traction.


18. RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data

Authors: Harold Haodong Chen, Sirui Chen, Yingjie Xu, Wenhang Ge, Ying-Cong Chen

Summary: Scalable robotic manipulation is bottlenecked by scarcity of task-aligned physical interaction data; VLMs suffer from semantic-spatial misalignment and video generation models from physical hallucinations. RoboEvolve couples a VLM planner and a video generation model simulator in a co-evolution loop that mutually corrects each other’s failure modes, enabling data synthesis from limited demonstrations.

Arxiv: arxiv.org/abs/2605.13775

Sources: HuggingFace Daily Papers

Why trending: Data scarcity is the central bottleneck for robot learning; co-evolution of planning and simulation models is a novel approach to self-improving synthetic data generation.


19. Predicting Decisions of AI Agents from Limited Interaction through Text-Tabular Modeling

Authors: Eilam Shapira, Moshe Tennenholtz, Roi Reichart

Summary: AI agents negotiate and transact in natural language with unfamiliar counterparts whose internal LLMs, prompts, and control logic are hidden — yet each decision may have monetary consequences. This paper asks whether an agent can predict an unfamiliar counterpart’s decisions from limited interaction history using a text-tabular modeling approach that combines linguistic and behavioral signals.

Arxiv: arxiv.org/abs/2605.12411

Sources: HuggingFace Daily Papers

Why trending: As AI agents increasingly interact with each other in economic and strategic settings, theory of mind and opponent modeling capabilities become essential for safety and performance.


20. Offline Preference Optimization for Rectified Flow with Noise-Tracked Pairs

Authors: Yunhong Lu, Qichao Wang, Hengyuan Cao, Xiaoyin Xu, Min Zhang

Summary: Existing preference datasets for text-to-image models store only final winner/loser images, which is insufficient for rectified flow (RF) models that follow nearly straight denoising trajectories indexed by a specific prior noise sample. This paper proposes tracking noise samples across trajectories to create noise-paired preference data, enabling principled DPO-style alignment for RF models without the trajectory estimation errors of prior diffusion alignment approaches.

Arxiv: arxiv.org/abs/2605.09433

Sources: HuggingFace Daily Papers

Why trending: Rectified flow (used in Stable Diffusion 3, Flux) is the dominant text-to-image architecture; extending RLHF/DPO alignment properly to this paradigm is an important open problem.
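Why tracked noise matters can be sketched with a toy DPO-style objective: because each preference pair stores the same prior noise used to generate both images, the straight-line RF target v = x1 - x0 is exact for winner and loser alike, with no trajectory estimation. The loss shape follows Diffusion-DPO-like constructions; the toy "model" and all symbols below are our assumptions, not the paper's formulation.

```python
import numpy as np

rng = np.random.default_rng(2)
d = 4
x0 = rng.standard_normal(d)       # tracked prior noise, shared by the pair
x1_win = rng.standard_normal(d)   # preferred image
x1_lose = rng.standard_normal(d)  # dispreferred image
t = 0.3

def model_velocity(x_t):
    # Toy model that happens to fit the winner's velocity well.
    return (x1_win - x0) + 0.01 * x_t

def rf_dpo_loss(beta=1.0):
    # Interpolated states on each straight RF trajectory, same noise x0.
    xt_w = (1 - t) * x0 + t * x1_win
    xt_l = (1 - t) * x0 + t * x1_lose
    # Velocity-matching errors against the exact straight-line targets.
    err_w = np.sum((model_velocity(xt_w) - (x1_win - x0)) ** 2)
    err_l = np.sum((model_velocity(xt_l) - (x1_lose - x0)) ** 2)
    # Prefer fitting the winner's trajectory better than the loser's.
    return -np.log(1.0 / (1.0 + np.exp(-beta * (err_l - err_w))))

loss = rf_dpo_loss()
```

The loss drops below log 2 exactly when the model explains the winner's trajectory better than the loser's, given the shared tracked noise.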