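Paper Radar Daily | 2026-05-22

One-line takeaway: today's paper layer is an attention / inference-efficiency micro-cycle centered on the KV cache, running in parallel with agentic reasoning (clinical evidence seeking plus formal proof). The strongest observation is that the KV cache is being attacked from three directions at once: compression, safe sharing, and sparsification.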

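Summary

All 140 of today's candidates are new: none fall into the 14-day seen-pool (seen_count=231). The main thread is attention / inference efficiency. Full Attention Strikes Back (HF trending #16; transfers full attention into sparse attention within a hundred training steps) leads, and together with Gated DeltaNet-2 (linear attention), WorldKV, KVServe, and Q-ARVD (video diffusion quantization) it forms an inference-efficiency micro-cycle. The second axis is agentic reasoning: ClinSeekAgent (active clinical evidence seeking), Lean Refactor and DeepMind's formal proof search, and LCGuard (multi-agent KV safety) all cross-hit reasoning + agent. Strongest single observation: the KV cache is being attacked from compression (WorldKV/KVServe), safe sharing (LCGuard), and sparsification (Full Attention Strikes Back) at the same time, which suggests the KV cache has become the focal point of system-level competition in this cycle. S2 metadata is sparse (6/140 have a tldr, 0 returned a similarity graph), so the extended reading section is empty.
Candidate pool: 140 papers (arXiv primary + HF curated + S2 metadata); 0 duplicates after seen-pool deduplication, so the entire batch is new today.
Efficiency thread: Full Attention Strikes Back / Gated DeltaNet-2 / Q-ARVD / WorldKV / KVServe; agentic thread: ClinSeekAgent / Lean Refactor / Formal Proof Search / LCGuard.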
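
As a rough illustration of the seen-pool deduplication described above, here is a minimal Python sketch. The function name `filter_fresh`, the `id` field, and the dict-of-dates layout of the seen-pool are all assumptions made for the example; the digest does not specify how the pool is actually stored. Only the 14-day window comes from the text.

```python
from datetime import date, timedelta


def filter_fresh(candidates, seen_pool, today, window_days=14):
    """Return candidates whose id was not seen within the last `window_days`.

    `seen_pool` maps a paper id to the date it was last reported
    (hypothetical layout, assumed for this sketch).
    """
    cutoff = today - timedelta(days=window_days)
    # Only seen entries that still fall inside the look-back window count.
    recent_ids = {pid for pid, seen_on in seen_pool.items() if seen_on >= cutoff}
    return [c for c in candidates if c["id"] not in recent_ids]


today = date(2026, 5, 22)
seen_pool = {
    "paper-a": date(2026, 5, 20),  # inside the 14-day window
    "paper-b": date(2026, 4, 1),   # older than 14 days, no longer blocks
}
candidates = [{"id": "paper-a"}, {"id": "paper-b"}, {"id": "paper-c"}]

fresh = filter_fresh(candidates, seen_pool, today)
print([c["id"] for c in fresh])  # ['paper-b', 'paper-c']
```

Under this scheme, a day like today, where no candidate id appears in the recent pool, returns the full candidate list unchanged.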

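📌 Top picks (cross-hits)

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps (HF#16 · 69 upvotes · score 7.9) → transfers full attention into sparse attention within a hundred training steps, targeting the long-context inference bottleneck

Selected: HF trending #16 (69 upvotes) + watchlist hits on reasoning/inference/kv cache; the highest-scoring candidate on today's attention-efficiency thread
Authors: Yanke Zhou, Yiduo Li, Hanlin Tang, Maohua Li, Kan Liu, Lan Tao et al.
Evidence: arXiv HF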

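Q-ARVD: Quantizing Autoregressive Video Diffusion Models (HF#17 · 14 upvotes · score 7.3) → quantizes autoregressive video diffusion models to lower the inference cost of real-time streaming generation

Selected: HF trending #17 + watchlist hits on quantization/inference/world model; pushes quantization into video-diffusion world models
Authors: Siao Tang, Xinyin Ma, Gongfan Fang, Xingyi Yang, Xinchao Wang
Evidence: arXiv HF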

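ClinSeekAgent: Automating Multimodal Evidence Seeking for Agentic Clinical Reasoning (HF#18 · 4 upvotes · score 7.2) → a clinical reasoning agent that actively retrieves and synthesizes multimodal evidence

Selected: HF trending #18 + watchlist hits on reasoning/agent/inference; no longer assumes the evidence is already curated and emphasizes active evidence seeking
Authors: Juncheng Wu, Letian Zhang, Yuhan Wang, Haoqin Tu, Hardy Chen, Zijun Wang et al.
Evidence: arXiv HF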

LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems (score 7.0 · cs.AI/cs.ET/cs.LG/cs.MA) → guards latent-space communication between agents so that KV caches can be shared safely

Selected: watchlist hits on reasoning/agent/kv cache + benchmark; cs.MA crossover, with KV-sharing safety as a new attack surface for multi-agent systems
Authors: Sadia Asif, Mohammad Mohammadi Amiri, Momin Abbas, Prasanna Sattigeri, Karthikeyan Natesan Ramamurthy
Evidence: arXiv HF

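ACC: Compiling Agent Trajectories for Long-Context Training (HF#41 · 51 upvotes · score 5.5) → compiles agent trajectories into long-context training data, removing the need for manual curation

Selected: watchlist hits on reasoning/agent + sft; HF #41 (51 upvotes); uses agent trajectories in place of costly long-document curation
Authors: Qisheng Su, Zhen Fang, Shiting Huang, Yu Zeng, Yiming Zhao, Kou Shi et al.
Evidence: arXiv HF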

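Lean Refactor: Multi-Objective Controllable Proof Optimization via Agentic Strategy Search (HF#8 · 1 upvote · score 5.2) → a retrieval-augmented agent for multi-objective, controllable refactoring of Lean proofs

Selected: HF trending #8 + watchlist hit on agent; agentic optimization aimed at the verbosity and brittleness of LLM-written formal proofs
Authors: Jialin Lu, Soonho Kong, Rodrigo Stehling, Kaiyu Yang, Zhangyang Wang, Weiran Sun et al.
Evidence: arXiv HF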

Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? (HF#29 · 88 upvotes · score 5.1) → evaluates whether MLLMs can get past first-impression bias about personality

Selected: HF trending #29 (88 upvotes) + watchlist hits on reasoning/agent + benchmark; shifts from score prediction to behavior-aware evaluation
Authors: Caixin Kang, Tianyu Yan, Sitong Gong, Mingfang Zhang, Liangyang Ouyang, Ruicong Liu et al.
Evidence: arXiv HF

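Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles (HF#28 · 17 upvotes · score 4.7) → uses reinforcement learning to orchestrate hierarchical model-skill ensembles

Selected: HF trending #28 + watchlist hits on reasoning/agent + benchmark; addresses the bottleneck of a monolithic LLM acting as the sole interface to skills
Authors: Jinyang Wu, Guocheng Zhai, Ruihan Jin, Yuhao Shen, Zhengxi Lu, Fan Zhang et al.
Evidence: arXiv HF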

🏷 Watchlist category hits

Papers from this run that are raw fresh and matched watchlist keywords but did not make Top picks, grouped by arXiv category.

cs.AI

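Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention (score 4.5) → decouples erase and write in linear attention so that compressed memory can be edited more stably (watchlist hits on reasoning/vla + benchmark; linear-attention architecture improvement)
Advancing Mathematics Research with AI-Driven Formal Proof Search (score 4.5) → first large-scale evaluation of LLMs producing research-level formal mathematics proofs in Lean (watchlist hits on reasoning/agent + evaluation; formal mathematics research direction)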

cs.CL

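ChronoMedKG: A Temporally-Grounded Biomedical Knowledge Graph and Benchmark for Clinical Reasoning (score 4.5) → a time-aware biomedical knowledge graph and clinical reasoning benchmark (watchlist hits on reasoning/agent + benchmark; adds the temporal dimension to clinical reasoning)
Two is better than one: A Collapse-free Multi-Reward RLIF Training Framework (score 4.5) → dual-reward RLIF without external supervision that avoids training collapse in reasoning RL (watchlist hit on RLIF/reasoning; stability improvement for reinforcement learning from internal feedback)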

cs.CV

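WorldKV: Efficient World Memory with World Retrieval and Compression (score 4.5) → world-memory retrieval plus KV compression to keep video worlds consistent over long horizons (watchlist hits on inference/vla; KV compression applied to persistent world models)
SEGA: Spectral-Energy Guided Attention for Resolution Extrapolation in Diffusion Transformers (score 4.4) → spectral-energy guided attention for resolution extrapolation in diffusion Transformers (watchlist hit on inference-time attention; DiT generation at resolutions beyond those seen in training)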

cs.RO

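AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation (score 4.0) → vision-language navigation with self-aware reasoning (watchlist hits on reasoning/vla; explicit self-awareness reasoning for VLN)
GesVLA: Gesture-Aware Vision-Language-Action Model Embedded Representations (score 4.0) → a gesture-aware VLA that resolves spatial ambiguity among multiple similar objects (watchlist hit on vla; brings gesture cues into robot manipulation)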

cs.LG

Clipping Bottleneck: Stabilizing RLVR via Stochastic Recovery of Near-Boundary Signals (score 4.0) → stochastically recovers near-boundary signals to stabilize GRPO clipping in RLVR (watchlist hit on RLVR/reasoning; diagnosis and fix for GRPO clipping instability)
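
The grouping used in this section can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record fields (`title`, `categories`, `score`), the choice of the first listed category as the primary one, and the `uncategorized` fallback bucket are all hypothetical, since the digest does not describe its pipeline. The titles and scores are taken from the entries above.

```python
from collections import defaultdict


def group_by_primary_category(papers):
    """Group papers by their first listed category, highest score first."""
    groups = defaultdict(list)
    for paper in papers:
        # Papers without category metadata go into a fallback bucket
        # (assumption: the first category is treated as primary).
        cats = paper.get("categories") or ["uncategorized"]
        groups[cats[0]].append(paper)
    return {
        cat: sorted(items, key=lambda p: p["score"], reverse=True)
        for cat, items in groups.items()
    }


papers = [
    {"title": "SEGA", "categories": ["cs.CV"], "score": 4.4},
    {"title": "WorldKV", "categories": ["cs.CV"], "score": 4.5},
    {"title": "Gated DeltaNet-2", "categories": ["cs.AI"], "score": 4.5},
    {"title": "AwareVLN", "categories": ["cs.RO"], "score": 4.0},
]

groups = group_by_primary_category(papers)
for cat, items in groups.items():
    print(cat, [p["title"] for p in items])
```

A fallback bucket matters here because some candidates, mostly HF-sourced ones, arrive without arXiv categories.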

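🔗 Extended reading (Semantic Scholar similar papers)

No high-confidence incremental signal in this section today (S2 returned no similar papers). Only 6 of the 140 candidates carry an S2 tldr, and none returned a similar_papers citation graph, so extended_reading=[], recorded under coverage_gaps: s2_similar_unavailable.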

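🧑‍🔬 Newly appearing authors / teams

Pushmeet Kohli (Google DeepMind, frontier-labs, cross_checked=false): leads DeepMind's new formal proof search paper (Advancing Mathematics Research with AI-Driven Formal Proof Search), the first large-scale evaluation of LLMs solving research-level mathematics in Lean; co-authors include Thomas Hubert / Matej Balog / Aja Huang and others from the AlphaProof lineage. The affiliation is inferred from the authors' public backgrounds and was not cross-checked in this run.
- Evidence: link link
Jan Kautz (NVIDIA, frontier-labs, cross_checked=false): the NVIDIA team's new linear-attention architecture Gated DeltaNet-2, which decouples erase and write so that compressed memory can be edited more stably; co-authors include Ali Hatamizadeh / Yejin Choi. The affiliation is inferred from the authors' public backgrounds and was not cross-checked in this run.
- Evidence: link link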

📉 Coverage gaps and uncertainty

s2_similar_unavailable
affiliations_unavailable
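s2_metadata_sparse: only 6/140 candidates returned an S2 tldr/link, and 0 returned a similarity graph (the similar-papers section is missing)
affiliations_inferred: the candidate JSON has no affiliation field; the DeepMind / NVIDIA attributions are inferred from author names, with cross_checked=false (not verified in this run)
categories_partial: 100/140 candidates carry arXiv categories; about 40 (mostly HF-sourced) have no category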
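
To make the ratios behind these tags concrete, the sketch below turns the counts quoted above into coverage fractions and a status label. The labels mirror the tag suffixes used in this section (`unavailable`, `sparse`, `partial`), but the 10% threshold for "sparse" and the function name `coverage_report` are assumptions made for illustration, not the digest's actual rule.

```python
def coverage_report(counts, total, sparse_below=0.10):
    """Map each metadata field to (coverage ratio, status label).

    `sparse_below` is an assumed cutoff; the digest states no threshold.
    """
    report = {}
    for name, n in counts.items():
        ratio = n / total
        if n == 0:
            status = "unavailable"
        elif ratio < sparse_below:
            status = "sparse"
        elif n < total:
            status = "partial"
        else:
            status = "full"
        report[name] = (round(ratio, 3), status)
    return report


# Counts taken from the coverage notes above (out of 140 candidates).
counts = {"s2_tldr": 6, "s2_similar": 0, "arxiv_categories": 100}
report = coverage_report(counts, total=140)
for name, (ratio, status) in report.items():
    print(f"{name}: {ratio:.1%} -> {status}")
```

With these counts, S2 tldr coverage is about 4.3%, similar-paper coverage is 0%, and arXiv category coverage is about 71.4%.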

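Sources and cross-validation notes

Conclusions are anchored on arXiv primary. HF Daily serves as a trending-heat signal (49/140 carry upvotes), not as evidence for paper results. Semantic Scholar metadata is sparse (6/140 have a tldr, none have a similarity graph) and is supplementary only. paper_fetch stderr is empty, so no single source fully degraded.

All three sources returned. None of the 140 candidates are in the 14-day seen-pool, so today is an entirely new batch and no fallback to extended reading was needed. S2 provided no similarity graph, attributed to indexing lag for new preprints, so the extended reading section is left empty per the spec.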

Hanzhi's BLOG

[Papers · 2026-05-22]