论文雷达日报｜2026-05-25

一句话结论：今日候选单源降级（100% HF Daily），论文层双热点是「多模态/视觉推理」与「agent skill 自演化」——SkillOpt 主张把 agent 技能当优化器训练（147 赞当日最高），与分类命中里「模型生成技能复用」研究同日呼应。

摘要

今日 32 条候选实为单源降级：30 条来自 HuggingFace Daily trending、2 条 hf+s2，arXiv 分类与 Semantic Scholar 富集（s2_tldr / similar_papers）几乎全空，全部 0 命中 14 天 seen-pool。论文层主线是「multimodal reasoning + agent skill 自演化」双热点：Top 8 中 5 篇围绕多模态/视觉推理（SciAtlas 知识图谱、VGenST-Bench 时空推理基准、ETCHR think-with-images、From-Seeing-to-Thinking 感知-推理解耦、LatentUMM 理解-生成对齐），两篇正面交锋「agent skill 该不该当成可训练对象」——SkillOpt（#6，147 赞当日最高）主张把技能当优化器训练实现自演化，与分类命中里的 2605.23899「系统研究模型生成 agent skill 复用」形成同日呼应。效率/理论侧 LLMs-as-Noisy-Channels（2605.23901）用香农信道视角挑战单调幂律 scaling law。注意：本期无任何结论可锚定到 arXiv 分类或 S2 引用图，Top picks 排序完全由 HF trending + watchlist 关键词驱动，可信度相应降级。

📌 Top picks (交叉命中)

按 ranking_score 降序取前 8；本期排序完全由 HF trending + watchlist 关键词驱动（无 arXiv 分类 / S2 交叉）。

SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research（44↑ / HF#2 / score 8.8） → 大规模科研知识图谱，助 agent 跨学科自动检索
- 入选理由：HF 当日热度第 2（44 赞）；watchlist 命中 reasoning/agent/inference，ranking_score 8.8 居首
- 证据：https://arxiv.org/abs/2605.22878 · https://huggingface.co/papers/2605.22878
PhotoFlow: Agentic 3D Virtual Photography Missions（20↑ / HF#5 / score 7.0） → agentic 3D 虚拟摄影：自主推断机位与相机参数
- 入选理由：HF trending #5（20 赞）；命中 reasoning/agent + benchmark
- 证据：https://arxiv.org/abs/2605.23771 · https://huggingface.co/papers/2605.23771
VGenST-Bench: A Benchmark for Spatio-Temporal Reasoning via Active Video Synthesis（18↑ / HF#12 / score 6.8） → 用主动视频合成构建时空推理评测基准
- 入选理由：HF trending #12（18 赞）；命中 reasoning/agent + benchmark/evaluation
- 证据：https://arxiv.org/abs/2605.22570 · https://huggingface.co/papers/2605.22570
StepAudio 2.5 Technical Report（35↑ / HF#13 / score 6.2） → 统一音频-语言大模型，强化语音与推理能力
- 入选理由：HF trending #13（35 赞）；命中 reasoning/rlhf，101 作者大团队技术报告
- 证据：https://arxiv.org/abs/2605.23463 · https://huggingface.co/papers/2605.23463
ETCHR: Editing To Clarify and Harness Reasoning（9↑ / HF#17 / score 5.8） → 编辑图像澄清线索，提升多模态视觉推理
- 入选理由：HF trending #17（9 赞）；命中 reasoning/moe + fine-tuning
- 证据：https://arxiv.org/abs/2605.23897 · https://huggingface.co/papers/2605.23897
SkillOpt: Executive Strategy for Self-Evolving Agent Skills（147↑ / HF#21 / score 5.4） → 把 agent 技能当优化器训练，实现自演化
- 入选理由：HF trending #21，147 赞为当日最高；命中 agent/inference
- 证据：https://arxiv.org/abs/2605.23904 · https://huggingface.co/papers/2605.23904
From Seeing to Thinking: Decoupling Perception and Reasoning Improves Post-Training of Vision-Language Models（4↑ / HF#3 / score 5.2） → 解耦感知与推理，改进 VLM 后训练
- 入选理由：HF trending #3（4 赞）；命中 reasoning + sft
- 证据：https://arxiv.org/abs/2605.20177 · https://huggingface.co/papers/2605.20177
LatentUMM: Dual Latent Alignment for Unified Multimodal Models（4↑ / HF#1 / score 4.9） → 双隐空间对齐，修复统一多模态理解生成不一致
- 入选理由：HF 当日榜首 #1（4 赞）；命中 preference optimization
- 证据：https://arxiv.org/abs/2605.17766 · https://huggingface.co/papers/2605.17766

🏷 Watchlist 分类命中

arXiv 分类本期未返回（arxiv_categories_unavailable），改用 watchlist 关键词分组；仅收录关键词命中但未进 Top picks 的新抓论文。

🔗 延伸阅读 (Semantic Scholar 相似论文)

本段今日无高置信度增量信号（S2 相似论文未返回，s2_similar_unavailable）。

🧑‍🔬 新出现的作者 / 团队

本日发现扫描未发现达标候选人：全 32 条候选无 tracked_author 命中，且 affiliations 字段缺失（affiliations_unavailable），无法做机构交叉验证，故不硬塞候选。

📉 覆盖缺口与不确定性

s2_unavailable：Semantic Scholar 未返回 tldr/venue/affiliation，tldr_en 全空。
s2_similar_unavailable：S2 相似论文图未返回，延伸阅读为空。
arxiv_categories_unavailable：arXiv listing 未返回分类，分类命中改用关键词分组。
affiliations_unavailable：候选无机构字段，新作者发现无法机构交叉验证。
confidence_flags：single_source_fallback:hf_daily_only、arxiv_listing_returned_no_categories、s2_enrichment_missing:tldr_and_similar

来源与交叉验证说明

单源降级：候选 100% 来自 HF Daily trending（30 hf + 2 hf+s2），arXiv listing 未返回分类、Semantic Scholar 未返回 tldr/similar/venue/affiliation。权重上无 primary(arXiv) 与 metadata(S2) 交叉，仅 curated(HF) 单源支撑。
每篇结论的硬证据指向 arxiv_url（primary 预印本原文）；HF upvotes/trending 仅作热度信号、不作论文结果证据。因 S2 未索引，citation_count/venue 缺失不作降权理由（新预印本常见）。本期无第二独立源交叉验证，属单源降级期。

Hanzhi's BLOG

[论文·2026-05-25]

论文雷达日报｜2026-05-25

摘要

📌 Top picks (交叉命中)

🏷 Watchlist 分类命中

keyword:reasoning（关键词分组）

keyword:inference（关键词分组）

keyword:agent（关键词分组）

keyword:world model（关键词分组）

keyword:quantization（关键词分组）

keyword:vla（关键词分组）

🔗 延伸阅读 (Semantic Scholar 相似论文)

🧑‍🔬 新出现的作者 / 团队

📉 覆盖缺口与不确定性

来源与交叉验证说明