cs.AI (2026-02-02)

📊 40 papers total | 🔗 3 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (27 🔗2) | Pillar 2: RL & Architecture (10 🔗1) | Pillar 3: Perception & Semantics (1) | Pillar 4: Generative Motion (1) | Pillar 8: Physics-based Animation (1)

🔬 Pillar 9: Embodied Foundation Models (27 papers)

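#	Title	One-Line Summary	Tags	🔗
1	Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling	Proposes a comic-based visual reasoning paradigm that improves multimodal temporal and causal reasoning	large language model, multimodal, chain-of-thought
2	Entropy-Guided Data-Efficient Training for Multimodal Reasoning Reward Models	Proposes Entropy-Guided Training (EGT) to improve the training efficiency and performance of multimodal reasoning reward models	large language model, multimodal
3	Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts	Avenir-Web: a multimodal web agent built on a mixture of experts and experience imitation that improves task execution in complex web environments	large language model, multimodal
4	Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models	Proposes DDR-Bench to evaluate the autonomous exploration ability of LLMs in open-ended data analysis	large language model
5	Evolving from Tool User to Creator via Training-Free Experience Reuse in Multimodal Reasoning	Proposes the UCT framework, which uses training-free experience reuse to evolve multimodal reasoning agents from tool users into tool creators	multimodal
6	Large Language Model and Formal Concept Analysis: a comparative study for Topic Modeling	A comparative study of large language models and formal concept analysis for topic modeling	large language model
7	Optimizing Prompts for Large Language Models: A Causal Approach	Proposes the Causal Prompt Optimization (CPO) framework to address generalization and cost problems in LLM prompt engineering	large language model
8	MentisOculi: Revealing the Limits of Reasoning with Mental Imagery	MentisOculi: reveals the limits of reasoning with mental imagery and evaluates how well multimodal models use visual information	large language model, multimodal
9	Live-Evo: Online Evolution of Agentic Memory from Continuous Feedback	Proposes Live-Evo for online evolution of agentic memory from continuous feedback	large language model	✅
10	Interpreting and Controlling LLM Reasoning through Integrated Policy Gradient	Proposes IPG, which uses integrated policy gradients to interpret and control the LLM reasoning process	large language model
11	Light Alignment Improves LLM Safety via Model Self-Reflection with a Single Neuron	Proposes a lightweight alignment method based on a single-neuron gating mechanism to improve LLM safety	large language model	✅
12	Geometric Analysis of Token Selection in Multi-Head Attention	Proposes a geometric analysis framework for multi-head attention that reveals token selection mechanisms and head specialization	large language model
13	RedVisor: Reasoning-Aware Prompt Injection Defense via Zero-Copy KV Cache Reuse	RedVisor: reasoning-aware prompt injection defense via zero-copy KV cache reuse	large language model
14	PRISM: Parametrically Refactoring Inference for Speculative Sampling Draft Models	PRISM: accelerates speculative sampling by parametrically refactoring inference to decouple model capacity from inference cost	large language model
15	Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge	Proposes the Identity Bridge method to address the reversal curse in autoregressive language models	large language model
16	Structure Enables Effective Self-Localization of Errors in LLMs	Proposes the Thought-ICS framework, which uses structured reasoning to let LLMs localize and correct their own errors	chain-of-thought
17	More Than a Quick Glance: Overcoming the Greedy Bias in KV-Cache Compression	LASER-KV: overcomes greedy bias in KV-cache compression via exact LSH recall	large language model
18	Reasoning in a Combinatorial and Constrained World: Benchmarking LLMs on Natural-Language Combinatorial Optimization	Proposes the NLCO benchmark to evaluate LLM reasoning on natural-language combinatorial optimization problems	large language model
19	See2Refine: Vision-Language Feedback Improves LLM-Based eHMI Action Designers	See2Refine: uses vision-language feedback to improve LLM-driven eHMI action design	large language model
20	Constrained Process Maps for Multi-Agent Generative AI Workflows	Proposes constrained process maps for multi-agent generative AI workflows to address uncertainty	large language model
21	Do I Really Know? Learning Factual Self-Verification for Hallucination Reduction	Proposes the VeriFY framework, which learns self-verification to reduce factual hallucinations in LLMs	large language model
22	Human Society-Inspired Approaches to Agentic AI Security: The 4C Framework	Proposes the 4C framework to address the security risks that emerge when agentic AI operates in open environments	large language model
23	GRAB: An LLM-Inspired Sequence-First Click-Through Rate Prediction Modeling Paradigm	GRAB: an LLM-inspired, sequence-first modeling paradigm for click-through rate prediction that improves ad revenue and click-through rate	large language model
24	Meta Engine: A Unified Semantic Query Engine on Heterogeneous LLM-Based Query Systems	Proposes Meta Engine, which unifies heterogeneous LLM-based semantic query systems to address multimodal data querying	large language model
25	Beyond Dense States: Elevating Sparse Transcoders to Active Operators for Latent Reasoning	Proposes LSTR, which elevates sparse transcoders to active operators for latent-space reasoning	chain-of-thought
26	What LLMs Think When You Don't Tell Them What to Think About?	Shows that, without topic guidance, LLMs exhibit significant and systematic topic preferences	large language model
27	The Strategic Foresight of LLMs: Evidence from a Fully Prospective Venture Tournament	LLMs outperform human experts at strategic forecasting, particularly in predicting the success of crowdfunding projects	large language model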

🔬 Pillar 2: RL & Architecture (10 papers)

#	Title	One-Line Summary	Tags	🔗
28	Position: Explaining Behavioral Shifts in Large Language Models Requires a Comparative Approach	Proposes the Δ-XAI framework for explaining behavioral shifts in large language models	reinforcement learning, large language model, foundation model
29	Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents	Uses entropy reduction to optimize tool-use behavior in LLM agents, improving efficiency and performance	reward design, large language model
30	DomusFM: A Foundation Model for Smart-Home Sensor Data	DomusFM: a pretrained foundation model for smart-home sensor data	contrastive learning, foundation model
31	Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models	Proposes GPS, which uses a small model to predict prompt difficulty and efficiently guide RL post-training of large models	reinforcement learning, predictive model, large language model
32	FlowSteer: Interactive Agentic Workflow Orchestration via End-to-End Reinforcement Learning	FlowSteer: interactive agentic workflow orchestration via end-to-end reinforcement learning	reinforcement learning, large language model
33	Edit Knowledge, Not Just Facts via Multi-Step Reasoning over Background Stories	Proposes a knowledge editing method based on multi-step reasoning over background stories, improving knowledge integration and generalization	distillation, large language model
34	ProcMEM: Learning Reusable Procedural Memory from Experience via Non-Parametric PPO for LLM Agents	ProcMEM: learns reusable procedural memory from experience via non-parametric PPO for LLM agents	PPO
35	TABX: A High-Throughput Sandbox Battle Simulator for Multi-Agent Reinforcement Learning	TABX: a high-throughput sandbox battle simulator for multi-agent reinforcement learning	reinforcement learning
36	MAGIC: A Co-Evolving Attacker-Defender Adversarial Game for Robust LLM Safety	MAGIC: a co-evolving attacker-defender adversarial game for strengthening LLM safety	reinforcement learning, large language model	✅
37	Adversarial Reward Auditing for Active Detection and Mitigation of Reward Hacking	Proposes an adversarial reward auditing framework that actively detects and mitigates reward hacking in reward models	reinforcement learning, RLHF

🔬 Pillar 3: Perception & Semantics (1 paper)

#	Title	One-Line Summary	Tags	🔗
38	MarkCleaner: High-Fidelity Watermark Removal via Imperceptible Micro-Geometric Perturbation	MarkCleaner: high-fidelity watermark removal via imperceptible micro-geometric perturbation	gaussian splatting, splatting

🔬 Pillar 4: Generative Motion (1 paper)

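#	Title	One-Line Summary	Tags	🔗
39	Understanding the Reversal Curse Mitigation in Masked Diffusion Models through Attention and Training Dynamics	Explains how masked diffusion models mitigate the reversal curse, through the lens of attention and training dynamics	MDM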

🔬 Pillar 8: Physics-based Animation (1 paper)

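#	Title	One-Line Summary	Tags	🔗
40	Synesthesia of Vehicles: Tactile Data Synthesis from Visual Inputs	Proposes SoV, a vehicle tactile-perception framework that predicts the tactile excitations a vehicle experiences while driving from visual inputs, improving autonomous driving safety	spatiotemporal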
