cs.AI（2026-03-19）

📊 共 31 篇论文 | 🔗 4 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (20 🔗1) 支柱二：RL算法与架构 (RL & Architecture) (8 🔗3) 支柱一：机器人控制 (Robot Control) (2) 支柱五：交互与反应 (Interaction & Reaction) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (20 篇)

#	题目	一句话要点	标签	🔗
1	Cognitive Mismatch in Multimodal Large Language Models for Discrete Symbol Understanding	揭示多模态大语言模型在离散符号理解中的认知失配问题，并提出评测基准。	large language model multimodal
2	dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models	dTRPO：通过轨迹缩减优化扩散大语言模型的策略	large language model instruction following
3	How Uncertainty Estimation Scales with Sampling in Reasoning Models	研究推理模型中基于采样的不确定性估计方法，并提出混合估计器。	chain-of-thought
4	Serendipity by Design: Evaluating the Impact of Cross-domain Mappings on Human and LLM Creativity	跨领域映射提升人类与LLM创造力：设计中的意外发现	large language model
5	Evaluating 5W3H Structured Prompting for Intent Alignment in Human-AI Interaction	提出基于5W3H结构化提示的PPS框架，提升人机交互中意图对齐效果	large language model
6	I Can't Believe It's Corrupt: Evaluating Corruption in Multi-Agent Governance Systems	评估多智能体治理系统中腐败现象，强调制度设计的重要性	large language model
7	Quantitative Introspection in Language Models: Tracking Internal States Across Conversation	提出数值自报告以追踪语言模型的内部状态	large language model
8	Geography According to ChatGPT -- How Generative AI Represents and Reasons about Geography	评估ChatGPT的地理知识表示与推理能力，揭示生成式AI的地理认知局限性	foundation model
9	Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review	研究LLM辅助代码审查中的确认偏差，揭示软件供应链攻击风险	large language model
10	Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures	分析单智能体和多智能体生成式AI架构中的语言刻板印象	chain-of-thought
11	Accurate and Efficient Multi-Channel Time Series Forecasting via Sparse Attention Mechanism	提出Li-Net，通过稀疏注意力机制实现准确高效的多通道时间序列预测。	multimodal
12	An Onto-Relational-Sophic Framework for Governing Synthetic Minds	提出Onto-Relational-Sophic框架，为通用人工智能的治理提供哲学基础。	foundation model
13	ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs	ZebraArena：用于研究工具增强LLM中推理-行动耦合的诊断模拟环境	large language model
14	The Spillover Effects of Peer AI Rinsing on Corporate Green Innovation	提出针对企业AI洗涤行为的政策工具以促进绿色创新	large language model
15	PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents	提出PlanTwin以解决云端规划中的隐私泄露问题	large language model
16	ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models	ItinBench：利用大语言模型在多认知维度上进行规划的基准测试	large language model	✅
17	Learning to Disprove: Formal Counterexample Generation with Large Language Models	提出基于大语言模型的形式化反例生成方法，提升数学推理能力	large language model
18	The Autonomy Tax: Defense Training Breaks LLM Agents	揭示防御训练导致LLM Agent能力退化的“自主性税”，并分析其根本原因。	large language model
19	POET: Power-Oriented Evolutionary Tuning for LLM-Based RTL PPA Optimization	POET：面向功耗优化的LLM驱动RTL代码演化调优框架	large language model
20	PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents	PlanTwin：为云辅助LLM代理提供隐私保护的规划抽象	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (8 篇)

#	题目	一句话要点	标签	🔗
21	Balanced Thinking: Improving Chain of Thought Training in Vision Language Models	提出SCALe，通过动态损失加权改进视觉语言模型中的思维链训练	reinforcement learning multimodal chain-of-thought
22	AlignMamba-2: Enhancing Multimodal Fusion and Sentiment Analysis with Modality-Aware Mamba	AlignMamba-2：利用模态感知Mamba增强多模态融合与情感分析	Mamba multimodal
23	Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs	提出SA-DRL框架，利用语义增强的DRL解决UAV辅助VANET中的网络碎片问题。	reinforcement learning deep reinforcement learning DRL
24	RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models	RewardFlow：利用状态图拓扑结构的奖励传播，提升LLM智能体强化学习效果	reinforcement learning large language model	✅
25	Functional Subspace Watermarking for Large Language Models	提出功能子空间水印(FSW)方法，增强大语言模型水印对参数扰动的鲁棒性。	distillation large language model
26	LuMamba: Latent Unified Mamba for Electrode Topology-Invariant and Efficient EEG Modeling	提出LuMamba以解决EEG建模中的电极拓扑不变性与计算效率问题	Mamba foundation model	✅
27	Box Maze: A Process-Control Architecture for Reliable LLM Reasoning	提出Box Maze框架，通过过程控制架构提升LLM推理的可靠性	reinforcement learning RLHF large language model
28	Memento-Skills: Let Agents Design Agents	Memento-Skills：提出一种通用、可持续学习的LLM Agent系统，实现Agent的自主设计。	reinforcement learning generalist agent	✅

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
29	Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning	提出A2PO，通过强化学习提升MLLM在几何推理中利用辅助线的能力	manipulation reinforcement learning reward shaping
30	Cross-Domain Demo-to-Code via Neurosymbolic Counterfactual Reasoning	提出NeSyCR，通过神经符号反事实推理实现跨域Demo-to-Code	manipulation

🔬 支柱五：交互与反应 (Interaction & Reaction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
31	Secure Linear Alignment of Large Language Models	提出一种安全线性对齐框架，用于跨独立训练的大语言模型进行隐私保护的交叉推理。	OMOMO large language model

⬅️ 返回 cs.AI 首页 · 🏠 返回主页

cs.AI（2026-03-19）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (20 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (8 篇)

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

🔬 支柱五：交互与反应 (Interaction & Reaction) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理