cs.AI（2026-04-28）

📊 共 31 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (20 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (9) 支柱一：机器人控制 (Robot Control) (2)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (20 篇)

#	题目	一句话要点	标签	🔗
1	Learning Generalizable Multimodal Representations for Software Vulnerability Detection	提出MultiVul多模态对比学习框架，提升软件漏洞检测的泛化性	large language model multimodal
2	Walking Through Uncertainty: An Empirical Study of Uncertainty Estimation for Audio-Aware Large Language Models	首个音频感知大语言模型不确定性估计的系统性实证研究	large language model
3	DualFact+: A Multimodal Fact Verification Framework for Procedural Video Understanding	提出DualFact+框架，用于程序视频理解中的多模态事实核查。	multimodal
4	From Insight to Action: A Novel Framework for Interpretability-Guided Data Selection in Large Language Models	提出IGDS框架，利用可解释性指导大语言模型的数据选择，提升模型性能。	large language model
5	From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling	提出Agora-Opt，利用去中心化辩论和记忆增强LLM Agent解决优化建模问题	large language model	✅
6	Making AI-Assisted Grant Evaluation Auditable without Exposing the Model	提出基于TEE的架构，在不暴露模型的前提下，实现AI辅助资助评估的可审计性。	large language model
7	Doing More With Less: Revisiting the Effectiveness of LLM Pruning for Test-Time Scaling	非结构化剪枝提升LLM在测试时计算扩展中的推理性能	large language model
8	Towards Agentic Investigation of Security Alerts	提出基于LLM的Agentic安全告警调查工作流，提升告警判定的准确性。	large language model
9	SAFEdit: Does Multi-Agent Decomposition Resolve the Reliability Challenges of Instructed Code Editing?	SAFEdit：多智能体分解框架提升指令驱动代码编辑的可靠性	large language model
10	Think Before You Act -- A Neurocognitive Governance Model for Autonomous AI Agents	提出神经认知治理模型PAGRL，提升自主AI Agent在复杂环境下的安全性与合规性	large language model
11	HotComment: A Benchmark for Evaluating Popularity of Online Comments	提出HotComment基准，用于评估在线评论的受欢迎程度，并引入StyleCmt模型。	multimodal
12	The Nonverbal Syntax Framework: An Evidence-Based Tiered System for Inferring Learner States from Observable Behavioral Cues	提出非语言语法框架，通过可观察行为线索推断学习者状态	multimodal
13	Emotive Architectures: The Role of LLMs in Adjusting Work Environments	利用LLM构建情感感知工作环境，提升用户体验与福祉	large language model
14	SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents	SnapGuard：针对截图Web代理的轻量级Prompt注入检测方法	multimodal
15	Assistants, Not Architects: The Role of LLMs in Networked Systems Design	提出Kepler框架，解决LLM在网络系统架构设计中不可靠的问题	large language model
16	SciEval: A Benchmark for Automatic Evaluation of K-12 Science Instructional Materials	SciEval：构建K-12科学教学材料自动评估基准，并验证领域微调的有效性。	large language model
17	AHASD: Asynchronous Heterogeneous Architecture for LLM Adaptive Drafting Speculative Decoding on Mobile Devices	提出AHASD以解决移动设备上LLM自适应草拟的效率问题	large language model
18	DATAREEL: Automated Data-Driven Video Story Generation with Animations	DataReel：提出一个自动生成动画数据视频故事的基准和多智能体框架	large language model	✅
19	Where Did It Go Wrong? Capability-Oriented Failure Attribution for Vision-and-Language Navigation Agents	提出面向能力的测试方法，用于视觉-语言导航Agent的故障归因	VLN
20	Agentic Architect: An Agentic AI Framework for Architecture Design Exploration and Optimization	Agentic Architect：基于Agentic AI的计算机体系结构设计探索与优化框架	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (9 篇)

#	题目	一句话要点	标签
21	Three Models of RLHF Annotation: Extension, Evidence, and Authority	提出RLHF标注的三种模型，优化人类反馈强化学习流程	reinforcement learning RLHF large language model
22	How Can Reinforcement Learning Achieve Expert-level Placement?	提出基于专家布局学习的强化学习方法，提升芯片布局质量	reinforcement learning reward design
23	Semi-Markov Reinforcement Learning for City-Scale EV Ride-Hailing with Feasibility-Guaranteed Actions	提出基于半马尔可夫强化学习的城市级电动汽车网约车控制方法，保证动作可行性。	reinforcement learning SAC
24	Sample-efficient Neuro-symbolic Proximal Policy Optimization	提出神经符号近端策略优化，提升DRL在稀疏奖励和长规划任务中的样本效率	reinforcement learning deep reinforcement learning DRL
25	Improving Zero-Shot Offline RL via Behavioral Task Sampling	提出基于行为任务采样的离线零样本强化学习方法，提升泛化性能。	reinforcement learning offline RL
26	RADD: Retrieval-Augmented Discrete Diffusion for Multi-Modal Knowledge Graph Completion	提出RADD框架，解耦检索与重排序，提升多模态知识图谱补全性能。	distillation multimodal
27	JURY-RL: Votes Propose, Proofs Dispose for Label-Free RLVR	JURY-RL：基于投票提议与形式化验证的无标签强化学习	reinforcement learning large language model
28	Multi-action Tangled Program Graphs for Multi-task Reinforcement Learning with Continuous Control	提出基于多动作缠结程序图的MATPG算法，用于连续控制多任务强化学习。	reinforcement learning
29	Evaluating Risks in Weak-to-Strong Alignment: A Bias-Variance Perspective	通过偏差-方差视角评估弱到强对齐中的风险，揭示强模型方差是欺骗性错误的早期预警信号。	reinforcement learning RLHF

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
30	Large language models eroding science understanding: an experimental study	大型语言模型易受伪科学影响，损害科学认知	manipulation large language model
31	PHISHREV: A Hybrid Machine Learning and Post-Hoc Non-monotonic Reasoning Framework for Context-Aware Phishing Website Classification	提出PHISHREV框架以解决网络钓鱼网站分类中的上下文推理问题	manipulation

⬅️ 返回 cs.AI 首页 · 🏠 返回主页

cs.AI（2026-04-28）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (20 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (9 篇)

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理