cs.AI（2026-06-01）

📊 共 43 篇论文 | 🔗 9 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (21 🔗4) 支柱二：RL算法与架构 (RL & Architecture) (20 🔗5) 支柱一：机器人控制 (Robot Control) (1) 支柱六：视频提取与匹配 (Video Extraction) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (21 篇)

#	题目	一句话要点	标签	🔗
1	Compliance-Scored Best-of-N Guardrail Orchestration for Multimodal Document Generation in Payments Dispute Defense	提出合规性评分的Best-of-N机制，用于支付争议防御中的多模态文档生成	multimodal
2	Not All Errors Are Equal: A Systematic Study of Error Propagation in Large Language Model Inference	提出LLMFI框架，系统研究大语言模型推理中的错误传播问题	large language model
3	RA-LWLM: Retrieval-Augmented In-Context Localization with Wireless Foundation Models	提出RA-LWLM，利用无线基础模型实现免训练的跨场景定位。	foundation model
4	Boosting Multimodal Federated Learning via Chained Modality Optimization	提出FedMChain，通过链式模态优化提升多模态联邦学习性能。	multimodal
5	Demystifying Multimodal Biomolecular Co-design With Intrinsic Geodesic Coupling	提出GeoCoupling框架以优化生物分子多模态协同设计	multimodal
6	eMoT: evolving Memory-of-Thought via Symbolic Anchoring and Memory Corrosion	提出eMoT框架，通过演进式记忆和符号锚定提升LLM多步推理的可靠性。	large language model chain-of-thought
7	MOSS-Audio Technical Report	MOSS-Audio：面向语音、环境声和音乐理解的统一音视频语言模型	large language model instruction following
8	HLL: Can Agents Cross Humanity's Last Line of Verification?	提出HLL基准测试，评估多模态Agent在交互式验证码破解中的类人能力。	multimodal	✅
9	Iteris: Agentic Research Loops for Computational Mathematics	Iteris：面向计算数学的Agentic研究环路系统	large language model
10	Evidence-Gated LLM Priors for Multi-Objective Bayesian Optimization	提出证据门控LLM先验的多目标贝叶斯优化方法，提升黑盒优化中LLM建议的可靠性。	large language model
11	Bridging the Last Mile of Time Series Forecasting with LLM Agents	提出基于LLM Agent的时间序列预测框架，弥合统计预测与业务应用间的差距	foundation model
12	MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation	MCP-Persona：通过环境模拟评估LLM Agent在真实个人应用中的性能	large language model	✅
13	MOC: Multi-Order Communication in LLM-based Multi-Agent Systems	提出MOC多阶通信方案，提升LLM多智能体系统中的信息传递效率与任务性能。	large language model	✅
14	POIROT: Interrogating Agents for Failure Detection in Multi-Agent Systems	提出POIROT以解决多智能体系统中的故障检测问题	large language model
15	SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes	SMH-Bench：用于评估LLM智能家居环境推理与行动能力的综合基准	large language model
16	WorldCoder-Bench: Benchmarking Physically Grounded 3D World Synthesis	提出WorldCoder-Bench，用于评估LLM在物理规则3D世界合成中的能力。	large language model
17	RadioMaster: Multi-Agent System for Autonomous Radio Signal Generation	RadioMaster：用于自主无线信号生成的多智能体系统	large language model
18	Does Compression Preserve Uncertainty? A Unified Benchmark for Quantized and Sparse LLMs via Conformal Prediction	提出基于Conformal Prediction的统一基准，评估压缩LLM的不确定性保持能力。	large language model
19	Consistency evaluation of benchmarks used for causal discovery	提出一种基于LLM的自动pipeline，用于评估因果发现benchmark的知识一致性。	large language model
20	Revisiting Ripple Effects in Knowledge Editing through Pressure-Aware Joint Neighborhood Optimization	提出JNO框架，通过压力感知联合邻域优化解决知识编辑中的涟漪效应。	large language model
21	RoleCDE:Benchmarking and Mitigating Role-Alignment Trade-offs in Role-Playing Agents	RoleCDE：用于评估和缓解角色扮演智能体中角色对齐权衡的基准	large language model	✅

🔬 支柱二：RL算法与架构 (RL & Architecture) (20 篇)

#	题目	一句话要点	标签	🔗
22	Spatial Representation Learning Beyond Pixels: Unifying Raster Data and Vector Semantics for Human-Centric Geospatial Foundation Models	提出统一空间表征学习框架，融合栅格数据与矢量语义，构建以人为中心的地理空间基础模型。	representation learning foundation model multimodal
23	TrafficRAG: A Multimodal RAG Framework for Traffic Accident Liability Determination	TrafficRAG：多模态检索增强框架，用于交通责任事故判定	MAE large language model multimodal
24	Explainable Data-driven Deep Reinforcement Learning Methods for Optimal Energy Management in Buildings	提出可解释深度强化学习框架，优化建筑能源管理并提升用户信任	reinforcement learning deep reinforcement learning DRL
25	COMAP: Co-Evolving World Models and Agent Policies for LLM Agents	COMAP：面向LLM Agent的协同进化世界模型与策略，提升交互环境决策能力	world model world models distillation	✅
26	Echo: A Joint-Embedding Predictive Architecture for Speaker Diarization and Speech Recognition in a Shared Latent Space	Echo：基于共享隐空间的联合嵌入预测架构，用于说话人分离和语音识别	JEPA Joint-Embedding Predictive Architecture joint-embedding predictive architecture
27	EvoBrain: Continual Learning of EEG Foundation Models Across Heterogeneous BCI Tasks	EvoBrain：面向异构BCI任务的脑电基础模型持续学习框架	distillation foundation model
28	SafeSteer: Localized On-Policy Distillation for Efficient Safety Alignment	SafeSteer：面向安全对齐的局部化On-Policy蒸馏方法	distillation large language model	✅
29	Learning When Not to Act: Mitigating Tool Abuse in Agentic Reinforcement Learning	EAPO：通过学习何时不行动来缓解Agentic强化学习中的工具滥用问题	reinforcement learning policy learning reward shaping
30	SafeMCP: Proactive Power Regulation for LLM Agent Defense via Environment-Grounded Look-Ahead Reasoning	SafeMCP：通过环境感知的前瞻推理实现LLM Agent的主动式能力管控	reinforcement learning world model world models
31	Community-Aware Assessment of Social Textual Engagement and Resonance: A Human-Centric Perspective on User-Generated Content Evaluation	提出MEDEA模型，通过模拟社群共鸣评估用户生成内容质量，超越传统视觉保真度指标。	reinforcement learning multimodal chain-of-thought
32	SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training	SIRI：通过自内部化强化学习与内在技能训练LLM Agent	reinforcement learning distillation	✅
33	S-SPPO: Semantic-Calibrated Self-Play Preference Optimization	S-SPPO：通过语义校准的自博弈偏好优化，解决LLM对齐中的策略退化问题	DPO direct preference optimization large language model	✅
34	Coordination Graphs for Constrained Multi-Agent Reinforcement Learning	提出CG-CMARL框架，通过协调图和拉格朗日对偶解决约束多智能体强化学习问题	reinforcement learning reward shaping
35	Physically-Constrained Mamba-SDE for Remaining Useful Life Prediction under Irregular Observations	提出PC-MambaSDE，解决不规则观测下剩余寿命预测的物理约束问题	latent dynamics Mamba
36	JenBridge: Adaptive Long-Form Video Soundtracking across Scene Transitions	提出JenBridge，解决长视频场景过渡中配乐连贯性问题	flow matching large language model
37	Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses	Harness-1：利用强化学习和外部状态管理提升搜索Agent性能	reinforcement learning	✅
38	EVA-Net: Subject-Independent EEG Motor Decoding with Video-Derived Motor Priors	EVA-Net：利用视频运动先验实现与受试者无关的脑电运动解码	distillation multimodal
39	TriAlign: Towards Universal Truth Consistency in Personalized LLM Alignment	TriAlign：面向个性化LLM对齐的通用真值一致性方法	reinforcement learning large language model
40	TRON: Targeted Rule-Verifiable Online Environments for Visual Reasoning RL	TRON：面向视觉推理强化学习的可控规则验证在线环境	reinforcement learning multimodal
41	ReSkill: Reconciling Skill Creation with Policy Optimization in Agentic RL	ReSkill：在Agentic RL中协调技能创建与策略优化	reinforcement learning policy learning

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
42	Bridging the Sim-to-Real Gap in Semiconductor Visual Program Synthesis via Input Binarization	提出基于输入二值化的视觉程序合成方法，弥合半导体图像Sim-to-Real差距	manipulation sim-to-real

🔬 支柱六：视频提取与匹配 (Video Extraction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
43	Token Predictors Are Not Planners: Building Physically Grounded Causal Reasoners	提出Causal-Plan-Bench和Causal-Plan-1M，提升具身智能体物理因果推理能力。	egocentric

⬅️ 返回 cs.AI 首页 · 🏠 返回主页

cs.AI（2026-06-01）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (21 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (20 篇)

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

🔬 支柱六：视频提取与匹配 (Video Extraction) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理