cs.AI（2025-09-29）

📊 共 28 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (19 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (8) 支柱三：空间感知与语义 (Perception & Semantics) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (19 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Building the EHR Foundation Model via Next Event Prediction	提出基于事件预测的EHR基础模型，提升LLM在临床时序推理能力	large language model foundation model
2	Radiology's Last Exam (RadLE): Benchmarking Frontier Multimodal AI Against Human Experts and a Taxonomy of Visual Reasoning Errors in Radiology	RadLE：放射学专家级诊断基准，评估多模态AI并分析视觉推理错误	large language model multimodal
3	TimeOmni-1: Incentivizing Complex Reasoning with Time Series in Large Language Models	提出TimeOmni-1，通过时间序列推理套件TSR-Suite，解决大语言模型在复杂时间序列推理任务中的挑战。	large language model multimodal
4	Bridging the behavior-neural gap: A multimodal AI reveals the brain's geometry of emotion more accurately than human self-reports	多模态AI超越人类自报告，更准确揭示大脑情感几何	large language model multimodal	✅
5	Evaluating Foundation Models with Pathological Concept Learning for Kidney Cancer	利用病理概念学习评估肾癌Foundation Model的转化能力	foundation model	✅
6	AdvChain: Adversarial Chain-of-Thought Tuning for Robust Safety Alignment of Large Reasoning Models	AdvChain：对抗性思维链微调，提升大型推理模型安全对齐的鲁棒性	chain-of-thought
7	MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech	MGM-Omni：面向个性化长时程语音的通用多模态大语言模型	multimodal
8	Causal Autoencoder-like Generation of Feedback Fuzzy Cognitive Maps with an LLM Agent	提出基于LLM的反馈模糊认知图自编码器，实现可解释的因果关系学习。	large language model
9	ATLAS: Constraints-Aware Multi-Agent Collaboration for Real-World Travel Planning	ATLAS：面向真实旅行规划的约束感知多智能体协作框架	large language model
10	A(I)nimism: Re-enchanting the World Through AI-Mediated Object Interaction	A(I)nimism：通过AI中介的物体交互，探索万物有灵论的新可能	large language model
11	Toxicity in Online Platforms and AI Systems: A Survey of Needs, Challenges, Mitigations, and Future Directions	提出在线平台和AI系统中内容毒性的综合分类，并探讨检测与缓解策略。	large language model
12	Adaptive Test-Time Reasoning via Reward-Guided Dual-Phase Search	提出基于奖励引导的双阶段搜索，提升LLM在推理任务中的效率与准确性。	large language model
13	AutoCode: LLMs as Problem Setters for Competitive Programming	AutoCode：利用大型语言模型自动生成竞赛级编程题目	large language model
14	ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory	提出 ReasoningBank，通过推理记忆和自进化提升Agent在持续任务中的性能。	large language model
15	Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents	提出LLM Agent自复制风险评估框架，揭示实际应用中潜在安全隐患	large language model
16	Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution	Flash-Searcher：基于DAG并行执行的快速高效Web Agent	large language model
17	MASLegalBench: Benchmarking Multi-Agent Systems in Deductive Legal Reasoning	提出MASLegalBench：用于评估多智能体系统在演绎法律推理中的性能	large language model
18	Neural network embeddings recover value dimensions from psychometric survey items on par with human data	提出SQuID方法，利用神经网络嵌入从心理测量问卷条目中恢复价值维度，性能与人类数据相当。	large language model
19	Experience-Guided Reflective Co-Evolution of Prompts and Heuristics for Automatic Algorithm Design	提出EvoPH框架，通过经验引导的提示与启发式算法协同进化，实现自动算法设计	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (8 篇)

#	题目	一句话要点	标签	🔗	⭐
20	Training Agents Inside of Scalable World Models	Dreamer 4：通过可扩展世界模型在Minecraft中实现离线钻石获取	reinforcement learning world model dreamer
21	Hybrid Reward Normalization for Process-supervised Non-verifiable Agentic Tasks	提出PPR方法，通过混合奖励归一化提升Agent在非验证任务中的表现	reinforcement learning large language model
22	Modeling Others' Minds as Code	ROTE：利用程序合成高效预测人类及AI行为，提升人机协作	behavior cloning large language model
23	DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search	DeepSearch：通过蒙特卡洛树搜索和可验证奖励克服强化学习瓶颈	reinforcement learning
24	The Era of Real-World Human Interaction: RL from User Conversations	提出基于用户对话的强化学习(RLHI)，实现个性化对齐和持续模型改进。	reinforcement learning instruction following
25	Pushing LLMs to Their Logical Reasoning Bound: The Role of Data Reasoning Intensity	提出数据推理强度（DRI）指标，优化训练数据以提升LLM逻辑推理能力。	reinforcement learning large language model
26	Towards Safe Reasoning in Large Reasoning Models via Corrective Intervention	提出Intervened Preference Optimization以提升大型推理模型安全性	preference learning chain-of-thought
27	Learning to Interact in World Latent for Team Coordination	提出交互世界隐空间(IWoL)框架，促进多智能体强化学习中的团队协作	reinforcement learning representation learning

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
28	Vision-and-Language Navigation with Analogical Textual Descriptions in LLMs	提出基于LLM中类比文本描述的视觉-语言导航方法，提升导航性能。	scene understanding embodied AI VLN

⬅️ 返回 cs.AI 首页 · 🏠 返回主页