cs.AI (2025-09-29)
📊 38 papers | 🔗 2 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (24 🔗2)
Pillar 2: RL Algorithms & Architecture (13)
Pillar 3: Spatial Perception & Semantics (1)
🔬 Pillar 9: Embodied Foundation Models (24 papers)
🔬 Pillar 2: RL Algorithms & Architecture (13 papers)
| # | Title | One-line Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 25 | Uni-NTFM: A Unified Foundation Model for EEG Signal Representation Learning | Uni-NTFM: a unified neural topological foundation model for EEG signal representation learning | representation learning, foundation model | | |
| 26 | RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment | RE-PO: a general framework for LLM alignment that addresses label noise via robust enhanced policy optimization | reinforcement learning, RLHF, DPO | | |
| 27 | Training Agents Inside of Scalable World Models | Dreamer 4: achieves offline diamond acquisition in Minecraft via a scalable world model | reinforcement learning, world model, dreamer | | |
| 28 | Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models | Reveals the competing mechanisms of reasoning and retrieval in LLMs and proposes FARL to strengthen reasoning ability | reinforcement learning, distillation, chain-of-thought | | |
| 29 | RL in the Wild: Characterizing RLVR Training in LLM Deployment | Characterizes the systems challenges of RLVR training in LLM deployment and proposes the PolyTrace benchmark suite | reinforcement learning, large language model | | |
| 30 | Hybrid Reward Normalization for Process-supervised Non-verifiable Agentic Tasks | Proposes Principle Process Rewards to address feedback sparsity in long-trajectory tasks | reinforcement learning, large language model | | |
| 31 | Modeling Others' Minds as Code | ROTE: uses program synthesis to efficiently predict human behavior and improve human-AI collaboration | behavior cloning, large language model | | |
| 32 | DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search | DeepSearch: overcomes the reinforcement learning bottleneck via Monte Carlo tree search with verifiable rewards | reinforcement learning | | |
| 33 | The Era of Real-World Human Interaction: RL from User Conversations | Proposes reinforcement learning from user conversations (RLHI), enabling continual model improvement and multi-faceted alignment | reinforcement learning, instruction following | | |
| 34 | Pushing LLMs to Their Logical Reasoning Bound: The Role of Data Reasoning Intensity | Proposes the Data Reasoning Intensity (DRI) metric to optimize training data and improve LLM logical reasoning | reinforcement learning, large language model | | |
| 35 | Towards Safe Reasoning in Large Reasoning Models via Corrective Intervention | Proposes Intervened Preference Optimization to improve the safety of large reasoning models | preference learning, chain-of-thought | | |
| 36 | Humanline: Online Alignment as Perceptual Loss | Proposes Humanline, online alignment via a perceptual loss, improving model consistency with human preferences | PPO, DPO | | |
| 37 | Unifying Agent Interaction and World Information for Multi-agent Coordination | Proposes the IWoL framework, unifying interaction and world information to promote multi-agent coordination | reinforcement learning, representation learning | | |
🔬 Pillar 3: Spatial Perception & Semantics (1 paper)
| # | Title | One-line Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 38 | Vision-and-Language Navigation with Analogical Textual Descriptions in LLMs | Proposes an LLM-based vision-and-language navigation method using analogical textual descriptions, improving scene understanding and spatial reasoning | scene understanding, embodied AI, VLN | | |