cs.AI (2025-07-31)

📊 26 papers in total | 🔗 3 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (20 🔗3) Pillar 2: RL & Architecture (6)

🔬 Pillar 9: Embodied Foundation Models (20 papers)

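#	Title	One-line summary	Tags	🔗
1	LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring	LLMs can covertly and strategically underperform on capability evaluations while under CoT monitoring	chain-of-thought
2	Automated Feedback on Student-Generated UML and ER Diagrams Using Large Language Models	DUET: uses large language models to give automated feedback on student-generated UML and ER diagrams	large language model
3	CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks	Proposes CoT-Self-Instruct, which improves LLM performance on reasoning and non-reasoning tasks through high-quality synthetic data	instruction following chain-of-thought
4	MECAT: A Multi-Experts Constructed Benchmark for Fine-Grained Audio Understanding Tasks	MECAT: builds a multi-expert benchmark to improve performance on fine-grained audio understanding tasks	large language model chain-of-thought	✅
5	Causal Reasoning in Pieces: Modular In-Context Learning for Causal Discovery	Proposes a modular in-context learning framework that improves the causal discovery ability of large language models	large language model chain-of-thought
6	Self-Foveate: Enhancing Diversity and Difficulty of Synthesized Instructions from Unsupervised Text via Multi-Level Foveation	Proposes Self-Foveate, which uses a multi-level foveation mechanism to increase the diversity and difficulty of synthesized instruction data	large language model instruction following	✅
7	LLM-Based Identification of Infostealer Infection Vectors from Screenshots: The Case of Aurora	Uses LLMs to identify infection vectors from infostealer infection screenshots, with Aurora as the case study	large language model
8	Accessibility Scout: Personalized Accessibility Scans of Built Environments	Accessibility Scout: an LLM-based system for personalized accessibility scans of built environments	large language model
9	A Survey on Code Generation with LLM-based Agents	Surveys LLM-based agents for code generation, covering techniques, applications, evaluation, and challenges	large language model
10	DeformTune: A Deformable XAI Music Prototype for Non-Musicians	DeformTune: a deformable XAI music prototype system for non-musicians	multimodal
11	A survey of multi-agent geosimulation methodologies: from ABM to LLM	Surveys multi-agent geosimulation methodologies: the evolution and integration from ABM to LLM	large language model
12	MemoCue: Empowering LLM-Based Agents for Human Memory Recall via Strategy-Guided Querying	Proposes MemoCue, which uses strategy-guided querying to improve LLM performance in assisting human memory recall	large language model
13	DICE: Dynamic In-Context Example Selection in LLM Agents via Efficient Knowledge Transfer	DICE: dynamic in-context example selection in LLM agents via efficient knowledge transfer	large language model
14	Chatting with your ERP: A Recipe	Proposes a dual-agent architecture that uses LLMs to enable natural-language queries over industrial ERP systems	large language model
15	LLM4Rail: An LLM-Augmented Railway Service Consulting Platform	LLM4Rail: an LLM-based railway service consulting platform that offers personalized services	large language model
16	Trae Agent: An LLM-based Agent for Software Engineering with Test-time Scaling	Trae Agent: an LLM-based software engineering agent with test-time scaling for resolving code defects	large language model	✅
17	"I made this (sort of)": Negotiating authorship, confronting fraudulence, and exploring new musical spaces with prompt-based AI music generation	Uses prompt-based AI music generation to explore authorship, fraudulence, and new musical spaces	large language model
18	DSBC : Data Science task Benchmarking with Context engineering	DSBC: benchmarks data science tasks with context engineering to evaluate LLM performance in practical applications	large language model
19	How Far Are AI Scientists from Changing the World?	Surveys AI Scientist systems, examining their potential and bottlenecks in changing research paradigms and tackling major challenges	large language model
20	AutoBridge: Automating Smart Device Integration with Centralized Platform	AutoBridge: automates the integration of smart devices with a centralized platform, with no manual intervention	multimodal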

🔬 Pillar 2: RL & Architecture (6 papers)

#	Title	One-line summary	Tags
21	A Survey of Multimodal Ophthalmic Diagnostics: From Task-Specific Approaches to Foundational Models	A survey of multimodal ophthalmic diagnostics: from task-specific methods to foundation models	reinforcement learning large language model foundation model
22	SimuRA: A World-Model-Driven Simulative Reasoning Architecture for General Goal-Oriented Agents	SimuRA: a world-model-based simulative reasoning architecture for general goal-oriented agents	world model foundation model
23	RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization	RL-PLUS: hybrid-policy optimization that counters capability boundary collapse of LLMs in reinforcement learning	reinforcement learning large language model
24	Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving	Proposes Seed-Prover, a deep and broad reasoning model for automated theorem proving	reinforcement learning IMoS chain-of-thought
25	Hyperproperty-Constrained Secure Reinforcement Learning	Proposes a safe reinforcement learning algorithm based on HyperTWTL constraints to ensure the safety of robot tasks	reinforcement learning
26	Model-Based Soft Maximization of Suitable Metrics of Long-Term Human Power	Proposes a model-based method for softly maximizing metrics of long-term human power, aiming to improve AI safety and human well-being	reinforcement learning world model
