cs.AI(2025-07-01)
📊 共 4 篇论文
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning | 研究表明数学推理能力提升不一定带来通用LLM能力提升,SFT可能导致能力遗忘。 | reinforcement learning large language model instruction following | ||
| 2 | ASTRO: Teaching Language Models to Reason by Reflecting and Backtracking In-Context | ASTRO:通过上下文反思与回溯,教导语言模型进行推理 | reinforcement learning large language model chain-of-thought |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Enhancing LLM Agent Safety via Causal Influence Prompting | 提出因果影响提示CIP,提升LLM Agent在复杂任务中的安全性 | large language model | ||
| 4 | iPanda: An LLM-based Agent for Automated Conformance Testing of Communication Protocols | 提出iPanda以自动化通信协议的一致性测试问题 | large language model |