cs.AI (2025-07-01)

📊 4 papers in total

🎯 Topic navigation

Pillar 2: RL Algorithms & Architecture (2) · Pillar 9: Embodied Foundation Models (2)

🔬 Pillar 2: RL Algorithms & Architecture (2 papers)

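| # | Title | One-sentence summary | Tags |
|---|-------|----------------------|------|
| 1 | Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning | The study finds that gains in math reasoning do not necessarily transfer to general LLM capabilities, and that SFT may cause forgetting of existing capabilities. | reinforcement learning, large language model, instruction following |
| 2 | ASTRO: Teaching Language Models to Reason by Reflecting and Backtracking In-Context | ASTRO teaches language models to reason by reflecting and backtracking in context. | reinforcement learning, large language model, chain-of-thought |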

🔬 Pillar 9: Embodied Foundation Models (2 papers)

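| # | Title | One-sentence summary | Tags |
|---|-------|----------------------|------|
| 3 | Enhancing LLM Agent Safety via Causal Influence Prompting | Proposes Causal Influence Prompting (CIP) to improve the safety of LLM agents on complex tasks. | large language model |
| 4 | iPanda: An LLM-based Agent for Automated Conformance Testing of Communication Protocols | Proposes iPanda to automate conformance testing of communication protocols. | large language model |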
