cs.CL (2025-09-03)
📊 19 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (14)
Pillar 2: RL & Architecture (4)
Pillar 8: Physics-based Animation (1 🔗1)
🔬 Pillar 9: Embodied Foundation Models (14 papers)
🔬 Pillar 2: RL & Architecture (4 papers)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Design and Optimization of Reinforcement Learning-Based Agents in Text-Based Games | Proposes a reinforcement-learning-based method for designing and optimizing text-game agents, significantly improving game completion rate and win rate. | reinforcement learning, deep reinforcement learning, world model | | |
| 16 | Advancing SLM Tool-Use Capability using Reinforcement Learning | Uses the reinforcement learning algorithm GRPO to improve the tool-use capability of small language models (SLMs). | reinforcement learning, large language model | | |
| 17 | Breaking the Mirror: Activation-Based Mitigation of Self-Preference in LLM Evaluators | Proposes an activation-based intervention method to mitigate self-preference in LLM evaluators. | direct preference optimization, large language model | | |
| 18 | Training LLMs to be Better Text Embedders through Bidirectional Reconstruction | Proposes a bidirectional reconstruction training method that improves LLM text embeddings on retrieval and reranking tasks. | contrastive learning, large language model | | |
🔬 Pillar 8: Physics-based Animation (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 19 | ResearchPulse: Building Method-Experiment Chains through Multi-Document Scientific Inference | Proposes ResearchPulse to address multi-document scientific inference. | PULSE | ✅ | |