cs.CL(2025-03-23)
📊 共 15 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (11 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (3)
支柱一:机器人控制 (Robot Control) (1 🔗1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (11 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment | 提出DR-IRL,通过动态奖励调整提升LLM安全对齐效果 | reinforcement learning inverse reinforcement learning large language model | ||
| 13 | Understanding the Effects of RLHF on the Quality and Detectability of LLM-Generated Texts | 研究表明RLHF虽提升LLM文本质量,但也使其更易被检测且产生冗长重复内容 | reinforcement learning RLHF large language model | ||
| 14 | $D^2LoRA$: Data-Driven LoRA Initialization for Low Resource Tasks | 提出D²LoRA,一种数据驱动的LoRA初始化方法,提升低资源任务下的微调效率。 | DPO direct preference optimization large language model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | GeoBenchX: Benchmarking LLMs in Agent Solving Multistep Geospatial Tasks | GeoBenchX:评估LLM在多步骤地理空间任务中Agent工具调用能力的基准 | manipulation large language model | ✅ |