cs.CL(2024-08-14)
📊 共 15 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (10 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (3 🔗1)
支柱一:机器人控制 (Robot Control) (2)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization | 提出BMC框架,通过桥接和建模成对数据相关性,提升DPO的对齐性能 | DPO direct preference optimization large language model | ✅ | |
| 12 | Large Language Models Prompting With Episodic Memory | 提出基于情景记忆的大语言模型提示优化方法POEM,提升小样本学习性能。 | reinforcement learning large language model | ||
| 13 | Large Language Models Know What Makes Exemplary Contexts | 提出基于强化学习的上下文学习框架,提升大语言模型的Few-shot性能 | reinforcement learning large language model |
🔬 支柱一:机器人控制 (Robot Control) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 14 | Enhanced Detection of Conversational Mental Manipulation Through Advanced Prompting Techniques | 探索高级Prompting技术在对话式精神操控检测中的有效性 | manipulation chain-of-thought | ||
| 15 | Assessing the Role of Lexical Semantics in Cross-lingual Transfer through Controlled Manipulations | 通过可控操纵评估词汇语义在跨语言迁移中的作用 | manipulation |