cs.CL(2024-11-22)
📊 12 papers in total
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (7)
Pillar 2: RL & Architecture (3)
Pillar 1: Robot Control (2)
🔬 Pillar 9: Embodied Foundation Models (7 papers)
🔬 Pillar 2: RL & Architecture (3 papers)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 8 | On the Impact of Fine-Tuning on Chain-of-Thought Reasoning | Shows that fine-tuning reduces the reliability of chain-of-thought reasoning in large language models. | reinforcement learning RLHF large language model | | |
| 9 | Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation | Proposes TAIL, which combines synthetic label generation with knowledge distillation to tackle information extraction from heterogeneous documents. | distillation multimodal | | |
| 10 | Tulu 3: Pushing Frontiers in Open Language Model Post-Training | Tulu 3: a breakthrough in open language model post-training that surpasses Llama 3.1 Instruct and some closed-source models. | reinforcement learning DPO direct preference optimization | | |
🔬 Pillar 1: Robot Control (2 papers)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Evaluating LLM Prompts for Data Augmentation in Multi-label Classification of Ecological Texts | Uses LLM prompts for data augmentation to improve multi-label classification of ecological texts. | manipulation large language model | | |
| 12 | Universal and Context-Independent Triggers for Precise Control of LLM Outputs | Proposes universal, context-independent triggers that enable precise control of LLM outputs. | manipulation large language model | | |