cs.CL(2024-08-22)
📊 22 papers in total | 🔗 4 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (16 🔗3)
Pillar 2: RL & Architecture (5)
Pillar 1: Robot Control (1 🔗1)
🔬 Pillar 9: Embodied Foundation Models (16 papers)
🔬 Pillar 2: RL & Architecture (5 papers)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
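| 17 | FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation | Proposes FIRST, which trains a reliable large language model through efficient, trustworthy distillation. | distillation, large language model | | |
| 18 | RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment | RuleAlign improves large language models' performance in medical diagnosis through diagnostic rule alignment. | preference learning, large language model | | |
| 19 | Jamba-1.5: Hybrid Transformer-Mamba Models at Scale | Jamba-1.5: large-scale language models built on a hybrid Transformer-Mamba architecture, achieving high throughput and low memory usage. | Mamba, large language model, instruction following | | |
| 20 | Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models | Proposes the DualChecker framework, which mitigates hallucinations in large language model distillation and improves model performance. | distillation, large language model | | |
| 21 | Preference-Guided Reflective Sampling for Aligning Language Models | Proposes Preference-Guided Reflective Sampling (PRS) to improve the alignment of language models with human preferences. | offline RL, large language model | | |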
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 22 | Controllable Text Generation for Large Language Models: A Survey | A survey of controllable text generation for large language models, covering precise control over content and attributes. | manipulation, reinforcement learning, large language model | ✅ | |