cs.CL (2025-09-06)
📊 10 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (5 🔗1)
Pillar 2: RL & Architecture (4)
Pillar 1: Robot Control (1)
🔬 Pillar 9: Embodied Foundation Models (5 papers)
| # | Title | One-Line Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian | Llama-GENBA-10B: a trilingual large language model for German, English, and Bavarian | large language model, foundation model | | |
| 2 | Icon$^{2}$: Aligning Large Language Models Using Self-Synthetic Preference Data via Inherent Regulation | Icon²: aligns large language models with self-synthesized preference data produced through the LLM's inherent regulation | large language model | | |
| 3 | Cross-Question Method Reuse in Large Language Models: From Word-Level Prediction to Rational Logical-Layer Reasoning | Proposes a cross-question method reuse framework that extends LLM reasoning to low-similarity questions | large language model | | |
| 4 | Enhancing Factual Accuracy and Citation Generation in LLMs via Multi-Stage Self-Verification | Proposes VeriFact-CoT, which improves the factual accuracy and citation generation quality of LLMs through multi-stage self-verification | large language model, chain-of-thought | | |
| 5 | LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding | LM-Searcher: cross-domain neural architecture search using LLMs and a unified numerical encoding | large language model | ✅ | |
🔬 Pillar 2: RL & Architecture (4 papers)
| # | Title | One-Line Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation | Proposes CoPeD, which mitigates spurious correlations in CoT data through correctness perception distillation | distillation, large language model, chain-of-thought | | |
| 7 | Using Contrastive Learning to Improve Two-Way Reasoning in Large Language Models: The Obfuscation Task as a Case Study | Proposes a contrastive-learning fine-tuning method that improves the two-way reasoning of large language models on the code obfuscation task | contrastive learning, large language model | | |
| 8 | A Survey of the State-of-the-Art in Conversational Question Answering Systems | Survey of state-of-the-art advances and future directions in conversational question answering systems | reinforcement learning, contrastive learning, large language model | | |
| 9 | LatinX: Aligning a Multilingual TTS Model with Direct Preference Optimization | LatinX: aligns a multilingual TTS model with direct preference optimization to enable cross-lingual voice cloning | DPO, direct preference optimization | | |
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | One-Line Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | AntiDote: Bi-level Adversarial Training for Tamper-Resistant LLMs | AntiDote: a bi-level adversarial training method for tamper-resistant large language models | manipulation, large language model | | |