cs.CL（2024-09-17）

📊 共 41 篇论文 | 🔗 6 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (36 🔗5) 支柱二：RL算法与架构 (RL & Architecture) (4) 支柱四：生成式动作 (Generative Motion) (1 🔗1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (36 篇)

#	题目	一句话要点	标签	🔗
1	CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration	提出CoCA，通过宪法校准恢复多模态大语言模型对恶意视觉输入的安全性感知。	large language model multimodal
2	NVLM: Open Frontier-Class Multimodal LLMs	NVLM 1.0：媲美GPT-4o的前沿多模态大语言模型，提升文本性能并开源	large language model multimodal	✅
3	Chain-of-Thought Prompting for Speech Translation	提出基于思维链提示的语音翻译方法，显著提升Speech-LLM的翻译性能	large language model chain-of-thought
4	Large Language Models are Good Multi-lingual Learners : When LLMs Meet Cross-lingual Prompts	提出MLPrompt多语言提示方法，提升LLM在复杂规则下的推理和理解能力	large language model chain-of-thought
5	Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant	大规模语言模型量化方法的全面评估：模型大小、任务难度与性能权衡	large language model instruction following
6	Enriching Datasets with Demographics through Large Language Models: What's in a Name?	利用大型语言模型进行人口统计信息推断，提升数据集质量	large language model
7	THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models	THaMES：用于大规模语言模型幻觉缓解与评估的端到端工具	large language model
8	Task Arithmetic for Language Expansion in Speech Translation	提出增强型任务算术方法，用于语音翻译中的语言扩展，无需重新训练。	large language model foundation model multimodal
9	LOLA -- An Open-Source Massively Multilingual Large Language Model	LOLA：一个开源的大规模多语言大型语言模型	large language model
10	The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives	提出基于多智能体生成式AI的动态多模态叙事教育工具	multimodal
11	Evaluating the Impact of Compression Techniques on Task-Specific Performance of Large Language Models	评估压缩技术对大语言模型任务性能的影响，强调校准数据和评估指标的重要性	large language model
12	Self-Evolutionary Large Language Models through Uncertainty-Enhanced Preference Optimization	提出不确定性增强偏好优化(UPO)，提升LLM自进化性能	large language model
13	Strategic Insights in Human and Large Language Model Tactics at Word Guessing Games	分析人类与大语言模型在猜词游戏中的策略，揭示模型在多语言环境下的挑战。	large language model
14	KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models	KVPruner：通过结构化剪枝加速并降低大语言模型的内存占用	large language model
15	Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models	提出Typhoon-Audio模型，提升语音语言模型在低资源语言和指令跟随方面的能力	instruction following
16	Enhancing Code-switched Text-to-Speech Synthesis Capability in Large Language Models with only Monolingual Corpora	提出CS-LLM，仅用单语语料提升大语言模型在混合语文本转语音合成中的能力	large language model
17	Investigating Context-Faithfulness in Large Language Models: The Roles of Memory Strength and Evidence Style	研究记忆强度和证据风格对大语言模型上下文忠实度的影响	large language model	✅
18	A Unified Framework to Classify Business Activities into International Standard Industrial Classification through Large Language Models for Circular Economy	利用大型语言模型将商业活动分类到国际标准产业分类，促进循环经济发展。	large language model
19	Adaptive Large Language Models By Layerwise Attention Shortcuts	提出层级注意力捷径，用于自适应大型语言模型计算	large language model
20	Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement	提出基于迭代优化的多样性数据选择方法，提升LLM微调效果	large language model instruction following	✅
21	Surveying the MLLM Landscape: A Meta-Review of Current Surveys	MLLM综述的元综述：系统性回顾多模态大语言模型评测方法与未来方向	large language model multimodal
22	Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs	提出TRIM方法，通过CLIP度量进行token缩减，提升多模态LLM效率。	large language model multimodal
23	CREAM: Comparison-Based Reference-Free ELO-Ranked Automatic Evaluation for Meeting Summarization	提出CREAM，一种基于比较和ELO排序的免参考会议摘要自动评估方法	large language model chain-of-thought
24	Towards No-Code Programming of Cobots: Experiments with Code Synthesis by Large Code Models for Conversational Programming	探索基于大型代码模型的对话式编程，实现协作机器人免代码编程	large language model
25	Watch Your Steps: Observable and Modular Chains of Thought	提出程序追踪提示，增强CoT的可观测性和模块化，解决非局部错误问题。	chain-of-thought
26	Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs	小语言模型在短篇创意写作中超越人类：SLM与人类及LLM的对比研究	large language model
27	Egalitarian Language Representation in Language Models: It All Begins with Tokenizers	提出GPE，提升语言模型分词器对复杂文字的公平表征	large language model
28	Multi-Document Grounded Multi-Turn Synthetic Dialog Generation	提出一种多文档驱动的多轮合成对话生成技术，提升模型在文档型对话任务上的性能。	chain-of-thought
29	Says Who? Effective Zero-Shot Annotation of Focalization	利用大型语言模型实现叙事焦点零样本标注，性能媲美人工标注。	large language model
30	Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning	提出RGER：通过推理图增强的范例检索提升上下文学习效果	large language model	✅
31	SC-Phi2: A Fine-tuned Small Language Model for StarCraft II Macromanagement Tasks	SC-Phi2：微调的小型语言模型用于星际争霸II的宏观管理任务	large language model
32	Diversity-grounded Channel Prototypical Learning for Out-of-Distribution Intent Detection	提出多样性引导的通道原型学习以解决分布外意图检测问题	large language model
33	DynamicNER: A Dynamic, Multilingual, and Fine-Grained Dataset for LLM-based Named Entity Recognition	提出DynamicNER数据集，用于评估LLM在动态、多语言和细粒度命名实体识别中的能力。	large language model	✅
34	Propulsion: Steering LLM with Tiny Fine-Tuning	Propulsion：通过微调缩放LLM特定维度，实现高效任务引导。	large language model
35	Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction	提出Attention-Seeker以解决无监督关键短语提取问题	large language model
36	Efficient and Personalized Mobile Health Event Prediction via Small Language Models	利用小型语言模型实现高效且个性化的移动健康事件预测	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (4 篇)

#	题目	一句话要点	标签
37	Leveraging Distillation Techniques for Document Understanding: A Case Study with FLAN-T5	提出基于蒸馏的文档理解方法，利用FLAN-T5提升文档处理效率。	curriculum learning distillation large language model
38	Bio-Inspired Mamba: Temporal Locality and Bioplausible Learning in Selective State Space Models	提出Bio-Inspired Mamba，融合生物学习原则的在线选择性状态空间模型	Mamba state space model
39	REAL: Response Embedding-based Alignment for LLMs	REAL：基于响应嵌入对齐LLM，提升标注效率与模型性能。	RLHF DPO direct preference optimization
40	LLM-as-a-Judge & Reward Model: What They Can and Cannot Do	分析LLM作为评判者和奖励模型的局限性，揭示其在多语言、事实核查和复杂推理上的不足	reinforcement learning large language model

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
41	BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation	提出双向自回归扩散模型BAD，用于提升文本到动作生成效果	text-to-motion motion generation	✅

⬅️ 返回 cs.CL 首页 · 🏠 返回主页

cs.CL（2024-09-17）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (36 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (4 篇)

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理