cs.CL(2024-08-06)

📊 共 23 篇论文 | 🔗 6 篇有代码

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (18 🔗5) 支柱二:RL算法与架构 (RL & Architecture) (5 🔗1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (18 篇)

#题目一句话要点标签🔗
1 Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons 通过知识神经元揭示大语言模型的事实性知识回忆行为 large language model chain-of-thought
2 Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement 提出WIDEN方法,通过权重解耦实现微调与预训练大语言模型的有效融合 large language model instruction following
3 StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation StructEval:通过结构化评估加深和拓宽大型语言模型评估 large language model
4 Evaluating the Translation Performance of Large Language Models Based on Euas-20 构建Euas-20数据集,评估大型语言模型在机器翻译任务中的性能 large language model
5 Citekit: A Modular Toolkit for Large Language Model Citation Generation Citekit:一个模块化工具包,用于大语言模型引文生成。 large language model
6 500xCompressor: Generalized Prompt Compression for Large Language Models 提出500xCompressor,实现大语言模型Prompt超高压缩比且无需微调 large language model
7 Towards an Analysis of Discourse and Interactional Pragmatic Reasoning Capabilities of Large Language Models 综述性分析:大型语言模型在语篇和互动语用推理能力上的研究进展 large language model
8 Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs Fact Finder:融合知识图谱增强大语言模型在特定领域的专业知识 large language model
9 Leveraging Inter-Chunk Interactions for Enhanced Retrieval in Large Language Model-Based Question Answering 提出IIER框架,利用块间交互增强大语言模型问答中的检索效果 large language model
10 EC-Guide: A Comprehensive E-Commerce Guide for Instruction Tuning and Quantization EC-Guide:面向电商场景,用于指令微调和量化LLM的综合指南 large language model chain-of-thought
11 Accuracy and Consistency of LLMs in the Registered Dietitian Exam: The Impact of Prompt Engineering and Knowledge Retrieval 评估大型语言模型在注册营养师考试中的准确性和一致性,并分析提示工程和知识检索的影响。 large language model chain-of-thought
12 Mitigating Hallucinations in Large Vision-Language Models (LVLMs) via Language-Contrastive Decoding (LCD) 提出语言对比解码(LCD)算法,有效缓解大型视觉语言模型(LVLM)中的幻觉问题 large language model
13 Non-Determinism of "Deterministic" LLM Settings 揭示“确定性”LLM设置下的非确定性问题,并提出量化指标 large language model
14 Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi 针对马拉地语等低资源文本分类,探索参数高效微调方法以提升模型训练效率。 large language model
15 Topic Modeling with Fine-tuning LLMs and Bag of Sentences 提出FT-Topic方法以改进主题建模效果 large language model
16 KnowPO: Knowledge-aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models 提出KnowPO,通过知识偏好优化解决RAG中可控知识选择问题。 large language model
17 OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs OpenFactCheck:用于大语言模型事实性评估的统一框架 large language model
18 LLM-based MOFs Synthesis Condition Extraction using Few-Shot Demonstrations 提出基于Few-Shot LLM的MOFs合成条件提取方法,显著提升提取性能和材料设计质量。 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
19 ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning 提出ULLME框架以提升大语言模型的文本嵌入能力 representation learning large language model
20 Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations 综述大型语言模型推理优化技术,分析其影响、挑战与实践考量 distillation large language model
21 Intermediate direct preference optimization 提出中间层直接偏好优化(Intermediate DPO)方法,提升大型语言模型微调效果 DPO direct preference optimization large language model
22 Empathy Level Alignment via Reinforcement Learning for Empathetic Response Generation 提出EmpRL框架,通过强化学习对齐生成回复中的共情水平,提升共情对话质量。 reinforcement learning
23 Synthesizing Text-to-SQL Data from Weak and Strong LLMs 结合强弱LLM合成数据,SENSE模型显著提升Text-to-SQL任务性能 preference learning large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页