cs.CL（2024-06-20）

📊 共 46 篇论文 | 🔗 5 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (39 🔗5) 支柱二：RL算法与架构 (RL & Architecture) (6) 支柱一：机器人控制 (Robot Control) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (39 篇)

#	题目	一句话要点	标签	🔗
1	Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities	提出Whiteboard-of-Thought提示方法，提升多模态大语言模型在视觉推理任务上的性能	large language model multimodal chain-of-thought
2	QuST-LLM: Integrating Large Language Models for Comprehensive Spatial Transcriptomics Analysis	QuST-LLM：集成大语言模型以实现全面的空间转录组学分析	large language model	✅
3	Large Language Models are Skeptics: False Negative Problem of Input-conflicting Hallucination	揭示大语言模型中的“虚假否定”偏见，缓解输入冲突导致的幻觉问题	large language model
4	Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks	提出基于微调LLM的RAG框架，提升隐式关系抽取性能	large language model
5	MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate	提出多智能体协作攻击方法，研究辩论场景下LLM协作的对抗攻击	large language model
6	GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models	提出GraphReader，通过图结构化长文本增强大语言模型的长程上下文处理能力	large language model
7	Evidence of a log scaling law for political persuasion with large language models	研究表明：大型语言模型政治说服力遵循对数比例定律，边际效益递减	large language model
8	Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing	提出GETA：一种基于生成式演进测试的大语言模型价值观评估方法	large language model
9	Using Game Play to Investigate Multimodal and Conversational Grounding in Large Multimodal Models	提出基于游戏交互的多模态大型模型评估方法，用于评估视觉表征和对话对齐能力。	multimodal
10	Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective	提出心理测量学攻击方法，评估大型语言模型中的隐性偏见	large language model	✅
11	1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators?	提出一种跨语言知识聚合方法，提升大语言模型的多语言一致性与性能。	large language model
12	Improving Expert Radiology Report Summarization by Prompting Large Language Models with a Layperson Summary	提出基于通俗化摘要提示的大语言模型，提升放射科报告总结的准确性和可访问性	large language model
13	On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning	研究表明，具备思维链推理的神经语言模型可表示概率图灵机所能表示的字符串分布族。	chain-of-thought
14	Aligning Large Language Models with Diverse Political Viewpoints	通过政治观点对齐，提升大语言模型在政治信息处理中的准确性和公正性	large language model
15	Take the essence and discard the dross: A Rethinking on Data Selection for Fine-Tuning Large Language Models	提出LLM微调数据选择三阶段框架，并统一评估标准，揭示方法优劣与未来挑战。	large language model
16	Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation	提出推理时去污方法ITD，解决大语言模型benchmark泄露导致的性能虚高问题	large language model
17	AutoCAP: Towards Automatic Cross-lingual Alignment Planning for Zero-shot Chain-of-Thought	提出AutoCAP，实现零样本思维链跨语言对齐的自动规划	chain-of-thought
18	Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data	提出通过隐性推理解决大型语言模型知识监控问题	large language model chain-of-thought
19	MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs	提出MR-Ben基准，用于评估LLM的System-2思维和元推理能力	large language model chain-of-thought
20	Exploring Design Choices for Building Language-Specific LLMs	探索构建特定语言LLM的设计选择，提升低资源语言性能	large language model
21	OpenDebateEvidence: A Massive-Scale Argument Mining and Summarization Dataset	提出OpenDebateEvidence数据集，用于论证挖掘和摘要生成，助力辩论领域研究。	large language model	✅
22	Model Merging and Safety Alignment: One Bad Model Spoils the Bunch	提出安全对齐的模型合并方法，解决LLM合并过程中的不对齐问题。	large language model
23	Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning	针对LLM微调，提出用户级别差分隐私方法以保障用户隐私	large language model
24	Towards Understanding Safety Alignment: A Mechanistic Perspective from Safety Neurons	通过识别安全神经元，解析大语言模型安全对齐的内在机制	large language model	✅
25	Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics	提出TRAIT以评估大型语言模型的个性特征	large language model
26	Exploring Spatial Representations in the Historical Lake District Texts with LLM-based Relation Extraction	利用LLM关系抽取探索历史湖区文本中的空间表征	large language model
27	Leveraging LLMs for Bangla Grammar Error Correction:Error Categorization, Synthetic Data, and Model Evaluation	利用LLM提升孟加拉语语法纠错：错误分类、数据合成与模型评估	large language model
28	Step-Back Profiling: Distilling User History for Personalized Scientific Writing	提出STEP-BACK PROFILING以解决个性化科学写作问题	large language model	✅
29	How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics	通过会话游戏自博弈评估LLM，研究参数量、训练方式等因素对性能的影响	large language model
30	Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at Trivial Alterations to the False Belief Task?	SCALPEL方法剖析LLM在信念推理任务中的失效原因	large language model
31	Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell	揭示LLM长文本失效机制：Transformer模型知而不言现象研究	large language model
32	An Analysis of Multilingual FActScore	分析多语言环境下的FActScore，并提出知识源缓解策略以提升跨语言的事实性评估。	large language model
33	Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation	综述：揭示语言模型中数据污染的全貌——从检测到修复	large language model
34	Modeling Human Subjectivity in LLMs Using Explicit and Implicit Human Factors in Personas	利用显性和隐性人类因素构建LLM中的主观性建模	large language model
35	Selected Languages are All You Need for Cross-lingual Truthfulness Transfer	提出FaMSS，通过选择性语言协同提升跨语言大语言模型的真实性。	large language model
36	Definition generation for lexical semantic change detection	提出基于LLM生成定义的词义表示方法，用于词汇语义随时间变化检测。	large language model
37	An Investigation of Prompt Variations for Zero-shot LLM-based Rankers	探究Prompt变体对零样本LLM排序器性能的影响，揭示Prompt工程的重要性	large language model
38	Prompt Injection Attacks in Defended Systems	研究防御系统中Prompt注入攻击的黑盒方法，揭示潜在安全风险	large language model
39	Persuasiveness of Generated Free-Text Rationales in Subjective Decisions: A Case Study on Pairwise Argument Ranking	研究表明，开源LLM在生成论证理由方面更具说服力，尤其是在成对论证排序任务中。	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (6 篇)

#	题目	一句话要点	标签
40	Factual Dialogue Summarization via Learning from Large Language Models	提出基于大语言模型知识蒸馏的对话摘要方法，提升小模型的factual consistency。	contrastive learning distillation large language model
41	Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs	提出基于知识图谱的规划数据，提升检索增强大语言模型在复杂问答中的规划能力	distillation large language model
42	GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models	提出GenderAlign数据集，用于缓解大型语言模型中的性别偏见。	RLHF large language model
43	Investigating Mysteries of CoT-Augmented Distillation	探究CoT增强蒸馏的内在机理，发现关键token足以提升学生模型性能	distillation chain-of-thought
44	A Contrastive Learning Approach to Mitigate Bias in Speech Models	提出一种基于对比学习的语音模型偏见缓解方法，提升弱势群体表现。	contrastive learning
45	Learning to Retrieve Iteratively for In-Context Learning	提出迭代检索框架，通过策略优化提升上下文学习的检索效果	reinforcement learning large language model

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
46	Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems	提出零知识模式推断攻击，揭示Text-to-SQL系统中数据库模式泄露漏洞	manipulation

⬅️ 返回 cs.CL 首页 · 🏠 返回主页

cs.CL（2024-06-20）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (39 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (6 篇)

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理