cs.CL（2024-12-17）

📊 共 30 篇论文 | 🔗 5 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (25 🔗5) 支柱二：RL算法与架构 (RL & Architecture) (4) 支柱六：视频提取与匹配 (Video Extraction) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (25 篇)

#	题目	一句话要点	标签	🔗
1	Question: How do Large Language Models perform on the Question Answering tasks? Answer:	对比研究：大型语言模型在问答任务中的表现及单次推理提示优化	large language model instruction following
2	DateLogicQA: Benchmarking Temporal Biases in Large Language Models	提出DateLogicQA基准测试，用于评估大型语言模型中的时间推理偏差。	large language model
3	Refining Answer Distributions for Improved Large Language Model Reasoning	提出精炼答案分布法，提升大语言模型推理能力	large language model
4	Compressed Chain of Thought: Efficient Reasoning Through Dense Representations	提出压缩思维链（CCoT），通过稠密表示提升语言模型推理效率。	chain-of-thought
5	Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study	研究大型语言模型在生成德国公众意见中的算法忠实性	large language model
6	CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval	CLASP：用于多语言多模态信息检索的对比语言-语音预训练	multimodal
7	RCLMuFN: Relational Context Learning and Multiplex Fusion Network for Multimodal Sarcasm Detection	提出RCLMuFN模型，通过关系上下文学习和多路复用融合提升多模态讽刺检测性能。	multimodal
8	SnakModel: Lessons Learned from Training an Open Danish Large Language Model	SnakModel：基于Llama2-7B的丹麦语大语言模型训练与优化实践	large language model
9	DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models	提出DSGram框架，利用动态权重子指标提升大语言模型时代语法纠错评估的有效性	large language model
10	RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement	RAG-Star：通过检索增强的验证与改进提升LLM的审慎推理能力	large language model chain-of-thought
11	An Automated Explainable Educational Assessment System Built on LLMs	AERA Chat：基于LLM的自动化、可解释教育评估系统	large language model
12	Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation	提出多阶段参数高效微调方法，扩展Llama模型至波斯语	large language model
13	Training Dynamics of a 1.7B LLaMa Model: A Data-Efficient Approach	训练17亿参数LLaMa模型：一种数据高效的方法	large language model	✅
14	DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation	提出DnDScore，通过解构和去语境化提升长文本生成的事实性验证效果	large language model
15	Memory-Augmented Agent Training for Business Document Understanding	提出Matrix框架，通过记忆增强Agent训练提升LLM在商业文档理解中的性能	large language model
16	AI PERSONA: Towards Life-long Personalization of LLMs	提出AI Persona框架，实现大语言模型（LLM）的终身个性化	large language model
17	AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark	提出AIR-Bench：自动化异构信息检索评测基准，解决新兴领域评测难题	large language model	✅
18	NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation	提出NAVCON：一个认知启发且语言对齐的视觉语言导航语料库	VLN
19	Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health	提出IC-AnnoMI数据集，解决心理健康领域LLM数据稀缺和偏差问题	large language model
20	Adaptations of AI models for querying the LandMatrix database in natural language	利用AI模型适配LandMatrix数据库，实现自然语言查询	large language model	✅
21	Truthful Text Sanitization Guided by Inference Attacks	提出基于推理攻击指导的文本泛化脱敏方法，平衡隐私保护与效用	large language model
22	Benchmarking and Understanding Compositional Relational Reasoning of LLMs	提出GAR基准测试，用于评估和理解LLM的组合关系推理能力	large language model	✅
23	More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression	提出量化剪枝方法，优化KV缓存压缩中的Token-精度权衡，提升长文本LLM性能	large language model	✅
24	Trigger$^3$: Refining Query Correction via Adaptive Model Selector	提出Trigger$^3$，通过自适应模型选择优化查询纠错。	large language model
25	Exploring Cross-lingual Latent Transplantation: Mutual Opportunities and Open Challenges	提出跨语言隐空间迁移框架，提升LLM多语言能力和文化适应性	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (4 篇)

#	题目	一句话要点	标签
26	Preference-Oriented Supervised Fine-Tuning: Favoring Target Model Over Aligned Large Language Models	提出偏好导向的监督微调方法以提升模型性能	DPO large language model
27	Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models	提出Span-Expanded Attention，扩展混合状态空间模型的记忆范围，提升长序列建模能力。	SSM state space model
28	In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning	提出上下文学习蒸馏方法，实现高效小样本微调并显著压缩模型规模。	distillation
29	Detecting Emotional Incongruity of Sarcasm by Commonsense Reasoning	提出基于常识推理的情感不一致性检测框架EICR，用于识别反讽语句	contrastive learning large language model

🔬 支柱六：视频提取与匹配 (Video Extraction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
30	Revealing the impact of synthetic native samples and multi-tasking strategies in Hindi-English code-mixed humour and sarcasm detection	提出融合原生样本与多任务学习策略，提升印地语-英语混合语幽默与讽刺检测性能	HuMoR

⬅️ 返回 cs.CL 首页 · 🏠 返回主页

cs.CL（2024-12-17）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (25 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (4 篇)

🔬 支柱六：视频提取与匹配 (Video Extraction) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理