| 1 |
Learning Dynamics in Continual Pre-Training for Large Language Models |
Proposes a CPT scaling law to optimize continual pre-training of large language models |
large language model, foundation model |
|
|
| 2 |
Reassessing Large Language Model Boolean Query Generation for Systematic Reviews |
Proposes an improved LLM-based Boolean query generation method for systematic reviews |
large language model, chain-of-thought |
|
|
| 3 |
EmoMeta: A Multimodal Dataset for Fine-grained Emotion Classification in Chinese Metaphors |
Proposes the EmoMeta dataset to address emotion classification in Chinese metaphors |
multimodal |
✅ |
|
| 4 |
Large Language Models and Arabic Content: A Review |
Reviews the applications and challenges of large language models in Arabic content processing |
large language model |
|
|
| 5 |
Characterizing the Investigative Methods of Fictional Detectives with Large Language Models |
Proposes an AI-driven method for systematically analyzing the investigative methods of fictional detectives |
large language model |
|
|
| 6 |
On the Superimposed Noise Accumulation Problem in Sequential Knowledge Editing of Large Language Models |
Proposes DeltaEdit to address the noise accumulation problem in large language models |
large language model |
|
|
| 7 |
SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models |
Proposes SAS-Bench to address fine-grained evaluation in short answer scoring |
large language model |
|
|
| 8 |
ViMRHP: A Vietnamese Benchmark Dataset for Multimodal Review Helpfulness Prediction via Human-AI Collaborative Annotation |
Proposes the ViMRHP dataset to address Vietnamese multimodal review helpfulness prediction |
multimodal |
✅ |
|
| 9 |
One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models |
Proposes D-STT to address the balance between safety and usability in large language models |
large language model |
|
|
| 10 |
Spoken Language Understanding on Unseen Tasks With In-Context Learning |
Proposes a task-agnostic fine-tuning method with randomized class labels to improve SLU performance |
large language model |
|
|
| 11 |
Re$^2$: A Consistency-ensured Dataset for Full-stage Peer Review and Multi-turn Rebuttal Discussions |
Proposes the Re^2 dataset to address data scarcity in peer review and rebuttal discussions |
large language model |
|
|
| 12 |
OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit |
Proposes OnPrem.LLM to address privacy concerns in processing sensitive data |
large language model |
|
|
| 13 |
Semantic Retention and Extreme Compression in LLMs: Can We Have Both? |
Proposes joint pruning and quantization to improve the compression performance of large language models |
large language model |
|
|
| 14 |
Are LLMs complicated ethical dilemma analyzers? |
Proposes an ethical dilemma benchmark dataset to evaluate the ethical reasoning ability of large language models |
large language model |
|
|
| 15 |
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning |
Proposes FalseReject to address over-refusal in large language models |
large language model |
|
|
| 16 |
Benchmarking Retrieval-Augmented Generation for Chemistry |
Proposes ChemRAG-Bench to evaluate retrieval-augmented generation methods in chemistry |
large language model |
|
|
| 17 |
Concept-Level Explainability for Auditing & Steering LLM Responses |
Proposes ConceptX to address the explainability of large language model responses |
large language model |
|
|
| 18 |
ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution |
Proposes ToolACE-DEV to address self-improvement in tool learning |
large language model |
|
|
| 19 |
QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines |
Proposes QUPID to improve relevance assessment in Korean search engines |
large language model |
|
|
| 20 |
Domain Regeneration: How well do LLMs match syntactic properties of text domains? |
Examines how effectively large language models match the syntactic properties of text domains |
large language model |
|
|
| 21 |
JobHop: A Large-Scale Dataset of Career Trajectories |
Proposes the JobHop dataset to address career trajectory analysis |
large language model |
|
|
| 22 |
Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs |
Proposes the SENATOR framework to address knowledge deficiencies in large language models |
large language model |
✅ |
|
| 23 |
HAMLET: Healthcare-focused Adaptive Multilingual Learning Embedding-based Topic Modeling |
Proposes HAMLET to address multilingual topic modeling in healthcare |
large language model |
|
|