| 1 |
Keyword-Oriented Multimodal Modeling for Euphemism Identification |
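Proposes a keyword-oriented multimodal method for euphemism identification, addressing content moderation challenges on social media. |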
large language model multimodal |
|
|
| 2 |
Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection |
Proposes the NofT metric for task routing and adversarial prompt detection, improving LLM efficiency and safety. |
large language model chain-of-thought |
✅ |
|
| 3 |
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond |
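Surveys efficient reasoning: methods for improving reasoning efficiency in large reasoning models across language, multimodality, and beyond. |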
multimodal chain-of-thought |
|
|
| 4 |
UGen: Unified Autoregressive Multimodal Model with Progressive Vocabulary Learning |
UGen: a unified autoregressive multimodal model based on progressive vocabulary learning |
multimodal |
|
|
| 5 |
Boosting Large Language Models with Mask Fine-Tuning |
Proposes Mask Fine-Tuning, which significantly improves large language model performance through masked fine-tuning |
large language model |
|
|
| 6 |
Local Normalization Distortion and the Thermodynamic Formalism of Decoding Strategies for Large Language Models |
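Analyzes decoding strategies through the thermodynamic formalism, revealing the local normalization distortion problem in large language models |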
large language model |
|
|
| 7 |
SWI: Speaking with Intent in Large Language Models |
Proposes SWI: improving the reasoning and generation abilities of large language models through explicit intent |
large language model |
|
|
| 8 |
AutoPsyC: Automatic Recognition of Psychodynamic Conflicts from Semi-structured Interviews with Large Language Models |
AutoPsyC: using large language models to automatically recognize psychodynamic conflicts in semi-structured interviews |
large language model |
|
|
| 9 |
Navigating the Risks of Using Large Language Models for Text Annotation in Social Science Research |
Proposes a framework for LLM text annotation, assessing its risks and potential in social science research |
large language model |
|
|
| 10 |
JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models' Detection of Human Self-Destructive Behavior Content in Jirai Community |
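JiraiBench: a bilingual benchmark for evaluating large language models' ability to detect human self-destructive behavior content in the Jirai community |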
large language model |
|
|
| 11 |
Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach |
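Proposes a cross-model and semantic consistency approach for evaluating large language models' ability to generate book summaries from internal knowledge. |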
large language model |
|
|
| 12 |
OpenHuEval: Evaluating Large Language Model on Hungarian Specifics |
Proposes OpenHuEval, the first LLM evaluation benchmark targeting the Hungarian language and Hungarian-specific culture. |
large language model |
✅ |
|
| 13 |
OmniVox: Zero-Shot Emotion Recognition with Omni-LLMs |
OmniVox: zero-shot emotion recognition using omni-modal large language models |
large language model multimodal chain-of-thought |
|
|
| 14 |
Large Language Model Agent: A Survey on Methodology, Applications and Challenges |
Surveys the methodology, applications, and challenges of large language model agents |
large language model |
✅ |
|
| 15 |
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models |
Proposes OlymMATH, an Olympiad-level math benchmark that challenges the complex reasoning abilities of large language models |
large language model |
✅ |
|
| 16 |
From User Preferences to Optimization Constraints Using Large Language Models |
Uses large language models to translate user preferences into home energy optimization constraints |
large language model |
|
|
| 17 |
Leveraging Large Language Models for Risk Assessment in Hyperconnected Logistic Hub Network Deployment |
Uses large language models for risk assessment in hyperconnected logistic hub network deployment |
large language model |
|
|
| 18 |
ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models |
ThinkEdit: mitigating overly short thinking in reasoning models through interpretable weight editing |
large language model chain-of-thought |
✅ |
|
| 19 |
LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models |
Proposes LLaVA-CMoE, addressing catastrophic forgetting and parameter efficiency for LLMs in vision-language continual learning. |
large language model multimodal |
|
|
| 20 |
ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging |
ZJUKLAB proposes a model-merging-based method for unlearning sensitive content in LLMs, ranking second in SemEval-2025 Task 4. |
large language model |
✅ |
|
| 21 |
Effective Skill Unlearning through Intervention and Abstention |
Proposes an LLM skill unlearning method based on intervention and abstention that is training-free and efficient. |
large language model |
✅ |
|
| 22 |
How do language models learn facts? Dynamics, curricula and hallucinations |
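Studies the dynamics of how language models learn facts, revealing the staged nature of knowledge acquisition, the influence of data distribution, and hallucination phenomena. |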
large language model |
|
|
| 23 |
Shared Global and Local Geometry of Language Model Embeddings |
Reveals global and local geometric similarities among large language model embeddings and proposes a cross-model transfer method. |
large language model |
|
|
| 24 |
Cognitive Prompts Using Guilford's Structure of Intellect Model |
Uses Guilford's Structure of Intellect model to propose cognitive prompt engineering that improves LLM reasoning |
large language model |
|
|
| 25 |
RedditESS: A Mental Health Social Support Interaction Dataset -- Understanding Effective Social Support to Refine AI-Driven Support Tools |
Proposes the RedditESS dataset for improving the effectiveness of AI mental health support tools |
large language model |
|
|
| 26 |
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition |
Proposes ResearchBench for evaluating LLMs' inspiration-based task decomposition abilities in scientific discovery. |
large language model |
|
|
| 27 |
MSPLoRA: A Multi-Scale Pyramid Low-Rank Adaptation for Efficient Model Fine-Tuning |
MSPLoRA: multi-scale pyramid low-rank adaptation for more efficient model fine-tuning |
large language model |
✅ |
|
| 28 |
Debate-Driven Multi-Agent LLMs for Phishing Email Detection |
Proposes a debate-driven multi-agent LLM method for phishing email detection |
large language model |
|
|
| 29 |
Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad |
Evaluates LLMs' problem-solving ability on the 2025 USA Math Olympiad: proof or bluff? |
large language model |
|
|
| 30 |
R-PRM: Reasoning-Driven Process Reward Modeling |
Proposes R-PRM, a reasoning-driven process reward modeling method that improves the accuracy and efficiency of mathematical reasoning. |
large language model |
|
|
| 31 |
EmoDebt: Bayesian-Optimized Emotional Intelligence for Strategic Agent-to-Agent Debt Recovery |
EmoDebt: Bayesian-optimized emotional intelligence for agent-to-agent debt recovery |
large language model |
✅ |
|