| 1 |
Caution for the Environment: Multimodal LLM Agents are Susceptible to Environmental Distractions |
揭示多模态LLM智能体在GUI环境中易受环境干扰的问题 |
generalist agent large language model multimodal |
|
|
| 2 |
Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding |
提出基于约束链式思考解码的对话本体关系抽取方法,提升泛化能力。 |
large language model chain-of-thought |
|
|
| 3 |
Large Model Strategic Thinking, Small Model Efficiency: Transferring Theory of Mind in Large Language Models |
利用大模型思维,提升小模型效率:迁移大语言模型的心理理论 |
large language model |
|
|
| 4 |
XMainframe: A Large Language Model for Mainframe Modernization |
XMainframe:用于大型机现代化的专用大型语言模型 |
large language model |
|
|
| 5 |
SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models |
提出SEAS框架,通过自进化对抗安全优化提升大语言模型安全性 |
large language model |
✅ |
|
| 6 |
A Few-Shot Approach for Relation Extraction Domain Adaptation using Large Language Models |
提出一种基于大语言模型的小样本关系抽取领域自适应方法,用于科学知识图谱构建。 |
large language model |
|
|
| 7 |
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model |
提出UnifiedMLLM以解决多模态任务统一表示问题 |
large language model |
✅ |
|
| 8 |
Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models |
研究表明:格式限制显著降低大语言模型在推理任务中的性能 |
large language model |
|
|
| 9 |
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models |
揭示定制大语言模型中的提示词泄露风险并提出防御策略 |
large language model |
✅ |
|
| 10 |
Pula: Training Large Language Models for Setswana |
Pula:训练用于塞茨瓦纳语的大型语言模型,性能超越GPT-4o和Gemini 1.5 Pro |
large language model |
|
|
| 11 |
Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings |
对比研究LLM在低资源语言环境下的表现,揭示其性能差异 |
large language model |
|
|
| 12 |
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation |
RAG Foundry:用于增强LLM的检索增强生成开源框架 |
large language model |
✅ |
|
| 13 |
Winning Amazon KDD Cup'24 |
针对在线购物场景,提出基于Qwen2-72B微调和数据增强的LLM智能助手方案,赢得Amazon KDD Cup'24全部任务冠军。 |
large language model |
|
|
| 14 |
CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs |
CodeACT:面向代码大模型的代码自适应计算高效微调框架 |
large language model |
|
|
| 15 |
Leveraging the Power of LLMs: A Fine-Tuning Approach for High-Quality Aspect-Based Summarization |
通过微调LLM,本文提出了一种高质量的基于方面的情感摘要生成方法。 |
large language model |
|
|
| 16 |
Long Input Benchmark for Russian Analysis |
LIBRA:面向俄语分析的长文本输入基准评测,促进长文本理解能力评估 |
large language model |
|
|
| 17 |
MaterioMiner -- An ontology-based text mining dataset for extraction of process-structure-property entities |
提出MaterioMiner数据集,用于材料科学领域过程-结构-性质实体抽取。 |
large language model |
|
|
| 18 |
LLM economicus? Mapping the Behavioral Biases of LLMs via Utility Theory |
利用效用理论评估大语言模型的行为偏差,揭示其经济决策非完全理性或类人 |
large language model |
|
|
| 19 |
The Mechanics of Conceptual Interpretation in GPT Models: Interpretative Insights |
提出概念编辑方法,揭示GPT模型中概念理解的机制 |
large language model |
|
|
| 20 |
ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems |
ReDel:一个支持LLM驱动的递归多智能体系统工具包,用于灵活的任务委派和组织。 |
large language model |
✅ |
|