| 1 |
Understanding Reasoning in Chain-of-Thought from the Hopfieldian View |
从Hopfield网络视角理解Chain-of-Thought推理,提升鲁棒性与可解释性 |
large language model chain-of-thought |
|
|
| 2 |
Gradient-based Jailbreak Images for Multimodal Fusion Models |
提出基于梯度优化的图像Jailbreak攻击,突破多模态融合模型的防御。 |
multimodal |
|
|
| 3 |
Towards a Benchmark for Large Language Models for Business Process Management Tasks |
构建面向业务流程管理任务的大语言模型基准评测 |
large language model |
|
|
| 4 |
Enriching Ontologies with Disjointness Axioms using Large Language Models |
利用大型语言模型补全本体中类的不相交公理,提升知识图谱推理能力 |
large language model |
✅ |
|
| 5 |
Image First or Text First? Optimising the Sequencing of Modalities in Large Language Model Prompting and Reasoning Tasks |
研究多模态提示中图文顺序对大语言模型推理性能的影响 |
large language model |
|
|
| 6 |
TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation |
TICK:通过生成式检查清单改进LLM评估与生成 |
large language model instruction following |
|
|
| 7 |
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search |
DOTS:通过最优推理轨迹搜索,使LLM具备动态推理能力 |
large language model |
|
|
| 8 |
On Uncertainty In Natural Language Processing |
研究自然语言处理中的不确定性,并提出校准抽样和置信度量化方法。 |
large language model |
|
|
| 9 |
ProcBench: Benchmark for Multi-Step Reasoning and Following Procedure |
提出ProcBench基准以评估多步骤推理能力 |
large language model |
✅ |
|
| 10 |
Towards Assuring EU AI Act Compliance and Adversarial Robustness of LLMs |
提出基于本体、保障案例和要素表的框架,增强LLM的欧盟AI法案合规性和对抗鲁棒性。 |
large language model |
|
|
| 11 |
Developing Assurance Cases for Adversarial Robustness and Regulatory Compliance in LLMs |
提出LLM对抗鲁棒性保障框架,应对恶意攻击并满足法规遵从 |
large language model |
|
|
| 12 |
GraphRouter: A Graph-based Router for LLM Selections |
GraphRouter:一种基于图的LLM选择路由方法,提升性能并降低成本。 |
large language model |
✅ |
|