| 1 |
Countermind: A Multi-Layered Security Architecture for Large Language Models |
Countermind:一种用于大型语言模型的多层安全架构,旨在防御提示注入等攻击。 |
large language model multimodal |
|
|
| 2 |
Asking Clarifying Questions for Preference Elicitation With Large Language Models |
提出基于扩散模型的澄清问题生成方法,提升LLM偏好获取能力 |
large language model |
|
|
| 3 |
Beyond touch-based HMI: Control your machines in natural language by utilizing large language models and OPC UA |
提出基于LLM和OPC UA的自然语言人机交互方法,提升工业控制便捷性 |
large language model |
|
|
| 4 |
Automating Structural Engineering Workflows with Large Language Model Agents |
MASSE:基于LLM Agent的结构工程工作流自动化系统 |
large language model |
|
|
| 5 |
Diffusion-Link: Diffusion Probabilistic Model for Bridging the Audio-Text Modality Gap |
提出Diffusion-Link,通过扩散模型弥合音频-文本模态鸿沟,提升音频自动描述性能。 |
large language model multimodal |
✅ |
|
| 6 |
Analyzing and Internalizing Complex Policy Documents for LLM Agents |
提出CAP-CPT,通过类别感知的持续预训练,提升LLM Agent在复杂策略文档中的推理能力。 |
large language model chain-of-thought |
|
|
| 7 |
Improving AI Efficiency in Data Centres by Power Dynamic Response |
提出动态电源响应方法,提升AI数据中心能效与可持续性 |
large language model foundation model |
|
|
| 8 |
CTIArena: Benchmarking LLM Knowledge and Reasoning Across Heterogeneous Cyber Threat Intelligence |
CTIArena:构建知识增强型网络威胁情报LLM基准评测体系 |
large language model |
|
|
| 9 |
Beyond Consensus: Mitigating the Agreeableness Bias in LLM Judge Evaluations |
提出少数否决与回归模型,缓解LLM评判中的一致性偏差,提升代码评估精度。 |
large language model |
|
|
| 10 |
ParaCook: On Time-Efficient Planning for Multi-Agent Systems |
ParaCook:面向多智能体系统的时间效率型规划基准 |
large language model |
✅ |
|
| 11 |
Zero Data Retention in LLM-based Enterprise AI Assistants: A Comparative Study of Market Leading Agentic AI Products |
对比研究Salesforce和Microsoft的企业AI助手零数据保留策略 |
large language model |
|
|
| 12 |
Audio-Maestro: Enhancing Large Audio-Language Models with Tool-Augmented Reasoning |
Audio-Maestro:工具增强推理提升大型音频语言模型性能 |
multimodal |
|
|
| 13 |
Automated Skill Decomposition Meets Expert Ontologies: Bridging the Granularity Gap with LLMs |
提出基于LLM的技能自动分解框架,弥合技能粒度与专家知识体系之间的差距 |
large language model |
|
|
| 14 |
PADME: Procedure Aware DynaMic Execution |
PADME:提出程序感知动态执行框架,提升LLM在长流程任务中的可靠性。 |
large language model |
|
|
| 15 |
ProofFlow: A Dependency Graph Approach to Faithful Proof Autoformalization |
提出ProofFlow,通过依赖图提升定理证明自动形式化的语义忠实度。 |
large language model |
✅ |
|
| 16 |
Gelina: Unified Speech and Gesture Synthesis via Interleaved Token Prediction |
Gelina:提出一种基于交错Token预测的统一语音和手势合成框架 |
multimodal |
|
|