| 1 |
Cedalion Tutorial: A Python-based framework for comprehensive analysis of multimodal fNIRS & DOT from the lab to the everyday world |
Cedalion:一个用于全面分析多模态fNIRS/DOT数据的Python框架 |
multimodal |
|
|
| 2 |
Crisis-Bench: Benchmarking Strategic Ambiguity and Reputation Management in Large Language Models |
Crisis-Bench:评估大语言模型在危机公关中的战略模糊与声誉管理能力 |
large language model |
|
|
| 3 |
Understanding LLM-Driven Test Oracle Generation |
利用大语言模型生成测试预言,解决软件测试中的预言问题 |
large language model foundation model |
|
|
| 4 |
ART: Adaptive Reasoning Trees for Explainable Claim Verification |
提出自适应推理树ART,用于可解释的声明验证 |
large language model chain-of-thought |
|
|
| 5 |
Safety Not Found (404): Hidden Risks of LLM-Based Robotics Decision Making |
揭示LLM机器人决策的潜在风险:安全关键场景下的灾难性错误 |
large language model |
|
|
| 6 |
Explainable AI: Learning from the Learners |
结合可解释AI与因果推理,从AI学习者中提取知识 |
foundation model |
|
|
| 7 |
Can AI mediation improve democratic deliberation? |
探讨AI调解能否提升民主审议质量,关注LLM在促进共识中的作用 |
large language model |
|
|
| 8 |
Decoding Workload and Agreement From EEG During Spoken Dialogue With Conversational AI |
探索脑机接口在人机对话中的应用:利用脑电信号解码工作负荷与一致性 |
large language model |
|
|
| 9 |
DynaDebate: Breaking Homogeneity in Multi-Agent Debate with Dynamic Path Generation |
DynaDebate:动态路径生成的多智能体辩论框架,打破同质化推理 |
large language model |
|
|
| 10 |
Logic-Parametric Neuro-Symbolic NLI: Controlling Logical Formalisms for Verifiable LLM Reasoning |
提出逻辑可控的神经符号自然语言推理框架,提升LLM推理的鲁棒性和适应性 |
large language model |
|
|
| 11 |
RISE: Rule-Driven SQL Dialect Translation via Query Reduction |
RISE:通过查询简化实现规则驱动的SQL方言翻译 |
large language model |
|
|
| 12 |
The Evaluation Gap in Medicine, AI and LLMs: Navigating Elusive Ground Truth & Uncertainty via a Probabilistic Paradigm |
提出基于概率范式的评估方法,解决医学AI和LLM中ground truth不确定性问题 |
large language model |
|
|
| 13 |
STELP: Secure Transpilation and Execution of LLM-Generated Programs |
STELP:安全转译与执行LLM生成代码,保障AI系统安全 |
large language model |
|
|