| 1 |
Theory of Mind in Large Language Models: Assessment and Enhancement |
Surveys the theory-of-mind capabilities of LLMs: an analysis of evaluation benchmarks and enhancement strategies |
large language model |
|
|
| 2 |
Detect, Explain, Escalate: Sustainable Dialogue Breakdown Management for LLM Agents |
Proposes a "Detect, Explain, Escalate" framework to address dialogue breakdowns |
large language model chain-of-thought |
|
|
| 3 |
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks |
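Generalizable evaluation in the LLM era: a survey of evaluation frameworks beyond benchmarks, focusing on capability assessment and automation |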
large language model instruction following |
|
|
| 4 |
Graph of Attacks: Improved Black-Box and Interpretable Jailbreaks for LLMs |
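Proposes GoAT, which uses a graph of attacks to improve the effectiveness and interpretability of black-box jailbreak attacks on LLMs |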
large language model |
✅ |
|
| 5 |
LINC: Supporting Language Independent Communication and Comprehension to Enhance Contribution in Multilingual Collaborative Meetings |
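LINC: supports language-independent communication and comprehension to increase contribution in multilingual collaborative meetings |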
multimodal |
|
|
| 6 |
RAIR: Retrieval-Augmented Iterative Refinement for Chinese Spelling Correction |
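Proposes the RAIR framework, which strengthens LLM handling of domain-specific terms and variable-length corrections in Chinese spelling correction |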
large language model |
|
|