| 1 |
Certainty-Guided Reasoning in Large Language Models: A Dynamic Thinking Budget Approach |
Proposes Certainty-Guided Reasoning (CGR) to improve the reasoning efficiency and reliability of large language models. |
large language model |
|
|
| 2 |
XML Prompting as Grammar-Constrained Interaction: Fixed-Point Semantics, Convergence Guarantees, and Human-AI Protocols |
Proposes a grammar-constrained interaction framework based on XML prompting that keeps LLM outputs structured and controllable. |
large language model |
|
|
| 3 |
Explicit Reasoning Makes Better Judges: A Systematic Study on Accuracy, Efficiency, and Robustness |
Shows that, on LLM judging tasks, explicit-reasoning models perform better in accuracy, efficiency, and robustness. |
large language model |
|
|
| 4 |
VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions |
VStyle: a benchmark for evaluating voice style adaptation driven by spoken instructions |
instruction following |
✅ |
|
| 5 |
Astra: A Multi-Agent System for GPU Kernel Performance Optimization |
Astra: a multi-agent approach to GPU kernel performance optimization |
large language model |
✅ |
|