| 1 |
When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models |
提出CoT-Output安全矩阵以揭示多轮推理模型的失败模式 |
chain-of-thought |
|
|
| 2 |
Null-Space Constrained Low-Rank Adaptation for Response-Specified Large Language Model Unlearning |
提出Null-Space约束低秩适应以解决大语言模型的遗忘问题 |
large language model |
|
|
| 3 |
A History-Aware Visually Grounded Critic for Computer Use Agents |
提出HiViG框架以解决CUA决策短视与视觉缺失问题 |
multimodal visual grounding |
|
|
| 4 |
ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models |
提出ReasonAlloc以解决推理模型中的KV缓存预算分配问题 |
large language model chain-of-thought |
|
|
| 5 |
Towards Autonomous Accelerator Design: FPGA Accelerator Generation with SECDA |
提出SECDA-DSE框架以自动化FPGA加速器设计 |
large language model chain-of-thought |
|
|
| 6 |
Soul Computing: A Theoretical Framework and Technical Architecture for Intelligent Agents with Independent Consciousness |
提出灵魂计算框架以解决智能体独立意识构建问题 |
large language model multimodal |
|
|
| 7 |
Generative Explainability for Next-Generation Networks: LLM-Augmented XAI with Mutual Feature Interactions |
提出生成性可解释性框架以解决网络透明性问题 |
large language model |
|
|
| 8 |
A Constrained Natural-Language Interface for Variational Multi-Physics Finite Element Simulations in FEniCS |
提出受限自然语言接口以优化多物理场有限元模拟 |
large language model |
|
|
| 9 |
Piper: A Programmable Distributed Training System |
提出Piper以解决大规模模型训练的并行策略适应性问题 |
foundation model |
|
|
| 10 |
Flaws in the LLM Automation Narrative |
提出新基准任务以评估LLM在数据分析中的表现 |
large language model |
|
|
| 11 |
Superficial Beliefs in LLM Decision-Making |
提出对LLM决策中的表面信念的分析以揭示决策结构 |
large language model |
|
|
| 12 |
Structure from Reasoning, Numbers from Search: On-Premise Open LLMs as Structural Priors for Coupled MIMO Controller Tuning |
利用开放源代码大语言模型优化耦合MIMO控制器调优 |
large language model |
|
|
| 13 |
Mind the Gap: Can Frontier LLMs Pass a Standardized Office Proficiency Exam? |
提出基于NCRE的评估方法以测试前沿LLM在办公自动化中的能力 |
large language model |
|
|
| 14 |
Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution |
提出Role-Agent框架以解决LLM代理学习中的反馈效率问题 |
large language model |
|
|
| 15 |
Evaluating Research-Level Math Proofs via Strict Step-Level Verification |
提出严格逐步验证框架以解决数学证明评估问题 |
large language model |
|
|
| 16 |
Toward Secure LLM Agents: Threat Surfaces, Attacks, Defenses, and Evaluation |
提出安全LLM代理的综合框架以应对新兴威胁 |
large language model |
|
|
| 17 |
Decentralized Multi-Agent Systems with Shared Context |
提出去中心化语言模型以解决多智能体系统的协调瓶颈问题 |
large language model |
✅ |
|
| 18 |
ActiveMem: Distributed Active Memory for Long-Horizon LLM Reasoning |
提出ActiveMem以解决长时间推理任务中的记忆管理问题 |
large language model |
|
|
| 19 |
Decoupling Thought from Speech: Knowledge-Grounded Counterfactual Reasoning for Resilient Multi-Agent Argumentation |
提出知识基础的反事实推理以解决多智能体辩论中的稳定性问题 |
large language model |
|
|
| 20 |
STAGE-Claw: Automated State-based Agent Benchmarking for Realistic Scenarios |
提出STAGE-Claw框架以解决个人代理评估的挑战 |
large language model |
|
|
| 21 |
Beyond Static Evaluation: Co-Evolutionary Mechanisms for LLM-Driven Strategy Evolution in Adversarial Games |
提出共进化机制以解决对抗游戏中策略演化的评估挑战 |
foundation model |
✅ |
|
| 22 |
Atomic Intent Reasoning: Bringing LLM Semantics to Industrial Cross-Domain Recommendations |
提出AIR框架以解决跨域推荐中的语义差距问题 |
large language model |
|
|
| 23 |
Mobility Anomaly Generation using LLM-Driven Behavior with Kinematic Constraints |
提出基于LLM的运动异常生成框架以解决数据稀缺问题 |
large language model |
|
|
| 24 |
From Context-Aware to Conflict-Aware: Generalizing Contrastive Decoding for Knowledge Conflict in LLMs |
提出冲突感知解码方法以解决大语言模型中的知识冲突问题 |
large language model |
✅ |
|
| 25 |
Sim2Schedule: A Simulator-Guided LLM Framework for Autonomous Open-Pit Mine Scheduling |
提出Sim2Schedule框架以解决开放式矿山调度问题 |
large language model |
|
|