| 1 |
EASE: Federated Multimodal Unlearning via Entanglement-Aware Anchor Closure |
EASE:通过解耦感知锚点闭合实现联邦多模态可遗忘学习 |
multimodal |
|
|
| 2 |
Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment |
提出解耦关系子空间对齐(DRSA)框架,提升异构图基础模型跨域知识迁移能力。 |
foundation model |
✅ |
|
| 3 |
Can Coding Agents Reproduce Findings in Computational Materials Science? |
AutoMat:评估LLM智能体在计算材料科学中重现科研结果能力的基准 |
large language model foundation model |
|
|
| 4 |
LLM-Oriented Information Retrieval: A Denoising-First Perspective |
提出面向LLM的信息检索框架,强调去噪以提升检索增强生成质量。 |
large language model multimodal |
|
|
| 5 |
Social Bias in LLM-Generated Code: Benchmark and Mitigation |
提出 Fairness Monitor Agent (FMA) 以缓解 LLM 生成代码中的社会偏见,并提升代码正确性。 |
large language model chain-of-thought |
|
|
| 6 |
Silicon Showdown: Performance, Efficiency, and Ecosystem Barriers in Consumer-Grade LLM Inference |
分析消费级硬件上LLM推理的性能、效率和生态壁垒,揭示Nvidia和Apple Silicon的权衡。 |
large language model |
|
|
| 7 |
Space Network of Experts: Architecture and Expert Placement |
提出Space-XNet框架,解决星载网络中MoE模型的高效分布式部署问题 |
large language model |
|
|
| 8 |
Skills as Verifiable Artifacts: A Trust Schema and a Biconditional Correctness Criterion for Human-in-the-Loop Agent Runtimes |
提出一种基于可验证工件的技能信任模式,用于人机协作Agent运行时环境。 |
large language model |
|
|
| 9 |
AgentFloor: How Far Up the tool use Ladder Can Small Open-Weight Models Go? |
AgentFloor:评估小型开源模型在工具使用Agent中能力的阶梯式基准 |
instruction following |
|
|