| 1 |
Commencing-Student Enrolment Forecasting Under Data Sparsity with Time Series Foundation Models |
利用时间序列基础模型,解决数据稀疏下的高校新生入学预测问题 |
foundation model |
|
|
| 2 |
InjectRBP: Steering Large Language Model Reasoning Behavior via Pattern Injection |
InjectRBP:通过行为模式注入引导大语言模型推理 |
large language model |
|
|
| 3 |
Talk2DM: Enabling Natural Language Querying and Commonsense Reasoning for Vehicle-Road-Cloud Integrated Dynamic Maps with Large Language Models |
提出Talk2DM,通过自然语言查询和常识推理增强车-路-云协同动态地图 |
large language model |
|
|
| 4 |
scPilot: Large Language Model Reasoning Toward Automated Single-Cell Analysis and Discovery |
scPilot:利用大语言模型推理实现自动化单细胞分析与发现 |
large language model |
✅ |
|
| 5 |
Differentially Private and Communication Efficient Large Language Model Split Inference via Stochastic Quantization and Soft Prompt |
提出DEL框架,通过差分隐私随机量化和软提示实现通信高效的LLM分割推理。 |
large language model |
|
|
| 6 |
Beyond Pixels: Vector-to-Graph Transformation for Reliable Schematic Auditing |
提出向量-图转换以解决工程图纸结构盲目性问题 |
large language model multimodal |
✅ |
|
| 7 |
Beyond Parameter Arithmetic: Sparse Complementary Fusion for Distribution-Aware Model Merging |
提出SCF-RKL,通过稀疏互补融合解决模型合并中的功能干扰问题。 |
large language model instruction following |
|
|
| 8 |
Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation |
MathSpatial:用于评估和提升多模态大语言模型空间数学推理能力的统一框架 |
large language model multimodal |
|
|
| 9 |
Think like a Scientist: Physics-guided LLM Agent for Equation Discovery |
KeplerAgent:基于物理先验知识的LLM智能体,用于符号方程发现 |
large language model |
|
|
| 10 |
GPT-4o Lacks Core Features of Theory of Mind |
GPT-4o缺乏核心的心智理论能力,无法建立连贯一致的心理状态模型 |
large language model |
|
|
| 11 |
AttentionRetriever: Attention Layers are Secretly Long Document Retrievers |
提出AttentionRetriever,利用注意力机制进行高效长文档检索。 |
large language model |
|
|
| 12 |
VIRENA: Virtual Arena for Research, Education, and Democratic Innovation |
VIRENA:用于研究、教育和民主创新的虚拟社交媒体实验平台 |
large language model |
|
|
| 13 |
Sci-CoE: Co-evolving Scientific Reasoning LLMs via Geometric Consensus with Sparse Supervision |
提出Sci-CoE框架,通过几何共识与稀疏监督协同进化科学推理LLM |
large language model |
✅ |
|
| 14 |
The Pensieve Paradigm: Stateful Language Models Mastering Their Own Context |
提出StateLM,赋予语言模型记忆管理能力,提升长文本处理和对话性能。 |
foundation model |
|
|
| 15 |
ModelWisdom: An Integrated Toolkit for TLA+ Model Visualization, Digest and Repair |
ModelWisdom:集成TLA+模型可视化、理解与修复工具,提升模型检查效率。 |
large language model |
|
|
| 16 |
IncompeBench: A Permissively Licensed, Fine-Grained Benchmark for Music Information Retrieval |
IncompeBench:一个许可宽松、细粒度的音乐信息检索评测基准。 |
multimodal |
✅ |
|
| 17 |
Leveraging LLMs to support co-evolution between definitions and instances of textual DSLs: A Systematic Evaluation |
利用LLM协同演化文本DSL定义与实例,系统评估其性能与局限性 |
large language model |
|
|
| 18 |
From Atoms to Trees: Building a Structured Feature Forest with Hierarchical Sparse Autoencoders |
提出HSAE,通过分层稀疏自编码器构建结构化特征森林,挖掘LLM中的多尺度概念结构。 |
large language model |
|
|
| 19 |
AmbiBench: Benchmarking Mobile GUI Agents Beyond One-Shot Instructions in the Wild |
AmbiBench:构建移动GUI Agent基准,评估其在真实场景下处理模糊指令和意图对齐的能力。 |
instruction following |
|
|
| 20 |
AIR: Improving Agent Safety through Incident Response |
AIR:通过事件响应提升LLM Agent的安全性 |
large language model |
|
|
| 21 |
Text2GQL-Bench: A Text to Graph Query Language Benchmark [Experiment, Analysis & Benchmark] |
提出Text2GQL-Bench,用于评估和提升文本到图查询语言的转换性能 |
large language model |
|
|
| 22 |
Benchmark Health Index: A Systematic Framework for Benchmarking the Benchmarks of LLMs |
提出基准健康指数BHI,用于系统性评估和管理LLM基准的可靠性。 |
large language model |
|
|
| 23 |
PhyNiKCE: A Neurosymbolic Agentic Framework for Autonomous Computational Fluid Dynamics |
PhyNiKCE:一种神经符号代理框架,用于自主计算流体动力学。 |
large language model |
|
|
| 24 |
LoRA-based Parameter-Efficient LLMs for Continuous Learning in Edge-based Malware Detection |
提出基于LoRA的参数高效LLM持续学习框架,用于边缘恶意软件检测。 |
large language model |
|
|
| 25 |
MAPLE: Modality-Aware Post-training and Learning Ecosystem |
提出MAPLE,通过模态感知后训练提升多模态强化学习性能。 |
multimodal |
|
|
| 26 |
SemaPop: Semantic-Persona Conditioned Population Synthesis |
SemaPop:提出一种语义-角色条件的人口合成方法,融合LLM与生成模型。 |
large language model |
|
|
| 27 |
Stop Tracking Me! Proactive Defense Against Attribute Inference Attack in LLMs |
提出TRACE-RPS框架,主动防御LLM中的属性推断攻击 |
large language model |
✅ |
|
| 28 |
AgentLeak: A Full-Stack Benchmark for Privacy Leakage in Multi-Agent LLM Systems |
AgentLeak:多智能体LLM系统隐私泄露的全栈基准测试 |
large language model |
|
|
| 29 |
Compiler-Guided Inference-Time Adaptation: Improving GPT-5 Programming Performance in Idris |
编译器指导的推理时自适应:提升GPT-5在Idris编程中的性能 |
large language model |
|
|