| 1 |
Foundation-Model-Based Agents in Industrial Automation: Purposes, Capabilities, and Open Challenges |
Survey: applications, capabilities, and challenges of foundation-model-based agents in industrial automation |
large language model foundation model |
|
|
| 2 |
Position: How can Graphs Help Large Language Models? |
Graph structures help large language models: improving knowledge, reasoning, and structured-data understanding |
large language model chain-of-thought |
|
|
| 3 |
On Training Large Language Models for Long-Horizon Tasks: An Empirical Study of Horizon Length |
Studies LLM training on long-horizon tasks, revealing how task length affects training stability and generalization |
large language model |
|
|
| 4 |
When Audio-Language Models Fail to Leverage Multimodal Context for Dysarthric Speech Recognition |
For dysarthric speech recognition, shows that existing audio-language models fail to make effective use of multimodal clinical context. |
multimodal |
|
|
| 5 |
Foundation Models to Unlock Real-World Evidence from Nationwide Medical Claims |
Proposes ReClaim: a pretrained healthcare model built on large-scale medical claims data |
foundation model |
|
|
| 6 |
ProPACT: A Proactive AI-Driven Adaptive Collaborative Tutor for Pair Programming |
ProPACT: a proactive, AI-driven adaptive collaborative tutoring system for pair programming |
multimodal |
|
|
| 7 |
Anon: Extrapolating Optimizer Adaptivity Across the Real Spectrum |
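Proposes the Anon optimizer, which unifies and outperforms classical and modern optimizers through tunable adaptivity and incremental delayed updates. |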
large language model |
|
|
| 8 |
Submodular Benchmark Selection |
Proposes a benchmark selection method based on submodular optimization that reduces the cost of evaluating large models. |
large language model |
|
|
| 9 |
AI-Generated Smells: An Analysis of Code and Architecture in LLM and Agent-Driven Development |
Reveals the technical debt of AI-generated software: an analysis of the trade-off between reasoning complexity and code quality |
large language model |
|
|
| 10 |
Hybrid Inspection and Task-Based Access Control in Zero-Trust Agentic AI |
Proposes hybrid inspection and task-based access control to secure zero-trust agentic AI. |
large language model |
|
|
| 11 |
Trustworthy AI Suffers from Invariance Conflicts and Causality is The Solution |
Uses causality to resolve invariance conflicts in trustworthy AI |
foundation model |
|
|
| 12 |
Beyond State Machines: Executing Network Procedures with Agentic Tool-Calling Sequences |
Executes network procedures with agentic tool-calling sequences, improving the flexibility of mobile communication systems. |
large language model |
|
|
| 13 |
Strategy-Aware Optimization Modeling with Reasoning LLMs |
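Proposes the SAGE framework, which explicitly models optimization strategies to improve the correctness and efficiency of LLMs in optimization modeling. |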
large language model |
✅ |
|
| 14 |
From Experimental Limits to Physical Insight: A Retrieval-Augmented Multi-Agent Framework for Interpreting Searches Beyond the Standard Model |
Proposes HEP-CoPilot, a retrieval-augmented multi-agent framework for interpreting the results of searches beyond the Standard Model. |
multimodal |
|
|
| 15 |
GRAIL: A Deep-Granularity Hybrid Resonance Framework for Real-Time Agent Discovery via SLM-Enhanced Indexing |
GRAIL: a deep-granularity hybrid resonance framework for real-time agent discovery via SLM-enhanced indexing |
large language model |
|
|
| 16 |
LLM-Assisted Repository-Level Generation with Structured Spec-Driven Engineering |
Proposes Structured Spec-Driven Engineering (SSDE), improving the quality and verifiability of LLM-based repository-level code generation. |
large language model |
|
|
| 17 |
APIOT: Autonomous Vulnerability Management Across Bare-Metal Industrial OT Networks |
APIOT: a framework for autonomous vulnerability management across bare-metal industrial OT networks |
large language model |
|
|
| 18 |
LLM-enabled Social Agents |
Proposes a role-definition-based framework for LLM social agents that improves their social interaction capabilities |
large language model |
|
|
| 19 |
EngiAgent: Fully Connected Coordination of LLM Agents for Solving Open-ended Engineering Problems with Feasible Solutions |
EngiAgent: fully connected coordination of LLM agents for solving feasibility-oriented, open-ended engineering problems |
large language model |
✅ |
|
| 20 |
Complexity Horizons of Compressed Models in Analog Circuit Analysis |
Proposes a premise-graph-based model compression strategy to optimize LLM inference efficiency in circuit analysis. |
large language model |
✅ |
|
| 21 |
On the Privacy of LLMs: An Ablation Study |
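Addresses LLM privacy risks by proposing a unified threat model and conducting an ablation study that reveals the impact of design choices. |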
large language model |
|
|
| 22 |
Zero-Shot Confidence Estimation for Small LLMs: When Supervised Baselines Aren't Worth Training |
For small LLMs, proposes a zero-shot confidence estimation method that enables reliable local-to-cloud routing without supervised training. |
large language model |
|
|
| 23 |
CoVSpec: Efficient Device-Edge Co-Inference for Vision-Language Models via Speculative Decoding |
Proposes CoVSpec, which uses speculative decoding to deploy vision-language models efficiently in device-edge co-inference. |
multimodal |
|
|
| 24 |
Retrieval and Multi-Hop Reasoning in 1M-Token Context Windows: Evaluating LLMs on Classical Chinese Text |
Evaluates LLM retrieval and multi-hop reasoning on Classical Chinese text within million-token context windows |
large language model |
|
|
| 25 |
DocSync: Agentic Documentation Maintenance via Critic-Guided Reflexion |
DocSync: proposes an agent based on critic-guided Reflexion that keeps software documentation consistent with the code. |
large language model |
|
|
| 26 |
The Dynamic Gist-Based Memory Model (DGMM): A Memory-Centric Architecture for Artificial Intelligence |
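Proposes the Dynamic Gist-Based Memory Model (DGMM), which addresses AI's limitations in persistent memory, temporal grounding, and interpretability. |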
large language model |
|
|