| 1 |
Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography |
Odysseus:利用双重隐写术破解商业多模态LLM集成系统 |
large language model multimodal |
|
|
| 2 |
Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent |
SAGE:基于人机协同推理的大语言模型自动立体定向放射外科计划系统 |
large language model chain-of-thought |
|
|
| 3 |
Dual-Encoder Transformer-Based Multimodal Learning for Ischemic Stroke Lesion Segmentation Using Diffusion MRI |
提出基于双编码器Transformer的Ischemic Stroke病灶分割方法,提升DWI和ADC图像的分割精度。 |
multimodal |
|
|
| 4 |
Toward Explaining Large Language Models in Software Engineering Tasks |
提出FeatureSHAP,用于解释软件工程任务中的大型语言模型 |
large language model |
✅ |
|
| 5 |
Advancing Multimodal Teacher Sentiment Analysis:The Large-Scale T-MED Dataset & The Effective AAM-TSA Model |
构建T-MED数据集与AAM-TSA模型以提升教师情感分析准确性 |
multimodal |
|
|
| 6 |
SynCraft: Guiding Large Language Models to Predict Edit Sequences for Molecular Synthesizability Optimization |
SynCraft:引导大语言模型预测编辑序列,优化分子合成可行性 |
large language model |
|
|
| 7 |
TongSIM: A General Platform for Simulating Intelligent Machines |
TongSIM:通用智能机器模拟平台,支持具身智能体训练与评估 |
embodied AI large language model multimodal |
|
|
| 8 |
Concept Generalization in Humans and Large Language Models: Insights from the Number Game |
通过数字游戏对比人类与大语言模型在概念泛化上的差异 |
large language model |
|
|
| 9 |
A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice |
DeepSeek赋能的AI系统Janus-Pro-CXR,用于临床胸部X光片自动判读 |
large language model multimodal |
|
|
| 10 |
Reason2Decide: Rationale-Driven Multi-Task Learning |
Reason2Decide:一种基于理由驱动的多任务学习框架,提升临床决策支持系统的预测精度和解释一致性。 |
large language model foundation model |
|
|
| 11 |
Generative Digital Twins: Vision-Language Simulation Models for Executable Industrial Systems |
提出视觉-语言模拟模型,从草图和文本生成可执行的工业系统数字孪生。 |
multimodal |
|
|
| 12 |
Synthesizing Procedural Memory: Challenges and Architectures in Automated Workflow Generation |
提出一种自动工作流生成方法,解决大型语言模型从工具使用者到工作流架构师的转变难题。 |
large language model |
|
|
| 13 |
Memory as Resonance: A Biomimetic Architecture for Infinite Context Memory on Ergodic Phonetic Manifolds |
提出基于遍历语音流形的共振记忆架构PTM,解决大语言模型无限上下文记忆问题。 |
large language model |
|
|
| 14 |
MemR$^3$: Memory Retrieval via Reflective Reasoning for LLM Agents |
MemR³:通过反思推理实现LLM Agent的记忆检索,提升问答质量。 |
large language model |
|
|
| 15 |
AXIOM: Benchmarking LLM-as-a-Judge for Code via Rule-Based Perturbation and Multisource Quality Calibration |
AXIOM:通过规则扰动和多源质量校准,基准测试LLM作为代码评估判官的能力 |
large language model |
|
|
| 16 |
Enhancing Zero-Shot Time Series Forecasting in Off-the-Shelf LLMs via Noise Injection |
通过噪声注入增强即用型LLM的零样本时间序列预测能力 |
large language model |
|
|
| 17 |
On the Effectiveness of Instruction-Tuning Local LLMs for Identifying Software Vulnerabilities |
指令调优本地LLM,有效识别软件漏洞类型,提升安全性和实用性。 |
large language model |
|
|
| 18 |
S$^3$IT: A Benchmark for Spatially Situated Social Intelligence Test |
提出S$^3$IT基准测试,用于评估具身智能体在复杂社交环境中的推理能力 |
large language model |
|
|