| 1 |
JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering |
提出JPS以解决多模态大语言模型的越狱攻击问题 |
large language model multimodal |
✅ |
|
| 2 |
MedMKEB: A Comprehensive Knowledge Editing Benchmark for Medical Multimodal Large Language Models |
提出MedMKEB:用于评估医学多模态大语言模型知识编辑的综合基准 |
large language model multimodal |
|
|
| 3 |
MV-Debate: Multi-view Agent Debate with Dynamic Reflection Gating for Multimodal Harmful Content Detection in Social Media |
提出MV-Debate多视角Agent辩论框架,用于社交媒体中多模态有害内容检测。 |
multimodal |
|
|
| 4 |
Can Large Language Models Generate Effective Datasets for Emotion Recognition in Conversations? |
利用小型语言模型生成对话情绪识别数据集,提升模型泛化能力 |
large language model |
|
|
| 5 |
Large Language Models Transform Organic Synthesis From Reaction Prediction to Automation |
大型语言模型将有机合成从反应预测转变为自动化 |
large language model |
|
|
| 6 |
StructVRM: Aligning Multimodal Reasoning with Structured and Verifiable Reward Models |
StructVRM:通过结构化可验证奖励模型对齐多模态推理 |
multimodal |
|
|
| 7 |
Driver Assistant: Persuading Drivers to Adjust Secondary Tasks Using Large Language Models |
利用大语言模型辅助驾驶员调整次要任务,提升道路安全性 |
large language model |
|
|
| 8 |
Incident Response Planning Using a Lightweight Large Language Model with Reduced Hallucination |
提出一种轻量级、低幻觉的大语言模型事件响应规划方法 |
large language model |
|
|
| 9 |
Tool Graph Retriever: Exploring Dependency Graph-based Tool Retrieval for Large Language Models |
提出Tool Graph Retriever(TGR),利用工具依赖图提升大语言模型工具检索性能 |
large language model |
|
|
| 10 |
LLM-BI: Towards Fully Automated Bayesian Inference with Large Language Models |
提出LLM-BI,利用大语言模型实现全自动贝叶斯推断 |
large language model |
|
|
| 11 |
Safety of Embodied Navigation: A Survey |
具身导航安全性综述:分析攻击、防御与评估方法,展望未来研究方向 |
embodied AI large language model |
|
|
| 12 |
QA-Dragon: Query-Aware Dynamic RAG System for Knowledge-Intensive Visual Question Answering |
提出QA-Dragon,用于知识密集型视觉问答的查询感知动态RAG系统 |
large language model multimodal |
|
|
| 13 |
A Framework for Inherently Safer AGI through Language-Mediated Active Inference |
提出一种基于语言介导主动推理的AGI安全框架,旨在实现内生安全性。 |
large language model |
|
|
| 14 |
LLM-Based Intelligent Agents for Music Recommendation: A Comparison with Classical Content-Based Filtering |
利用LLM智能体进行音乐推荐,效果优于传统内容过滤方法 |
large language model |
|
|
| 15 |
Streamlining Admission with LOR Insights: AI-Based Leadership Assessment in Online Master's Program |
提出LORI:利用AI评估推荐信中的领导力,优化在线硕士项目招生流程。 |
large language model |
|
|
| 16 |
AI-Guided Exploration of Large-Scale Codebases |
提出一种AI引导的代码探索方法,结合逆向工程与LLM以提升代码理解效率。 |
large language model |
|
|
| 17 |
KuaiLive: A Real-time Interactive Dataset for Live Streaming Recommendation |
发布KuaiLive:一个用于直播推荐的实时交互数据集 |
TAMP |
✅ |
|
| 18 |
Simulating Human-Like Learning Dynamics with LLM-Empowered Agents |
提出LearnerAgent,利用LLM模拟人类学习动态,揭示LLM的认知局限性。 |
large language model |
|
|
| 19 |
CLAPP: The CLASS LLM Agent for Pair Programming |
CLAPP:用于配对编程的CLASS LLM智能体,提升科研效率 |
large language model |
|
|
| 20 |
Auto-Eval Judge: Towards a General Agentic Framework for Task Completion Evaluation |
提出Auto-Eval Judge通用框架,用于评估Agent任务完成质量,提升评估与人类对齐度。 |
foundation model |
|
|
| 21 |
Multi-Modal Multi-Behavior Sequential Recommendation with Conditional Diffusion-Based Feature Denoising |
提出M$^3$BSR模型,利用条件扩散去噪提升多模态多行为序列推荐精度 |
multimodal |
|
|
| 22 |
NomicLaw: Emergent Trust and Strategic Argumentation in LLMs During Collaborative Law-Making |
NomicLaw:利用LLM进行协同法律制定,探索涌现信任与策略性论证 |
large language model |
|
|
| 23 |
The Term 'Agent' Has Been Diluted Beyond Utility and Requires Redefinition |
重新定义“Agent”概念,解决AI领域术语歧义问题,提升研究清晰度和可复现性 |
large language model |
|
|
| 24 |
EvoGraph: Hybrid Directed Graph Evolution toward Software 3.0 |
EvoGraph:混合有向图进化框架,迈向软件3.0时代 |
large language model |
|
|
| 25 |
Situated Epistemic Infrastructures: A Diagnostic Framework for Post-Coherence Knowledge |
提出情境化认知基础设施框架,诊断后连贯性时代混合人机系统的知识权威性问题。 |
large language model |
|
|
| 26 |
Grid-Agent: An LLM-Powered Multi-Agent System for Power Grid Control |
Grid-Agent:基于LLM的多智能体系统,用于电力网络控制与故障恢复。 |
large language model |
|
|