| 1 |
OccVLA: Vision-Language-Action Model with Implicit 3D Occupancy Supervision |
OccVLA:提出基于隐式3D Occupancy监督的视觉-语言-动作模型,提升自动驾驶场景理解。 |
vision-language-action large language model multimodal |
|
|
| 2 |
Towards Meta-Cognitive Knowledge Editing for Multimodal LLMs |
提出MIND框架,提升多模态LLM的元认知知识编辑能力 |
large language model multimodal |
|
|
| 3 |
Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated |
针对具备推理能力的大语言模型,提出一种新的分解推理中毒攻击方法。 |
large language model chain-of-thought |
|
|
| 4 |
Decoding Latent Attack Surfaces in LLMs: Prompt Injection via HTML in Web Summarization |
揭示LLM在Web摘要中的潜在攻击面:通过HTML注入提示 |
large language model |
|
|
| 5 |
EchoLeak: The First Real-World Zero-Click Prompt Injection Exploit in a Production LLM System |
EchoLeak:首个在生产LLM系统中实现的零点击Prompt注入漏洞利用 |
large language model |
|
|