| 1 |
Agentic Physical AI toward a Domain-Specific Foundation Model for Nuclear Reactor Control |
提出Agentic Physical AI,用于核反应堆控制的领域特定基础模型。 |
foundation model multimodal |
|
|
| 2 |
Breaking Audio Large Language Models by Attacking Only the Encoder: A Universal Targeted Latent-Space Audio Attack |
提出一种通用目标潜在空间音频攻击,打破音频大语言模型编码器。 |
large language model multimodal |
|
|
| 3 |
Toward Trustworthy Agentic AI: A Multimodal Framework for Preventing Prompt Injection Attacks |
提出跨Agent多模态溯源防御框架,防范Agentic AI中的提示注入攻击 |
large language model multimodal |
|
|
| 4 |
EquaCode: A Multi-Strategy Jailbreak Approach for Large Language Models via Equation Solving and Code Completion |
提出EquaCode,利用方程求解与代码补全实现大语言模型的越狱攻击 |
large language model |
|
|
| 5 |
How Large Language Models Systematically Misrepresent American Climate Opinions |
揭示大型语言模型在美国气候观点上的系统性偏差,尤其是在交叉身份群体中。 |
large language model |
|
|
| 6 |
Divergent-Convergent Thinking in Large Language Models for Creative Problem Generation |
CreativeDC:利用大语言模型中的发散-收敛思维生成多样化创意问题 |
large language model |
|
|
| 7 |
SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search |
SPIRAL:通过具身和反思搜索实现符号LLM规划 |
large language model chain-of-thought |
|
|
| 8 |
From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research |
提出“模型置信度”以更高效利用LLM的概率信息,提升模拟研究效率。 |
large language model |
|
|
| 9 |
Enhancing Temporal Awareness in LLMs for Temporal Point Processes |
提出TPP-TAL框架,增强LLM在时序点过程中的时间感知能力 |
large language model |
✅ |
|
| 10 |
It's a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents |
提出TRAP基准以评估网络代理的劝说脆弱性 |
large language model |
|
|
| 11 |
AKG kernel Agent: A Multi-Agent Framework for Cross-Platform Kernel Synthesis |
提出AKG内核代理以解决跨平台内核合成问题 |
multimodal |
|
|
| 12 |
CASCADE: Cumulative Agentic Skill Creation through Autonomous Development and Evolution |
CASCADE:通过自主开发和演化实现累积式智能体技能创造 |
large language model |
|
|
| 13 |
From Correctness to Collaboration: Toward a Human-Centered Framework for Evaluating AI Agent Behavior in Software Engineering |
提出人本框架以评估软件工程中AI代理行为 |
large language model |
|
|
| 14 |
The Gaining Paths to Investment Success: Information-Driven LLM Graph Reasoning for Venture Capital Prediction |
提出MIRAGE-VC,利用信息增益驱动的LLM图推理进行风险投资预测。 |
large language model |
|
|
| 15 |
Securing the AI Supply Chain: What Can We Learn From Developer-Reported Security Issues and Solutions of AI Projects? |
分析AI项目开发者报告的安全问题与解决方案,保障AI供应链安全。 |
large language model |
|
|
| 16 |
TCEval: Using Thermal Comfort to Assess Cognitive and Perceptual Abilities of AI |
TCEval:利用热舒适度评估AI的认知和感知能力 |
large language model |
|
|