| 1 |
Agentic Physical AI toward a Domain-Specific Foundation Model for Nuclear Reactor Control |
提出Agentic Physical AI,用于核反应堆控制的领域特定基础模型。 |
foundation model multimodal |
|
|
| 2 |
Toward Trustworthy Agentic AI: A Multimodal Framework for Preventing Prompt Injection Attacks |
提出跨Agent多模态溯源防御框架,防范Agentic AI中的提示注入攻击。 |
large language model multimodal |
|
|
| 3 |
Divergent-Convergent Thinking in Large Language Models for Creative Problem Generation |
CreativeDC:利用大语言模型中的发散-收敛思维生成多样化创意问题 |
large language model |
|
|
| 4 |
EquaCode: A Multi-Strategy Jailbreak Approach for Large Language Models via Equation Solving and Code Completion |
提出 EquaCode,利用数学方程求解与代码补全实现大语言模型越狱攻击 |
large language model |
|
|
| 5 |
SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search |
SPIRAL:通过具身和反思搜索实现符号LLM规划 |
large language model chain-of-thought |
|
|
| 6 |
AKG kernel Agent: A Multi-Agent Framework for Cross-Platform Kernel Synthesis |
提出AKG kernel Agent,一个用于跨平台内核合成的多智能体框架。 |
multimodal |
|
|
| 7 |
The Gaining Paths to Investment Success: Information-Driven LLM Graph Reasoning for Venture Capital Prediction |
MIRAGE-VC:信息增益驱动的LLM图推理,用于风险投资预测 |
large language model |
|
|
| 8 |
Securing the AI Supply Chain: What Can We Learn From Developer-Reported Security Issues and Solutions of AI Projects? |
通过分析开发者报告的安全问题与解决方案,提升AI供应链安全性 |
large language model |
|
|
| 9 |
TCEval: Using Thermal Comfort to Assess Cognitive and Perceptual Abilities of AI |
提出TCEval框架以评估AI的认知与感知能力 |
large language model |
|
|
| 10 |
From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research |
提出“模型置信度”:一种更高效的LLM数据利用方法,提升LLM模拟研究的统计效率。 |
large language model |
|
|
| 11 |
It's a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents |
提出TRAP基准测试,评估Web Agent在提示注入攻击下的任务重定向脆弱性 |
large language model |
|
|