| 1 |
Jailbreaking Frontier Foundation Models Through Intention Deception |
提出基于意图欺骗的多轮对话攻击方法,破解前沿大语言模型 |
foundation model multimodal |
|
|
| 2 |
Representational Curvature Modulates Behavioral Uncertainty in Large Language Models |
研究表明表征曲率调节大型语言模型中的行为不确定性 |
large language model |
|
|
| 3 |
Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models |
提出RouteHead,通过学习查询相关的head选择,提升基于LLM的注意力重排序效果。 |
large language model |
|
|
| 4 |
Quantum Kernel Advantage over Classical Collapse in Medical Foundation Model Embeddings |
利用量子核在医学基础模型嵌入上超越经典方法,提升胸部X光片保险分类性能 |
foundation model |
✅ |
|
| 5 |
Layerwise Convergence Fingerprints for Runtime Misbehavior Detection in Large Language Models |
提出层间收敛指纹LCF,用于大语言模型运行时异常行为检测。 |
large language model |
|
|
| 6 |
Strategic Bidding in 6G Spectrum Auctions with Large Language Models |
利用大语言模型在6G频谱拍卖中实现策略性竞标,提升资源利用率 |
large language model |
|
|
| 7 |
An Information-Geometric Framework for Stability Analysis of Large Language Models under Entropic Stress |
提出基于信息几何的框架,评估大语言模型在不确定性下的稳定性。 |
large language model |
|
|
| 8 |
A2DEPT: Large Language Model-Driven Automated Algorithm Design via Evolutionary Program Trees |
A2DEPT:基于大语言模型与进化程序树的自动化算法设计 |
large language model |
|
|
| 9 |
XGRAG: A Graph-Native Framework for Explaining KG-based Retrieval-Augmented Generation |
提出XGRAG框架,通过图扰动解释知识图谱增强的检索增强生成 |
large language model |
|
|
| 10 |
Defective Task Descriptions in LLM-Based Code Generation: Detection and Analysis |
提出SpecValidator,用于检测LLM代码生成中任务描述的缺陷。 |
large language model |
|
|
| 11 |
Leveraging LLMs for Multi-File DSL Code Generation: An Industrial Case Study |
利用LLM生成多文件DSL代码:宝马工业案例研究 |
large language model |
|
|
| 12 |
AgentWard: A Lifecycle Security Architecture for Autonomous AI Agents |
AgentWard:面向自主AI代理的全生命周期安全架构 |
large language model |
✅ |
|
| 13 |
STELLAR-E: a Synthetic, Tailored, End-to-end LLM Application Rigorous Evaluator |
STELLAR-E:提出一种全自动的、可定制的LLM应用评测数据集生成框架。 |
large language model |
|
|
| 14 |
Interoceptive machine framework: Toward interoception-inspired regulatory architectures in artificial intelligence |
提出基于内感受的机器框架,旨在提升人工智能的自适应自主性。 |
embodied AI |
|
|
| 15 |
Why AI Harms Can't Be Fixed One Identity at a Time: What 5300 Incident Reports Reveal About Intersectionality |
揭示AI偏见:基于5300份事故报告的交叉性分析框架 |
large language model |
|
|
| 16 |
GAMMAF: A Common Framework for Graph-Based Anomaly Monitoring Benchmarking in LLM Multi-Agent Systems |
GAMMAF:LLM多智能体系统中基于图的异常监控基准测试通用框架 |
large language model |
|
|
| 17 |
MEMCoder: Multi-dimensional Evolving Memory for Private-Library-Oriented Code Generation |
MEMCoder:面向私有库代码生成的多维演化记忆框架 |
large language model |
|
|
| 18 |
RefEvo: Agentic Design with Co-Evolutionary Verification for Agile Reference Model Generation |
RefEvo:利用协同进化验证的Agentic设计,加速敏捷参考模型生成 |
large language model |
|
|
| 19 |
Defusing the Trigger: Plug-and-Play Defense for Backdoored LLMs via Tail-Risk Intrinsic Geometric Smoothing |
提出TIGS:一种即插即用的后门大语言模型防御方法,无需额外数据和模型更新。 |
large language model |
|
|
| 20 |
What Did They Mean? How LLMs Resolve Ambiguous Social Situations across Perspectives and Roles |
研究表明大型语言模型倾向于消除社会情境中的歧义,而非保持不确定性。 |
large language model |
|
|