| 1 |
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science |
DeepAnalyze: an agentic large language model for autonomous data science |
large language model |
|
|
| 2 |
BreakFun: Jailbreaking LLMs via Schema Exploitation |
BreakFun: attacking large language models by exploiting schema vulnerabilities |
large language model, chain-of-thought |
|
|
| 3 |
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models |
SAKE: the first benchmark for editing auditory attribute knowledge in large audio-language models |
multimodal |
|
|
| 4 |
Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations |
Revealing safety vulnerabilities of large audio-language models (LALMs) under variations in speaker emotion |
multimodal |
|
|
| 5 |
When Many-Shot Prompting Fails: An Empirical Study of LLM Code Translation |
Revealing the many-shot prompting paradox in LLM code translation: fewer examples beat more |
large language model |
|
|
| 6 |
Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning |
Building an Indian legal reasoning benchmark to evaluate the suitability of LLMs for the legal domain |
large language model |
|
|