| 1 |
Large Language Models Are Unreliable for Cyber Threat Intelligence |
评估LLM在网络威胁情报中的可靠性,揭示其不一致性和过度自信问题 |
large language model |
|
|
| 2 |
Agentic Large Language Models, a survey |
综述性论文:Agentic LLM,探索具身智能大语言模型的研究进展与未来方向 |
large language model |
|
|
| 3 |
CCCI: Code Completion with Contextual Information for Complex Data Transfer Tasks Using Large Language Models |
CCCI:利用上下文信息增强LLM在复杂数据传输任务中的代码补全能力 |
large language model |
|
|
| 4 |
Leaking LoRa: An Evaluation of Password Leaks and Knowledge Storage in Large Language Models |
评估大语言模型中密码泄露风险:通过LoRA微调暴露敏感信息,并使用ROME进行修复。 |
large language model |
|
|
| 5 |
Who Owns the Output? Bridging Law and Technology in LLMs Attribution |
针对LLM内容归属难题,提出结合法律与技术的框架以确保责任追溯 |
large language model multimodal |
|
|
| 6 |
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis |
CodeARC:通过交互式程序合成基准测试LLM智能体的推理能力 |
large language model |
✅ |
|
| 7 |
Encrypted Prompt: Securing LLM Applications Against Unauthorized Actions |
提出加密提示方法,保障LLM应用免受未授权操作攻击 |
large language model |
|
|
| 8 |
Simulation of Non-Ordinary Consciousness |
提出Glyph,模拟大型语言模型中类裸盖菇素的非寻常意识状态。 |
large language model |
|
|
| 9 |
Redefining Evaluation Standards: A Unified Framework for Evaluating the Korean Capabilities of Language Models |
提出HRET:一个统一的韩语LLM评估框架,解决评估标准不一致问题。 |
large language model |
|
|