| 1 |
CodeIF-Bench: Evaluating Instruction-Following Capabilities of Large Language Models in Interactive Code Generation |
CodeIF-Bench: evaluating the instruction-following capabilities of large language models in interactive code generation |
large language model instruction following |
✅ |
|
| 2 |
Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability |
Theoretical analysis of multi-round LLM reasoning: approximability, learnability, and generalizability |
large language model chain-of-thought |
|
|
| 3 |
COSINT-Agent: A Knowledge-Driven Multimodal Agent for Chinese Open Source Intelligence |
Proposes COSINT-Agent to address multimodal data fusion and reasoning challenges in Chinese open-source intelligence |
large language model multimodal |
|
|
| 4 |
Multi-Agent Systems Powered by Large Language Models: Applications in Swarm Intelligence |
Proposes an LLM-driven multi-agent system for simulating and studying swarm intelligence |
large language model |
✅ |
|
| 5 |
Leveraging Large Language Models to Develop Heuristics for Emerging Optimization Problems |
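Proposes the CEoH framework, which uses large language models to automatically generate heuristics for emerging optimization problems |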
large language model |
|
|
| 6 |
AttackSeqBench: Benchmarking Large Language Models in Analyzing Attack Sequences within Cyber Threat Intelligence |
AttackSeqBench: evaluating the ability of large language models to analyze attack sequences in cyber threat intelligence |
large language model |
|
|
| 7 |
Unified Mind Model: Reimagining Autonomous Agents in the LLM Era |
Proposes the Unified Mind Model (UMM) to enable rapid construction of autonomous agents in the LLM era |
large language model instruction following |
|
|
| 8 |
Human Preferences for Constructive Interactions in Language Model Alignment |
Uses human preference data to study the alignment of language models toward constructive dialogue |
large language model |
|
|
| 9 |
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems |
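Proposes a parallelized planning-acting framework that improves the efficiency of LLM-based multi-agent systems in dynamic environments |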
large language model |
|
|
| 10 |
LeRAAT: LLM-Enabled Real-Time Aviation Advisory Tool |
LeRAAT: an LLM-based real-time aviation advisory tool that improves pilot decision-making efficiency |
large language model |
|
|
| 11 |
OMNISEC: LLM-Driven Provenance-based Intrusion Detection via Retrieval-Augmented Behavior Prompting |
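OMNISEC: intrusion detection based on LLMs and provenance graphs, improving detection performance through retrieval-augmented behavior prompting |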
large language model |
|
|