| 1 | A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems | Proposes self-evolving AI agents to address the limited adaptability of static configurations | large language model foundation model | | |
| 2 | Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy | Proposes an evaluation harness that enables large language models to be applied to full-press Diplomacy | large language model | | |
| 3 | Generative AI for Strategic Plan Development | Proposes using generative AI to improve strategic plan development | large language model | | |
| 4 | Benchmarking for Domain-Specific LLMs: A Case Study on Academia and Beyond | Proposes the Comp-Comp framework to improve benchmark evaluation of domain-specific LLMs | large language model | ✅ | |