| 1 |
DynaQuery: A Self-Adapting Framework for Querying Structured and Multimodal Data |
DynaQuery:一个自适应框架,用于查询结构化和多模态数据 |
large language model multimodal |
|
|
| 2 |
Annotating the Chain-of-Thought: A Behavior-Labeled Dataset for AI Safety |
提出行为标注的思维链数据集,用于AI安全中的激活监控。 |
chain-of-thought |
|
|
| 3 |
Contextual Attention Modulation: Towards Efficient Multi-Task Adaptation in Large Language Models |
提出上下文注意力调制(CAM)机制,高效解决大语言模型中的多任务适应问题。 |
large language model |
✅ |
|
| 4 |
From Charts to Code: A Hierarchical Benchmark for Multimodal Models |
提出Chart2Code分层基准,评估多模态模型在图表理解与代码生成能力。 |
multimodal |
|
|
| 5 |
Comprehending Spatio-temporal Data via Cinematic Storytelling using Large Language Models |
提出MapMuse框架,利用大语言模型和电影叙事技术理解时空数据 |
large language model |
|
|
| 6 |
Planned Diffusion |
提出Planned Diffusion,结合自回归与扩散模型优势,加速高质量文本生成。 |
large language model instruction following |
|
|
| 7 |
RubiSCoT: A Framework for AI-Supported Academic Assessment |
RubiSCoT:一个AI支持的学术评估框架,提升论文评审效率与一致性 |
large language model chain-of-thought |
|
|
| 8 |
LLM-as-a-Prophet: Understanding Predictive Intelligence with Prophet Arena |
构建Prophet Arena基准,探索LLM作为预言机在预测智能方面的潜力 |
large language model |
|
|
| 9 |
LLM-Based Multi-Agent System for Simulating and Analyzing Marketing and Consumer Behavior |
提出基于LLM的多智能体系统,用于模拟和分析营销与消费者行为 |
large language model |
|
|
| 10 |
SMaRT: Select, Mix, and ReinvenT -- A Strategy Fusion Framework for LLM-Driven Reasoning and Planning |
SMaRT:融合多种策略,提升LLM在推理与规划任务中的性能 |
large language model |
|
|
| 11 |
CourtGuard: A Local, Multiagent Prompt Injection Classifier |
提出CourtGuard:一种本地化、多智能体提示注入分类器,降低误报率。 |
large language model |
✅ |
|
| 12 |
Evaluating LLMs for Career Guidance: Comparative Analysis of Computing Competency Recommendations Across Ten African Countries |
评估LLM在非洲职业指导中的应用:计算能力推荐的跨国比较分析 |
large language model |
|
|
| 13 |
CompactPrompt: A Unified Pipeline for Prompt Data Compression in LLM Workflows |
CompactPrompt:面向LLM工作流的统一Prompt数据压缩方案 |
large language model |
|
|
| 14 |
Subject-Event Ontology Without Global Time: Foundations and Execution Semantics |
提出一种无全局时间的Subject-Event本体建模方法,适用于复杂动态系统。 |
TAMP |
|
|
| 15 |
FABRIC: Framework for Agent-Based Realistic Intelligence Creation |
FABRIC:提出一个基于LLM的框架,用于生成Agent交互数据,促进Agent智能体的开发。 |
large language model |
|
|
| 16 |
AI for Distributed Systems Design: Scalable Cloud Optimization Through Repeated LLMs Sampling And Simulators |
利用LLM采样与模拟器,实现分布式系统设计的可扩展云优化 |
large language model |
|
|
| 17 |
DynaKV: Enabling Accurate and Efficient Long-Sequence LLM Decoding on Smartphones |
DynaKV:在智能手机上实现准确高效的长序列LLM解码 |
large language model |
|
|
| 18 |
SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion |
SpecAgent:一种用于代码补全的推测性检索和预测Agent,提升代码生成质量并降低延迟。 |
large language model |
|
|
| 19 |
Network and Systems Performance Characterization of MCP-Enabled LLM Agents |
针对MCP赋能的LLM Agent,分析其网络与系统性能瓶颈并提出优化建议 |
large language model |
|
|