| 1 |
MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption |
MetaVLA: a unified meta co-training framework for efficient embodied adaptation |
vision-language-action, VLA, OpenVLA |
|
|
| 2 |
ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems |
Proposes ARM: discovering agentic reasoning modules for generalizable multi-agent systems |
large language model, foundation model, chain-of-thought |
|
|
| 3 |
BuilderBench -- A benchmark for generalist agents |
BuilderBench: a benchmark platform for generalist agents, built for open-ended exploration |
generalist agent |
|
|
| 4 |
Domain-Shift-Aware Conformal Prediction for Large Language Models |
Proposes domain-shift-aware conformal prediction (DS-CP) to improve uncertainty quantification for large language models under domain shift |
large language model |
|
|
| 5 |
PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles |
PuzzlePlex: a diverse puzzle benchmark for evaluating the reasoning and planning abilities of foundation models |
foundation model |
|
|
| 6 |
Leveraging Large Language Models for Cybersecurity Risk Assessment -- A Case from Forestry Cyber-Physical Systems |
Uses locally hosted large language models to assist cybersecurity risk assessment of forestry cyber-physical systems |
large language model |
|
|
| 7 |
StarEmbed: Benchmarking Time Series Foundation Models on Astronomical Observations of Variable Stars |
StarEmbed: a benchmark of time series foundation models on astronomical observations of variable stars |
foundation model |
|
|
| 8 |
Digital Transformation Chatbot (DTchatbot): Integrating Large Language Model-based Chatbot in Acquiring Digital Transformation Needs |
Proposes DTchatbot, a large language model-based chatbot for acquiring digital transformation needs |
large language model |
|
|
| 9 |
Early Multimodal Prediction of Cross-Lingual Meme Virality on Reddit: A Time-Window Analysis |
Proposes a multimodal method, based on time-window analysis, for early prediction of cross-lingual meme virality |
multimodal |
|
|
| 10 |
Uncovering Representation Bias for Investment Decisions in Open-Source Large Language Models |
Uncovers representation bias in open-source large language models for investment decisions, focusing on Qwen models |
large language model |
|
|
| 11 |
Membership Inference Attacks on Tokenizers of Large Language Models |
Proposes tokenizer-based membership inference attacks, exposing privacy risks in large language models |
large language model |
|
|
| 12 |
Data Provenance Auditing of Fine-Tuned Large Language Models with a Text-Preserving Technique |
Proposes a text-preserving watermarking framework for auditing the data provenance of fine-tuned large language models |
large language model |
|
|
| 13 |
Large Language Model-Based Uncertainty-Adjusted Label Extraction for Artificial Intelligence Model Development in Upper Extremity Radiography |
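Uses GPT-4o to extract labels from radiology reports for training multi-label image classification models on upper extremity radiographs |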
large language model |
|
|
| 14 |
Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences |
Reveals the "Moloch's Bargain" that emerges when LLMs compete: pursuing success degrades AI alignment |
large language model |
|
|
| 15 |
Domain-Grounded Evaluation of LLMs in International Student Knowledge |
Proposes a domain-grounded LLM evaluation method for study-abroad knowledge, addressing hallucination. |
large language model |
|
|
| 16 |
Relative Positioning Based Code Chunking Method For Rich Context Retrieval In Repository Level Code Completion Task With Code Language Model |
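Proposes a relative-positioning-based code chunking method that improves code language model performance on repository-level code completion |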
large language model |
|
|
| 17 |
Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting |
Improves large-scale MoE LLM serving performance through data movement forecasting |
large language model |
✅ |
|
| 18 |
Impact of LLMs on Team Collaboration in Software Development |
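Studies the impact of LLMs on team collaboration in software development: improving efficiency and communication while addressing challenges and security issues. |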
large language model |
|
|
| 19 |
Automated Program Repair of Uncompilable Student Code |
Uses large language models to automatically repair uncompilable student code, improving student modeling |
large language model |
|
|
| 20 |
MixReasoning: Switching Modes to Think |
MixReasoning: a hybrid reasoning framework that adaptively adjusts reasoning depth |
chain-of-thought |
|
|
| 21 |
Training-Free Time Series Classification via In-Context Reasoning with LLM Agents |
Proposes FETA: a training-free time series classification framework based on in-context reasoning with LLM agents |
large language model |
✅ |
|
| 22 |
Optimizing for Persuasion Improves LLM Generalization: Evidence from Quality-Diversity Evolution of Debate Strategies |
DebateQD: optimizing LLMs for persuasion improves generalization and mitigates overfitting |
large language model |
|
|
| 23 |
VeriEquivBench: An Equivalence Score for Ground-Truth-Free Evaluation of Formally Verifiable Code |
Proposes the VeriEquivBench benchmark for ground-truth-free evaluation of the equivalence of formally verifiable code. |
large language model |
|
|
| 24 |
ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming |
Proposes ConstraintLLM to solve industrial-level constraint programming problems |
large language model |
✅ |
|
| 25 |
Artificially intelligent agents in the social and behavioral sciences: A history and outlook |
Reviews the history of and outlook for intelligent agents in the social and behavioral sciences |
large language model |
|
|
| 26 |
From Agentification to Self-Evolving Agentic AI for Wireless Networks: Concepts, Approaches, and Future Research Directions |
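Proposes a self-evolving agentic AI framework to address the optimization challenges caused by manual intervention in wireless networks |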
large language model |
|
|