| 1 |
Large Language Model's Multi-Capability Alignment in Biomedical Domain |
提出BalancedBio框架以解决生物医学领域多能力整合问题 |
large language model instruction following |
|
|
| 2 |
Method-Based Reasoning for Large Language Models: Extraction, Reuse, and Continuous Improvement |
提出基于方法推理的模型以提升大型语言模型的逻辑一致性 |
large language model |
|
|
| 3 |
Adversarial Attacks and Defenses on Graph-aware Large Language Models (LLMs) |
提出针对图感知大语言模型的对抗攻击与防御方法 |
large language model |
|
|
| 4 |
Compressing Large Language Models with PCA Without Performance Loss |
通过PCA压缩大语言模型而不损失性能 |
large language model |
|
|
| 5 |
The Emotional Baby Is Truly Deadly: Does your Multimodal Large Reasoning Model Have Emotional Flattery towards Humans? |
提出EmoAgent以解决多模态大规模推理模型的情感操控问题 |
multimodal |
|
|
| 6 |
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use |
综述多模态大语言模型驱动的操作系统代理以提升计算设备的智能化 |
large language model foundation model |
|
|
| 7 |
ConfProBench: A Confidence Evaluation Benchmark for MLLM-Based Process Judges |
提出ConfProBench以评估MLLM过程判断者的置信度 |
large language model multimodal |
|
|
| 8 |
KG-Augmented Executable CoT for Mathematical Coding |
提出KG-Augmented Executable CoT以解决复杂数学推理问题 |
large language model chain-of-thought |
|
|
| 9 |
Fine-Tuning Small Language Models (SLMs) for Autonomous Web-based Geographical Information Systems (AWebGIS) |
提出基于小型语言模型的自主网络地理信息系统解决方案 |
large language model |
|
|
| 10 |
Automated File-Level Logging Generation for Machine Learning Applications using LLMs: A Case Study using GPT-4o Mini |
利用GPT-4o Mini生成机器学习应用的文件级日志 |
large language model |
|
|
| 11 |
Empirical Evaluation of AI-Assisted Software Package Selection: A Knowledge Graph Approach |
提出基于知识图谱的AI辅助软件包选择框架以解决选择困难问题 |
large language model |
|
|
| 12 |
OmniPlay: Benchmarking Omni-Modal Models on Omni-Modal Game Playing |
提出OmniPlay基准以评估多模态模型在动态游戏中的表现 |
foundation model |
✅ |
|
| 13 |
Deliberative Reasoning Network: An Uncertainty-Driven Paradigm for Belief-Tracked Inference with Pretrained Language Models |
提出DRN以解决大语言模型逻辑推理中的认知陷阱问题 |
large language model |
|
|
| 14 |
Generic-to-Specific Reasoning and Learning for Scalable Ad Hoc Teamwork |
提出基于知识与数据驱动的推理学习方法以解决可扩展的临时团队协作问题 |
foundation model |
|
|
| 15 |
Experimental Analysis of Productive Interaction Strategy with ChatGPT: User Study on Function and Project-level Code Generation Tasks |
提出有效的交互策略以提升ChatGPT在代码生成中的生产力 |
large language model |
|
|
| 16 |
GeoSR: Cognitive-Agentic Framework for Probing Geospatial Knowledge Boundaries via Iterative Self-Refinement |
提出GeoSR框架以解决地理空间知识推理问题 |
large language model |
✅ |
|
| 17 |
StepWrite: Adaptive Planning for Speech-Driven Text Generation |
提出StepWrite以解决语音驱动文本生成中的上下文跟踪问题 |
large language model |
|
|