| 1 | Specialized Foundation Models Struggle to Beat Supervised Baselines | Specialized pretrained foundation models struggle to beat supervised learning baselines | foundation model | | |
| 2 | Kolb-Based Experiential Learning for Generalist Agents with Human-Level Kaggle Data Science Performance | Agent K: a generalist agent based on Kolb's experiential learning and Vygotsky's ZPD that reaches human-level performance in Kaggle data science | generalist agent | | |
| 3 | Long Context RAG Performance of Large Language Models | Studies the performance of long-context LLMs in RAG, revealing their strengths and limitations | large language model | | |
| 4 | Mobility-based Traffic Forecasting in a Multimodal Transport System | Traffic flow forecasting for multimodal transport systems based on population mobility | multimodal | | |
| 5 | Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status | Uses large language models to predict smoking status in order to control for unobserved confounding | large language model | | |
| 6 | CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration | Proposes CE-CoLLM, a cloud-edge collaboration framework that improves LLM inference efficiency and adaptability in edge environments. | large language model | ✅ | |
| 7 | Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios | Reveals the response uncertainty of MLLMs under misleading information and proposes the MUB benchmark and a fine-tuning strategy | large language model multimodal | ✅ | |
| 8 | GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models | Proposes GitChameleon to address the version adaptability of code generation models | large language model | ✅ | |
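| 9 | LASER: Attention with Exponential Transformation | Proposes the LASER attention mechanism, which uses an exponential transformation to strengthen gradient signals and improve Transformer learning efficiency. | large language model | | |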
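| 10 | Climate AI for Corporate Decarbonization Metrics Extraction | Proposes the CAI model, which uses LLMs to automatically extract corporate decarbonization metrics, improving the efficiency and accuracy of data collection. | large language model | | |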
| 11 | DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | DiffLM: controllable synthetic data generation via diffusion language models | large language model | ✅ | |
| 12 | Photon: Federated LLM Pre-Training | Photon: the first end-to-end federated LLM pre-training system, enabling global-scale model training under low bandwidth. | large language model | | |
| 13 | Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment | Random augmentations can effectively bypass the safety alignment of large language models, revealing its fragility | large language model | ✅ | |