| 1 |
Multimodal Physical Activity Forecasting in Free-Living Clinical Settings: Hunting Opportunities for Just-in-Time Interventions |
MoveSense:利用多模态LSTM预测患者活动行为,为即时干预提供机会 |
multimodal |
|
|
| 2 |
ReLU's Revival: On the Entropic Overload in Normalization-Free Large Language Models |
ReLU激活函数在无LayerNorm的大语言模型中表现优于GELU,提升困惑度。 |
large language model |
✅ |
|
| 3 |
Mastering AI: Big Data, Deep Learning, and the Evolution of Large Language Models -- AutoML from Basics to State-of-the-Art Techniques |
AutoML综述:从基础到前沿技术,助力AI模型自动化构建 |
large language model |
|
|
| 4 |
Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis |
通过梯度流分析,研究Transformer识别词共现的训练动态 |
large language model |
|
|
| 5 |
Towards Scalable Semantic Representation for Recommendation |
提出Mixture-of-Codes方法,提升推荐系统中语义表征的可扩展性和性能。 |
large language model |
|
|
| 6 |
AT-MoE: Adaptive Task-planning Mixture of Experts via LoRA Approach |
提出AT-MoE:一种基于LoRA的自适应任务规划混合专家模型,提升特定任务性能和可解释性。 |
large language model |
|
|
| 7 |
Towards the Effect of Examples on In-Context Learning: A Theoretical Case Study |
理论分析上下文学习中示例对二分类任务的影响,揭示预训练知识与示例的交互机制 |
large language model |
|
|
| 8 |
Fine-grained Attention I/O Complexity: Comprehensive Analysis for Backward Passes |
针对Attention机制反向传播,提出细粒度I/O复杂度分析,优化LLM训练效率。 |
large language model |
|
|