cs.LG (2024-07-01)
📊 2 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
🔬 Pillar 2: RL Algorithms and Architecture (RL & Architecture) (1 paper)
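| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | FoldGPT: Simple and Effective Large Language Model Compression Scheme | FoldGPT: a simple and efficient compression scheme for large language models that slims the model through block removal and parameter sharing. | distillation, large language model | | |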
🔬 Pillar 9: Embodied Foundation Models (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 2 | Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs | Proposes EEP, an efficient expert pruning strategy that improves the performance of sparse MoE language models and reduces inference cost. | large language model | ✅ | |