cs.CL(2025-04-15)
📊 共 25 篇论文 | 🔗 7 篇有代码
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (17 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (8 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 18 | A Dual-Space Framework for General Knowledge Distillation of Large Language Models | 提出双空间知识蒸馏框架DSKD,解决大语言模型通用知识蒸馏问题。 | distillation large language model instruction following | ||
| 19 | DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis | DeepMLF:一种基于可学习Token的多模态语言模型,用于情感分析中的深度融合 | representation learning multimodal | ||
| 20 | Dynamic Compressing Prompts for Efficient Inference of Large Language Models | 提出动态压缩提示(LLM-DCP)方法,高效推理大型语言模型,显著降低计算成本。 | curriculum learning large language model | ✅ | |
| 21 | Minitron-SSM: Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning | Minitron-SSM:通过分组感知SSM剪枝实现高效混合语言模型压缩 | SSM state space model distillation | ||
| 22 | Efficient Reasoning Models: A Survey | 综述高效推理模型,加速Chain-of-Thoughts范式在复杂逻辑任务中的应用。 | reinforcement learning distillation chain-of-thought | ✅ | |
| 23 | ReTool: Reinforcement Learning for Strategic Tool Use in LLMs | ReTool:强化学习驱动LLM战略性工具使用,提升复杂数学推理能力 | reinforcement learning | ||
| 24 | OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution | 提出OpenTuringBench,用于评估和训练机器生成文本检测与溯源模型。 | contrastive learning large language model | ✅ | |
| 25 | ReZero: Enhancing LLM search ability by trying one-more-time | ReZero:通过奖励重试机制提升LLM的检索能力 | reinforcement learning large language model |