cs.LG(2024-07-22)

📊 共 4 篇论文

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (3) 支柱二:RL算法与架构 (RL & Architecture) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)

#题目一句话要点标签🔗
1 Attention Is All You Need But You Don't Need All Of It For Inference of Large Language Models 通过选择性丢弃层,加速大语言模型Llama-v2的推理过程 large language model
2 Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs 提出目标导向的隐空间对抗训练,提升LLM对有害行为的鲁棒性 large language model
3 Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models 提出基于率失真理论的提示压缩框架,用于优化黑盒语言模型的提示。 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)

#题目一句话要点标签🔗
4 Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications 综述强化学习中奖励工程与塑造技术,提升算法效率与效果 reinforcement learning reward shaping

⬅️ 返回 cs.LG 首页 · 🏠 返回主页