cs.LG(2024-10-22)
📊 共 3 篇论文
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Curriculum Reinforcement Learning for Complex Reward Functions | 提出基于课程学习的强化学习方法,解决复杂奖励函数下的控制问题。 | reinforcement learning | ||
| 2 | Scalable spectral representations for multi-agent reinforcement learning in network MDPs | 提出基于谱表示的可扩展算法,解决网络MDP中多智能体强化学习问题 | reinforcement learning |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | UnStar: Unlearning with Self-Taught Anti-Sample Reasoning for LLMs | UnStar:利用自学习反样本推理实现LLM的知识遗忘 | large language model |