cs.LG(2024-05-05)
📊 共 8 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (4)
支柱九:具身大模型 (Embodied Foundation Models) (3 🔗1)
支柱五:交互与反应 (Interaction & Reaction) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation | RICE:利用解释性方法突破强化学习训练瓶颈 | reinforcement learning deep reinforcement learning DRL | ||
| 2 | Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning | 提出MOAC算法,解决多目标强化学习中的有限时间收敛和样本复杂度问题 | reinforcement learning | ||
| 3 | Active Preference Learning for Ordering Items In- and Out-of-sample | 提出一种主动偏好学习方法,用于上下文感知的物品排序,提升样本效率和泛化能力。 | preference learning | ||
| 4 | Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints | 提出基于学习的非马尔可夫安全约束强化学习方法,解决状态表示不足带来的安全问题。 | reinforcement learning |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 5 | Exploring the Improvement of Evolutionary Computation via Large Language Models | 探索利用大型语言模型改进进化计算方法 | large language model | ||
| 6 | Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy | 针对代码大语言模型的木马攻击综述:基于触发器的分类框架 | large language model | ||
| 7 | Parameter-Efficient Fine-Tuning with Discrete Fourier Transform | 提出FourierFT,利用离散傅里叶变换压缩微调参数,提升大模型参数效率。 | foundation model | ✅ |
🔬 支柱五:交互与反应 (Interaction & Reaction) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 8 | Confidential and Protected Disease Classifier using Fully Homomorphic Encryption | 提出基于全同态加密的疾病分类器,保护用户隐私并实现安全诊断。 | OMOMO large language model |