cs.LG(2024-05-05)

📊 共 8 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (4) 支柱九:具身大模型 (Embodied Foundation Models) (3 🔗1) 支柱五:交互与反应 (Interaction & Reaction) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)

#题目一句话要点标签🔗
1 RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation RICE:利用解释性方法突破强化学习训练瓶颈 reinforcement learning deep reinforcement learning DRL
2 Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning 提出MOAC算法,解决多目标强化学习中的有限时间收敛和样本复杂度问题 reinforcement learning
3 Active Preference Learning for Ordering Items In- and Out-of-sample 提出一种主动偏好学习方法,用于上下文感知的物品排序,提升样本效率和泛化能力。 preference learning
4 Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints 提出基于学习的非马尔可夫安全约束强化学习方法,解决状态表示不足带来的安全问题。 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)

#题目一句话要点标签🔗
5 Exploring the Improvement of Evolutionary Computation via Large Language Models 探索利用大型语言模型改进进化计算方法 large language model
6 Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy 针对代码大语言模型的木马攻击综述:基于触发器的分类框架 large language model
7 Parameter-Efficient Fine-Tuning with Discrete Fourier Transform 提出FourierFT,利用离散傅里叶变换压缩微调参数,提升大模型参数效率。 foundation model

🔬 支柱五:交互与反应 (Interaction & Reaction) (1 篇)

#题目一句话要点标签🔗
8 Confidential and Protected Disease Classifier using Fully Homomorphic Encryption 提出基于全同态加密的疾病分类器,保护用户隐私并实现安全诊断。 OMOMO large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页