cs.LG(2024-08-17)

📊 共 8 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (6 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (2 🔗1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)

#题目一句话要点标签🔗
1 Training Verifiably Robust Agents Using Set-Based Reinforcement Learning 提出基于集合的强化学习方法,训练可验证鲁棒性的智能体 reinforcement learning
2 QEDCartographer: Automating Formal Verification Using Reward-Free Reinforcement Learning QEDCartographer:利用无奖励强化学习自动化形式化验证 reinforcement learning
3 Linear Attention is Enough in Spatial-Temporal Forecasting 提出STformer与NSTformer以解决交通预测中的动态拓扑问题 linear attention
4 Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning 提出基于马尔可夫平衡的离线模仿学习方法,提升在严格批量环境下的性能 imitation learning
5 Dynamic Graph Representation Learning for Passenger Behavior Prediction 提出DyGPP,利用动态图学习预测乘客行为,助力智慧城市公共交通规划。 representation learning
6 Fairness-Aware Streaming Feature Selection with Causal Graphs 提出SFCF算法,利用因果图解决流式特征选择中的公平性与准确性权衡问题。 predictive model egocentric

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
7 FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models 提出FedKIM,通过联邦学习将知识注入医学基础模型,解决医疗数据隐私和多样性限制。 foundation model multimodal
8 Selective Prompt Anchoring for Code Generation 提出选择性Prompt锚定(SPA)方法,解决代码生成中LLM对用户意图关注不足的问题。 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页