cs.LG(2024-09-12)

📊 共 15 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (10 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (3) 支柱五:交互与反应 (Interaction & Reaction) (1) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (10 篇)

#题目一句话要点标签🔗
1 Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning 提出Q值正则化Decision ConvFormer,提升离线强化学习轨迹拼接能力 reinforcement learning offline RL offline reinforcement learning
2 Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks 提出基于GNN-DRL的GRANT算法,解决太赫兹空间网络中联合资源分配和任务卸载问题 reinforcement learning deep reinforcement learning DRL
3 Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning 提出基于连续时间强化学习的扩散模型微调框架,提升生成质量 reinforcement learning RLHF
4 GRE^2-MDCL: Graph Representation Embedding Enhanced via Multidimensional Contrastive Learning 提出GRE^2-MDCL模型,通过多维对比学习增强图表示嵌入,提升节点分类性能。 representation learning contrastive learning
5 Theoretical guarantees in KL for Diffusion Flow Matching 提出Diffusion Flow Matching以解决生成模型的KL散度问题 flow matching
6 Multiplex Graph Contrastive Learning with Soft Negatives 提出MUX-GCL,利用多重图表示和软负样本进行跨尺度图对比学习。 contrastive learning
7 DiReDi: Distillation and Reverse Distillation for AIoT Applications 提出DiReDi框架,通过知识蒸馏与反向蒸馏实现AIoT边缘模型自适应更新与用户隐私保护。 distillation
8 DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning 提出DFDG:一种用于单次联邦学习的无数据双生成器对抗蒸馏方法 distillation
9 Multi-Model based Federated Learning Against Model Poisoning Attack: A Deep Learning Based Model Selection for MEC Systems 提出基于多模型的联邦学习框架,增强模型投毒攻击的防御能力,并应用于MEC系统。 reinforcement learning deep reinforcement learning
10 Learning Causally Invariant Reward Functions from Diverse Demonstrations 提出基于因果不变性的逆强化学习正则化方法,提升奖励函数泛化性 reinforcement learning inverse reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)

#题目一句话要点标签🔗
11 Towards a graph-based foundation model for network traffic analysis 提出基于图的流量分析基础模型,提升网络流量分析任务性能。 foundation model
12 Generated Data with Fake Privacy: Hidden Dangers of Fine-tuning Large Language Models on Generated Data 揭示微调LLM生成数据带来的隐私风险:PII泄露与成员推断攻击 large language model
13 Large Language Models are Pattern Matchers: Editing Semi-Structured and Structured Documents with ChatGPT 利用大型语言模型进行模式匹配,实现半结构化和结构化文档编辑 large language model

🔬 支柱五:交互与反应 (Interaction & Reaction) (1 篇)

#题目一句话要点标签🔗
14 Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption 提出一种高效的同态加密KAN推理方案,解决隐私保护问题。 OMOMO

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
15 Self-Supervised Learning of Iterative Solvers for Constrained Optimization 提出自监督学习的迭代求解器,加速约束优化问题的实时求解。 model predictive control

⬅️ 返回 cs.LG 首页 · 🏠 返回主页