cs.LG（2024-09-12）

📊 共 15 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (10 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (3) 支柱五：交互与反应 (Interaction & Reaction) (1) 支柱一：机器人控制 (Robot Control) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (10 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning	提出Q值正则化Decision ConvFormer，提升离线强化学习轨迹拼接能力	reinforcement learning offline RL offline reinforcement learning
2	Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks	提出基于GNN-DRL的GRANT算法，解决太赫兹空间网络中联合资源分配和任务卸载问题	reinforcement learning deep reinforcement learning DRL
3	Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning	提出基于连续时间强化学习的扩散模型微调框架，提升生成质量	reinforcement learning RLHF
4	GRE^2-MDCL: Graph Representation Embedding Enhanced via Multidimensional Contrastive Learning	提出GRE^2-MDCL模型，通过多维对比学习增强图表示嵌入，提升节点分类性能。	representation learning contrastive learning
5	Theoretical guarantees in KL for Diffusion Flow Matching	提出Diffusion Flow Matching以解决生成模型的KL散度问题	flow matching
6	Multiplex Graph Contrastive Learning with Soft Negatives	提出MUX-GCL，利用多重图表示和软负样本进行跨尺度图对比学习。	contrastive learning	✅
7	DiReDi: Distillation and Reverse Distillation for AIoT Applications	提出DiReDi框架，通过知识蒸馏与反向蒸馏实现AIoT边缘模型自适应更新与用户隐私保护。	distillation
8	DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning	提出DFDG：一种用于单次联邦学习的无数据双生成器对抗蒸馏方法	distillation
9	Multi-Model based Federated Learning Against Model Poisoning Attack: A Deep Learning Based Model Selection for MEC Systems	提出基于多模型的联邦学习框架，增强模型投毒攻击的防御能力，并应用于MEC系统。	reinforcement learning deep reinforcement learning
10	Learning Causally Invariant Reward Functions from Diverse Demonstrations	提出基于因果不变性的逆强化学习正则化方法，提升奖励函数泛化性	reinforcement learning inverse reinforcement learning

🔬 支柱九：具身大模型 (Embodied Foundation Models) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
11	Towards a graph-based foundation model for network traffic analysis	提出基于图的流量分析基础模型，提升网络流量分析任务性能。	foundation model
12	Generated Data with Fake Privacy: Hidden Dangers of Fine-tuning Large Language Models on Generated Data	揭示微调LLM生成数据带来的隐私风险：PII泄露与成员推断攻击	large language model
13	Large Language Models are Pattern Matchers: Editing Semi-Structured and Structured Documents with ChatGPT	利用大型语言模型进行模式匹配，实现半结构化和结构化文档编辑	large language model

🔬 支柱五：交互与反应 (Interaction & Reaction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
14	Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption	提出一种高效的同态加密KAN推理方案，解决隐私保护问题。	OMOMO

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
15	Self-Supervised Learning of Iterative Solvers for Constrained Optimization	提出自监督学习的迭代求解器，加速约束优化问题的实时求解。	model predictive control

⬅️ 返回 cs.LG 首页 · 🏠 返回主页