cs.LG(2024-07-16)
📊 共 5 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Cross-Modal Augmentation for Few-Shot Multimodal Fake News Detection | 提出跨模态增强方法CMA,解决少样本多模态假新闻检测问题 | multimodal | ✅ | |
| 2 | Performance Evaluation of Lightweight Open-source Large Language Models in Pediatric Consultations: A Comparative Analysis | 轻量级开源大语言模型在儿科咨询中的性能评估与比较分析 | large language model | ||
| 3 | Private prediction for large-scale synthetic text generation | 提出基于私有预测的大规模合成文本生成方法,提升数据质量与隐私保护水平 | large language model |
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Satisficing Exploration for Deep Reinforcement Learning | 提出基于不确定性价值函数的深度强化学习算法,实现高效的满意解探索。 | reinforcement learning deep reinforcement learning | ||
| 5 | Bellman Diffusion Models | 提出基于扩散模型的贝尔曼更新方法,用于离线强化学习策略建模。 | reinforcement learning offline reinforcement learning imitation learning |