cs.CL (2025-01-08)
📊 15 papers in total | 🔗 3 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (10 🔗3)
Pillar 2: RL & Architecture (4)
Pillar 7: Motion Retargeting (1)
🔬 Pillar 9: Embodied Foundation Models (10 papers)
🔬 Pillar 2: RL & Architecture (4 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis | OpenOmni advances open-source omnimodal large language models through progressive multimodal alignment and real-time emotional speech synthesis. | direct preference optimization, large language model, multimodal | | |
| 12 | Graph-Based Multimodal Contrastive Learning for Chart Question Answering | Proposes a graph-based multimodal contrastive learning framework to address the difficulty of fusing heterogeneous information in chart question answering. | contrastive learning, large language model, multimodal | | |
| 13 | Unlocking Multimodal Mathematical Reasoning via Process Reward Model | Proposes the URSA framework, which unlocks multimodal mathematical reasoning through a process reward model. | reinforcement learning, large language model, multimodal | | |
| 14 | Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning | Proposes a quantum-inspired embedding projection and similarity metric method for representation learning. | representation learning, contrastive learning, large language model | | |
🔬 Pillar 7: Motion Retargeting (1 paper)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | S2 Chunking: A Hybrid Framework for Document Segmentation Through Integrated Spatial and Semantic Analysis | Proposes S2 Chunking, a hybrid framework that combines spatial and semantic information to improve document segmentation. | spatial relationship | | |