cs.CV (2024-09-07)
📊 6 papers in total | 🔗 2 with code
🎯 Interest Area Navigation
Pillar 2: RL & Architecture (2 🔗1)
Pillar 9: Embodied Foundation Models (2 🔗1)
Pillar 3: Perception & Semantics (1)
Pillar 1: Robot Control (1)
🔬 Pillar 2: RL & Architecture (2 papers)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Fine-Grained Representation Learning via Multi-Level Contrastive Learning without Class Priors | Proposes the Contrastive Disentangling framework for fine-grained representation learning without class priors | representation learning, contrastive learning | ✅ | |
| 2 | C2F-CHART: A Curriculum Learning Approach to Chart Classification | Proposes C2F-CHART, which uses coarse-to-fine curriculum learning to optimize chart classification | curriculum learning | | |
🔬 Pillar 9: Embodied Foundation Models (2 papers)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Training-free Zero-shot Composed Image Retrieval via Weighted Modality Fusion and Similarity | Proposes WeiMoCIR, a training-free zero-shot composed image retrieval method | large language model, multimodal | ✅ | |
| 4 | POINTS: Improving Your Vision-language Model with Affordable Strategies | POINTS: improves vision-language models through cost-effective strategies | large language model | | |
🔬 Pillar 3: Perception & Semantics (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 5 | Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras | Proposes Fisheye-GS, a lightweight and extensible Gaussian splatting module for fisheye cameras | 3D gaussian splatting, 3DGS, gaussian splatting | | |
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | Enhancing Image Authenticity Detection: Swin Transformers and Color Frame Analysis for CGI vs. Real Images | Proposes a CGI image authenticity detection method based on Swin Transformer and color space analysis | manipulation | | |