cs.CV (2024-05-19)

📊 9 papers in total | 🔗 2 with code

🎯 Interest Area Navigation

Pillar 2: RL & Architecture (4 🔗2) · Pillar 3: Perception & Semantics (2) · Pillar 1: Robot Control (2) · Pillar 9: Embodied Foundation Models (1)

🔬 Pillar 2: RL & Architecture (4 papers)

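#	Title	One-Sentence Summary	Tags	🔗	⭐
1	Transcriptomics-guided Slide Representation Learning in Computational Pathology	Tangle: uses transcriptomics to guide pathology slide representation learning, improving performance in computational pathology.	representation learning contrastive learning multimodal	✅
2	Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation	Proposes DiME and NICKEL, a dual-method approach that uses knowledge distillation to improve GAN efficiency in resource-constrained settings.	distillation foundation model
3	SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization	Proposes SLAB, which improves Transformer efficiency through simplified linear attention and progressive re-parameterized BatchNorm.	linear attention	✅
4	Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation	Proposes a cross-domain knowledge distillation framework that improves low-resolution human pose estimation.	distillation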

🔬 Pillar 3: Perception & Semantics (2 papers)

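#	Title	One-Sentence Summary	Tags	🔗	⭐
5	CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs	Proposes CRF360D, based on spherical fully-connected CRFs, for monocular 360-degree depth estimation.	depth estimation
6	Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance for Low-Light Image Enhancement	Proposes an unsupervised image prior based on prompt learning and CLIP semantic guidance for low-light image enhancement.	open-vocabulary open vocabulary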

🔬 Pillar 1: Robot Control (2 papers)

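#	Title	One-Sentence Summary	Tags	🔗	⭐
7	Diffusion-Based Hierarchical Image Steganography	Proposes diffusion-based hierarchical image steganography, improving the security and capacity of multi-image embedding.	manipulation
8	Physics-aware Hand-object Interaction Denoising	Proposes a physics-aware denoising method for hand-object interaction that improves the realism and accuracy of reconstructed sequences.	manipulation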

🔬 Pillar 9: Embodied Foundation Models (1 paper)

#	Title	One-Sentence Summary	Tags	🔗	⭐
9	"Previously on ..." From Recaps to Story Summarization	Proposes the TaleSumm model, which uses story recap videos for multimodal story summarization.	multimodal
