| 10 |
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video |
提出一种基于码率控制扩散模型的视频解耦框架,用于分离视频中的运动和内容。 |
representation learning motion generation |
|
|
| 11 |
PromptGuard: An Orchestrated Prompting Framework for Principled Synthetic Text Generation for Vulnerable Populations using LLMs with Enhanced Safety, Fairness, and Controllability |
PromptGuard:针对弱势群体,通过编排式Prompting框架提升LLM生成文本的安全性、公平性和可控性 |
contrastive learning large language model chain-of-thought |
|
|
| 12 |
First-order State Space Model for Lightweight Image Super-resolution |
提出一阶状态空间模型(FSSM),提升轻量级图像超分辨率性能 |
Mamba SSM state space model |
|
|
| 13 |
SimCroP: Radiograph Representation Learning with Similarity-driven Cross-granularity Pre-training |
SimCroP:基于相似性驱动的跨粒度预训练提升胸部CT影像表征学习 |
representation learning multimodal |
✅ |
|
| 14 |
World Modeling with Probabilistic Structure Integration |
提出概率结构集成(PSI),用于学习可控且灵活提示的世界模型。 |
world model optical flow |
|
|
| 15 |
RewardDance: Reward Scaling in Visual Generation |
RewardDance:通过生成式奖励建模解决视觉生成中的奖励缩放和奖励利用问题 |
reinforcement learning RLHF chain-of-thought |
|
|
| 16 |
Hyperspectral Mamba for Hyperspectral Object Tracking |
提出基于Mamba的HyMamba网络,用于高光谱目标跟踪,提升复杂场景下的跟踪精度。 |
Mamba SSM |
✅ |
|
| 17 |
Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening |
提出基于潜在空间矫正的时间感知视频表征学习方法,用于手性动作识别。 |
representation learning |
|
|