Paper Backlog
CLAY: Conditional Visual Similarity Modulation in Vision-Language Embedding Space (CVPR 2026)
Small Object, Great Challenge: A Benchmark for Small Object Visual Grounding (CVPR 2026)
Uncertainty-Aware Knowledge Distillation for Multimodal Large Language Models (CVPR 2026)
THE MORE, THE MERRIER: CONTRASTIVE FUSION FOR HIGHER-ORDER MULTIMODAL ALIGNMENT (CVPR 2026)