COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning Paper • 2504.21850 • Published Apr 30, 2025 • 27
Unifying Specialized Visual Encoders for Video Language Models Paper • 2501.01426 • Published Jan 2, 2025 • 20