Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 8 days ago • 86
InstructSAM: Segment Any Instance with Any Instructions Paper • 2605.26102 • Published 10 days ago • 17
Segment Anything with Motion, Geometry, and Semantic Adaptation for Complex Nonlinear Visual Object Tracking Paper • 2605.22538 • Published 14 days ago • 6
Meta-CoT: Enhancing Granularity and Generalization in Image Editing Paper • 2604.24625 • Published Apr 27 • 26
TAIHRI: Task-Aware 3D Human Keypoints Localization for Close-Range Human-Robot Interaction Paper • 2604.08921 • Published Apr 10 • 2
FCL-COD: Weakly Supervised Camouflaged Object Detection with Frequency-aware and Contrastive Learning Paper • 2603.22969 • Published Mar 24 • 10
Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings Paper • 2602.13823 • Published Feb 14 • 9
GARDO: Reinforcing Diffusion Models without Reward Hacking Paper • 2512.24138 • Published Dec 30, 2025 • 30
VG-Refiner: Towards Tool-Refined Referring Grounded Reasoning via Agentic Reinforcement Learning Paper • 2512.06373 • Published Dec 6, 2025 • 9