4 15 7

Voyage_Wang

VoyageWang

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

upvoted a paper 8 days ago

InstructSAM: Segment Any Instance with Any Instructions

authored a paper 13 days ago

Segment Anything with Motion, Geometry, and Semantic Adaptation for Complex Nonlinear Visual Object Tracking

View all activity

Organizations

None yet

upvoted a paper 7 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 8 days ago • 86

upvoted a paper 8 days ago

InstructSAM: Segment Any Instance with Any Instructions

Paper • 2605.26102 • Published 10 days ago • 17

upvoted a paper 13 days ago

Segment Anything with Motion, Geometry, and Semantic Adaptation for Complex Nonlinear Visual Object Tracking

Paper • 2605.22538 • Published 14 days ago • 6

upvoted a paper about 1 month ago

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

Paper • 2604.24625 • Published Apr 27 • 26

upvoted a collection about 2 months ago

Gemma 4

Collection

15 items • Updated about 13 hours ago • 889

upvoted a paper about 2 months ago

TAIHRI: Task-Aware 3D Human Keypoints Localization for Close-Range Human-Robot Interaction

Paper • 2604.08921 • Published Apr 10 • 2

upvoted a paper 2 months ago

FCL-COD: Weakly Supervised Camouflaged Object Detection with Frequency-aware and Contrastive Learning

Paper • 2603.22969 • Published Mar 24 • 10

upvoted a paper 3 months ago

OmniGAIA: Towards Native Omni-Modal AI Agents

Paper • 2602.22897 • Published Feb 26 • 53

upvoted a paper 4 months ago

Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings

Paper • 2602.13823 • Published Feb 14 • 9

upvoted 2 papers 5 months ago

GARDO: Reinforcing Diffusion Models without Reward Hacking

Paper • 2512.24138 • Published Dec 30, 2025 • 30

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published Dec 23, 2025 • 95

upvoted 3 papers 6 months ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published Dec 18, 2025 • 174

KlingAvatar 2.0 Technical Report

Paper • 2512.13313 • Published Dec 15, 2025 • 44

VG-Refiner: Towards Tool-Refined Referring Grounded Reasoning via Agentic Reinforcement Learning

Paper • 2512.06373 • Published Dec 6, 2025 • 9

upvoted a paper 8 months ago

Detect Anything via Next Point Prediction

Paper • 2510.12798 • Published Oct 14, 2025 • 53

Voyage_Wang

AI & ML interests

Recent Activity

Organizations

VoyageWang's activity