8 15 5

Junhyeok Kim

kjunh

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

RefineBench: Evaluating Refinement Capability of Language Models via Checklists

upvoted a paper 2 months ago

Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation

new activity 3 months ago

kjunh/v1g-sample:Enhance dataset card: Add paper, code, project page, abstract, citation, task categories, and tags

View all activity

Organizations

upvoted a paper about 1 month ago

RefineBench: Evaluating Refinement Capability of Language Models via Checklists

Paper • 2511.22173 • Published Nov 27, 2025 • 14

upvoted a paper 2 months ago

Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation

Paper • 2510.19592 • Published Oct 22, 2025 • 12

upvoted an article 5 months ago

Article

Efficient MultiModal Data Pipeline

Jul 8, 2025

•

upvoted 2 papers 6 months ago

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Paper • 2507.07990 • Published Jul 10, 2025 • 45

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30, 2025 • 89

upvoted 4 papers 7 months ago

Language-Image Alignment with Fixed Text Encoders

Paper • 2506.04209 • Published Jun 4, 2025 • 11

Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics

Paper • 2506.00070 • Published May 29, 2025 • 29

Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues

Paper • 2506.00958 • Published Jun 1, 2025 • 20

Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation

Paper • 2505.18842 • Published May 24, 2025 • 36

upvoted a paper 8 months ago

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published May 21, 2025 • 104

upvoted a paper 9 months ago

VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms

Paper • 2503.14427 • Published Mar 18, 2025 • 19

upvoted 2 papers 10 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26, 2025 • 65

EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild

Paper • 2502.14892 • Published Feb 17, 2025 • 6

upvoted a paper 12 months ago

SEAL: Entangled White-box Watermarks on Low-Rank Adaptation

Paper • 2501.09284 • Published Jan 16, 2025 • 10

upvoted a paper about 1 year ago

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Paper • 2410.13232 • Published Oct 17, 2024 • 44

Junhyeok Kim

AI & ML interests

Recent Activity

Organizations

kjunh's activity

Efficient MultiModal Data Pipeline