Yuanxin Liu

lyx97

https://llyx97.github.io/

llyx97

Recent Activity

upvoted a paper 4 days ago

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

Paper • 2512.17220 • Published 15 days ago • 97

upvoted a paper about 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 210

upvoted 4 papers 2 months ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 82

TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos

Paper • 2504.17343 • Published Apr 24, 2025 • 13

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

Paper • 2505.22613 • Published May 28, 2025 • 9

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27, 2025 • 58

authored a paper 2 months ago

Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence

Paper • 2510.20470 • Published Oct 23, 2025 • 11

liked a dataset 2 months ago

marinero4972/Open-o3-Video

Preview • Updated Nov 11, 2025 • 155 • 6

upvoted 2 papers 2 months ago

Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence

Paper • 2510.20470 • Published Oct 23, 2025 • 11

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 55

liked 3 models 2 months ago

liked a dataset 3 months ago

lyx97/UVE-Bench

Viewer • Updated Oct 10, 2025 • 1.88k • 75 • 1

authored 4 papers 3 months ago

TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos

Paper • 2504.17343 • Published Apr 24, 2025 • 13

TEMPLE: Temporal Preference Learning of Video LLMs via Difficulty Scheduling and Pre-SFT Alignment

Paper • 2503.16929 • Published Mar 21, 2025

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

Paper • 2505.22613 • Published May 28, 2025 • 9

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

Paper • 2505.23359 • Published May 29, 2025 • 38

upvoted a paper 3 months ago

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

Paper • 2504.13180 • Published Apr 17, 2025 • 19

updated a dataset 3 months ago