Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shi Liu's picture
3 7 2

Shi Liu

CLLBJ16

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 5 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 125

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21, 2025 • 90
upvoted a paper 6 months ago

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

Paper • 2507.19478 • Published Jul 25, 2025 • 32
upvoted 2 papers 7 months ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23, 2025 • 33

CoMemo: LVLMs Need Image Context with Image Memory

Paper • 2506.06279 • Published Jun 6, 2025 • 8
upvoted a paper 10 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306
upvoted a paper about 1 year ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 87
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs