Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Mohsen Gholami's picture
1 9 1

Mohsen Gholami

mgholami
·

AI & ML interests

None yet

Recent Activity

authored a paper 17 days ago
From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model
upvoted a paper 18 days ago
From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model
new activity 2 months ago
vbdai/Ego3D-Bench:Mismatch between the expected and actual view names for the Argoverse subset
View all activity

Organizations

Huawei's Vancouver VBDAI Lab's profile picture

authored a paper 17 days ago

From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model

Paper • 2512.05277 • Published 22 days ago • 4
authored 2 papers 3 months ago

CASP: Compression of Large Multimodal Models Based on Attention Sparsity

Paper • 2503.05936 • Published Mar 7 • 2

GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation

Paper • 2403.19754 • Published Mar 28, 2024
authored 2 papers 4 months ago

ETran: Energy-Based Transferability Estimation

Paper • 2308.02027 • Published Aug 3, 2023 • 3

Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes

Paper • 2509.06266 • Published Sep 8 • 11
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs