Mohsen Gholami's picture

1 9 1

Mohsen Gholami

mgholami

·

AI & ML interests

None yet

Recent Activity

authored a paper 17 days ago

From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model

upvoted a paper 18 days ago

From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model

new activity 2 months ago

vbdai/Ego3D-Bench:Mismatch between the expected and actual view names for the Argoverse subset

View all activity

Organizations

authored a paper 17 days ago

From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model

Paper • 2512.05277 • Published 22 days ago • 4

authored 2 papers 3 months ago

CASP: Compression of Large Multimodal Models Based on Attention Sparsity

Paper • 2503.05936 • Published Mar 7 • 2

GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation

Paper • 2403.19754 • Published Mar 28, 2024

authored 2 papers 4 months ago

ETran: Energy-Based Transferability Estimation

Paper • 2308.02027 • Published Aug 3, 2023 • 3

Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes

Paper • 2509.06266 • Published Sep 8 • 11