Models: 295

Active filters: VLM

NemoStation/Marlin-2B

Video-Text-to-Text • 2B • Updated May 30 • 12.4k • 552

nvidia/NVIDIA-Nemotron-Parse-v1.2

Image-Text-to-Text • 0.9B • Updated May 5 • 248k • 53

nvidia/Eagle2.5-8B

Image-Text-to-Text • 8B • Updated Nov 29, 2025 • 131k • 46

numind/NuMarkdown-8B-Thinking

Image-to-Text • 8B • Updated 29 days ago • 31.8k • 477

hongyuw/bitvla-siglipL-224px-bf16

Image-Text-to-Text • Updated Jun 30, 2025 • 6 • 5

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16

Image-Text-to-Text • 13B • Updated Dec 2, 2025 • 173k • 86

omlab/VLM-FO1-3B-v01

Object Detection • 4B • Updated 16 days ago • 163 • 17

nvidia/NVIDIA-Nemotron-Parse-v1.1

Image-Text-to-Text • 1.0B • Updated May 7 • 609k • 171

Efficient-Large-Model/VILA-13b

Text Generation • 13B • Updated Mar 4, 2024 • 8 • 20

Efficient-Large-Model/VILA-7b

Text Generation • 7B • Updated Mar 4, 2024 • 34 • 27

Efficient-Large-Model/VILA-7b-4bit-awq

Text Generation • Updated Mar 4, 2024 • 18 • 2

Efficient-Large-Model/VILA-13b-4bit-awq

Text Generation • Updated Mar 4, 2024 • 2 • 2

Efficient-Large-Model/VILA-2.7b

Text Generation • 3B • Updated Mar 4, 2024 • 42 • 15

TIGER-Lab/Mantis-bakllava-7b

Image-Text-to-Text • 8B • Updated May 18, 2024 • 8 • 5

TIGER-Lab/Mantis-llava-7b

Image-Text-to-Text • 7B • Updated May 18, 2024 • 11 • 16

Efficient-Large-Model/VILA1.5-3b

Text Generation • Updated Jul 18, 2024 • 21.4k • 35

Efficient-Large-Model/VILA1.5-13b

Text Generation • Updated Jul 18, 2024 • 143 • 5

Efficient-Large-Model/Llama-3-VILA1.5-8B

Text Generation • Updated Aug 16, 2024 • 446 • 37

Efficient-Large-Model/VILA1.5-40b

Text Generation • Updated Jul 18, 2024 • 6 • 17

Efficient-Large-Model/VILA1.5-3b-s2

Text Generation • Updated Jul 18, 2024 • 2 • 2

Efficient-Large-Model/VILA1.5-3b-AWQ

Text Generation • Updated Jul 18, 2024 • 16 • 7

Efficient-Large-Model/VILA1.5-3b-s2-AWQ

Text Generation • Updated Jul 18, 2024 • 2 • 2

Efficient-Large-Model/Llama-3-VILA1.5-8b-AWQ

Text Generation • Updated Jul 18, 2024 • 11 • 7

Efficient-Large-Model/VILA1.5-13b-AWQ

Text Generation • Updated Jul 18, 2024 • 5 • 3

Efficient-Large-Model/VILA1.5-40b-AWQ

Text Generation • Updated Jul 18, 2024 • 4 • 3

RussRobin/SpatialBot-3B-LoRA

Visual Question Answering • Updated Sep 5, 2024 • 4

RussRobin/SpatialBot-3B

Visual Question Answering • 3B • Updated Sep 10, 2024 • 96 • 20

aimagelab/LLaVA_MORE-llama_3_1-8B-finetuning

Image-Text-to-Text • 8B • Updated Aug 2, 2025 • 664 • 11

Ligeng-Zhu/VILA15_3b

Text Generation • Updated Aug 7, 2024 • 4

NVEagle/Eagle-X5-13B-Chat

Image-Text-to-Text • 15B • Updated Sep 16, 2024 • 231 • 28