Active filters: modelopt
txn545/Qwen3-Coder-30B-A3B-Instruct-FP4 • 16B • Updated • 9
shanjiaz/gpt-oss-120b-nvfp4-modelopt • 59B • Updated • 316 • 1
shanjiaz/gpt-oss-20b-nvfp4-modelopt • 11B • Updated • 125
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-FP4-QAD • Image-Text-to-Text • 6B • Updated • 97 • 11
baseten-admin/glm-4.6-fp4 • 177B • Updated • 72
baseten-admin/glm-4.6-fp8 • 353B • Updated • 7
baseten-admin/glm-4.6-fp4-mlp • 183B • Updated • 383
shinedays1993/Qwen3-30B-A3B-nvfp4 • 16B • Updated • 3
shinedays1993/Qwen3-32B-nvfp4 • 17B • Updated • 7
Beambutbetter/Deepseek-V2-Lite-16B-NVFP4 • Text Generation • 8B • Updated • 6 • 3
ramblingpolymath/Qwen3-4B-Instruct-2507 • 2B • Updated • 5
literid/Qwen3-Coder-480B-A35B-Instruct_nvfp4_kv_fp8 • 241B • Updated • 8
DevQuasar/DeepSeek-R1-Distill-Llama-8B_nvfp4 • Text Generation • 5B • Updated • 15
DevQuasar/Qwen.Qwen3-4B-Thinking-2507_nvfp4 • Text Generation • 2B • Updated • 4
177B • Updated • 695 • 6
JeiganS/ML2-123B-Magnum-Diamond_fp8 • Text Generation • 123B • Updated • 3
guerilla7/Foundation-Sec-8B-Instruct-NVFP4-quantized • 5B • Updated
jiangchengchengNLP/L3.3-MS-Nevoria-70b-NVFP4-ONLY-MLP • 42B • Updated • 3
jiangchengchengNLP/L3.3-MS-Nevoria-70b-NVFP4_A8 • 36B • Updated • 3
johnnyeric/Huihui-Qwen3-30B-A3B-Instruct-2507-abliterated-fp4 • 16B • Updated • 9
jiangchengchengNLP/L3.3-MS-Nevoria-70b-NVFP4_AWQ • 36B • Updated • 2
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4 • Text Generation • 17B • Updated • 349
mdavidson83/Qwen3-Embedding-4B_nvfp4_hf
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K • Text Generation • 17B • Updated • 29
Image-Text-to-Text • 13B • Updated • 6
gecfdo/Monstral-123B-v2-NVFP4 • Text Generation • 69B • Updated • 3
Text Generation • 7B • Updated • 161
leatan95/Tongyi-DeepResearch-30B-A3B-NVFP4 • 16B • Updated • 5
DataSnake/Wayfarer-12B-NVFP4 • Text Generation • 7B • Updated • 13
DataSnake/Wayfarer-2-12B-NVFP4 • Text Generation • 7B • Updated • 73