ruv/ruvltra-small

Tags: Text Generation, adaptive-learning, kv-cache-compression, flash-attention, speculative-decoding, vector-database

400 MB

1 contributor

History: 15 commits

Add L4 GPU benchmark results (75.4 tok/s)

6a993ec verified 8 days ago

File                      Size          Last commit                                Updated
.gitattributes            1.58 kB       Upload RuvLTRA 0.5B Q4_K_M model           2 months ago
README.md                 4.31 kB       Add L4 GPU benchmark results (75.4 tok/s)  8 days ago
benchmark_results.json    251 Bytes     Calibration: benchmark_results.json        8 days ago
default.turboquant.json   933 Bytes     Calibration: default.turboquant.json       8 days ago
ruvltra-0.5b-q4_k_m.gguf  398 MB (xet)  Upload RuvLTRA 0.5B Q4_K_M model           2 months ago
tokenizer.json            1.84 MB       Upload tokenizer                           2 months ago