ruv/ruvltra-small

Text Generation
GGUF
MambaSSM
English
ruvltra
sona
adaptive-learning
quantized
edge-device
embedded
iot
turboquant
kv-cache-compression
flash-attention
speculative-decoding
graph-rag
hybrid-search
vector-database
ruvector
diskann
colbert
imatrix
conversational
ruvltra-small
400 MB
  • 1 contributor
History: 15 commits
Latest commit by ruv: Add L4 GPU benchmark results (75.4 tok/s)
6a993ec verified 8 days ago
  • .gitattributes
    1.58 kB
    Upload RuvLTRA 0.5B Q4_K_M model 2 months ago
  • README.md
    4.31 kB
    Add L4 GPU benchmark results (75.4 tok/s) 8 days ago
  • benchmark_results.json
    251 Bytes
    Calibration: benchmark_results.json 8 days ago
  • default.turboquant.json
    933 Bytes
    Calibration: default.turboquant.json 8 days ago
  • ruvltra-0.5b-q4_k_m.gguf
    398 MB
    xet
    Upload RuvLTRA 0.5B Q4_K_M model 2 months ago
  • tokenizer.json
    1.84 MB
    Upload tokenizer 2 months ago