
srivatsa

srivatsa92

devsrivatsa

AI & ML interests

rag, agents, fine-tuning

Recent Activity

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Pro

liked a model 2 months ago

unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF

liked a Space 3 months ago

lm-provers/qed-nano-blogpost


Organizations

upvoted a collection 7 months ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 43

upvoted an article 7 months ago

Article

Let's talk about LLM evaluation

clefourrier • May 23, 2024 • 209

upvoted 2 articles 10 months ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

tngtech • Apr 16, 2025 • 80

Article

Efficient Request Queueing – Optimizing LLM Performance

tngtech • Apr 2, 2025 • 26

upvoted an article about 1 year ago

Article

The Transformers Library: standardizing model definitions

lysandre, ArthurZ, pcuenq, julien-c • May 15, 2025 • 123

upvoted 3 articles over 1 year ago

Article

o3-mini & Deepseek-R1

prithivMLmods • Feb 2, 2025 • 24

Article

Fine-tune ModernBERT for RAG with Synthetic Data

sdiazlor • Jan 20, 2025 • 42

Article

Train 400x faster Static Embedding Models with Sentence Transformers

tomaarsen • Jan 15, 2025 • 230

upvoted a collection over 1 year ago

Tiny Series

Collection

Tiny datasets that empower the foundation of Small Language Model! • 14 items • Updated 18 days ago • 44

upvoted an article about 2 years ago

Article

Deploy LLMs with Hugging Face Inference Endpoints

philschmid • Jul 4, 2023 • 17
