nanoVLM: The simplest repository to train your VLM in pure PyTorch Article • ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb • May 21, 2025 • 257
KV Cache from scratch in nanoVLM Article • ariG23498, kashif, lusxvr, andito, pcuenq • Jun 4, 2025 • 119
Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers Paper • 2506.03065 • Published Jun 3, 2025 • 27
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Paper • 2505.22618 • Published May 28, 2025 • 45
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios Paper • 2505.21333 • Published May 27, 2025 • 38
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published May 20, 2025 • 78
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Article • aamirshakir, tomaarsen, SeanLee97 • Mar 22, 2024 • 132
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 192
🪆 Introduction to Matryoshka Embedding Models Article • tomaarsen, Xenova, osanseviero • Feb 23, 2024 • 207
Words or Vision: Do Vision-Language Models Have Blind Faith in Text? Paper • 2503.02199 • Published Mar 4, 2025 • 8
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20, 2025 • 175
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Paper • 2502.14846 • Published Feb 20, 2025 • 16
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16, 2025 • 169
π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Article • danaaubakirova, Molbap, mshukor, cadene • Feb 4, 2025 • 192