DONGRYEOLLEE

drlee1

DONGRYEOLLEE1

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Improved Large Language Diffusion Models

upvoted a paper 5 days ago

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

liked a model 7 days ago

LiquidAI/LFM2.5-Embedding-350M

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Improved Large Language Diffusion Models

Paper • 2606.25331 • Published 5 days ago • 41

upvoted a paper 5 days ago

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

Paper • 2606.20945 • Published 11 days ago • 75

liked a model 7 days ago

LiquidAI/LFM2.5-Embedding-350M

liked a dataset 10 days ago

lordx64/agentic-distill-fable-5-sft

Viewer • Updated 14 days ago • 4.66k • 1.19k • 48

liked a model 10 days ago

WeiboAI/VibeThinker-3B

Text Generation • 3B • Updated 9 days ago • 59.3k • • 743

upvoted a paper 11 days ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 13 days ago • 76

upvoted a paper 13 days ago

FastContext: Training Efficient Repository Explorer for Coding Agents

Paper • 2606.14066 • Published 17 days ago • 93

upvoted 2 papers 14 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 18 days ago • 92

MiniMax Sparse Attention

Paper • 2606.13392 • Published 18 days ago • 148

liked a model 14 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated Mar 15 • 1.06M • • 780

liked a model 17 days ago

jinaai/jina-embeddings-v5-text-small

Feature Extraction • 0.6B • Updated Apr 15 • 368k • 183

upvoted 2 papers 18 days ago

MoBA: Mixture of Block Attention for Long-Context LLMs

Paper • 2502.13189 • Published Feb 18, 2025 • 19

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 21 days ago • 54

liked a dataset 19 days ago

m-a-p/CodeFeedback-Filtered-Instruction

Viewer • Updated Feb 26, 2024 • 157k • 18.8k • 204

liked a model 20 days ago

ny1031/Qwen3-1.7B-SFT-RLVR-IF

Text Generation • 2B • Updated May 6 • 6 • 1

liked a dataset 20 days ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 17.8k • 251

upvoted 2 papers 24 days ago

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Paper • 2605.30039 • Published May 29 • 20

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published 28 days ago • 235

upvoted a paper 27 days ago

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Paper • 2606.02404 • Published 28 days ago • 59

liked a model 28 days ago

Qwen/Qwen3.5-2B

Image-Text-to-Text • 2B • Updated Mar 2 • 1.69M • • 320

DONGRYEOLLEE

AI & ML interests

Recent Activity

Organizations

drlee1's activity