1 4 7

Taddeus Buica PRO

taddeusb90

AI & ML interests

AI experimentation

Recent Activity

updated a model 2 months ago

apolocloud/DeepSeek-R1-Distill-Llama-70B-improved

published a model 2 months ago

apolocloud/DeepSeek-R1-Distill-Llama-70B-improved

liked a model 4 months ago

sapientinc/HRM-checkpoint-ARC-2

View all activity

Organizations

updated a model 2 months ago

apolocloud/DeepSeek-R1-Distill-Llama-70B-improved

1.32M • Updated Oct 28, 2025 • 5

published a model 2 months ago

apolocloud/DeepSeek-R1-Distill-Llama-70B-improved

1.32M • Updated Oct 28, 2025 • 5

liked a model 4 months ago

sapientinc/HRM-checkpoint-ARC-2

Updated Jul 21, 2025 • 64

updated a model 4 months ago

taddeusb90/DeepSeek-R1-Distill-Llama-70B-improved

1.32M • Updated Sep 9, 2025 • 6

published a model 4 months ago

taddeusb90/DeepSeek-R1-Distill-Llama-70B-improved

1.32M • Updated Sep 9, 2025 • 6

liked a model 5 months ago

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • 685B • Updated Aug 26, 2025 • 4.95k • 1k

upvoted a collection 5 months ago

DeepSeek-V3.1

Collection

4 items • Updated Nov 27, 2025 • 257

liked a Space 5 months ago

The Ultra-Scale Playbook

🌌

3.62k

The ultimate guide to training LLM on large GPU Clusters

published 2 models 5 months ago

taddeusb90/large-training-job-phase5-full-cluster

Updated Aug 12, 2025

taddeusb90/DeepSeek-R1-Distill-Qwen-14B-improved

841k • Updated Aug 7, 2025 • 4

updated a model 5 months ago

taddeusb90/DeepSeek-R1-Distill-Qwen-14B-improved

841k • Updated Aug 7, 2025 • 4

published a model 10 months ago

apolo-mind/l3.1-8b-inst-fft-induction-barc-heavy-200k-old-200k-lr1e-5-ep3-test

Updated Mar 24, 2025

updated a model 10 months ago

apolo-mind/engineer-heavy-500k-barc-llama3.1-8b-ins-fft-transduction_lr1e-5_epoch3

Text Generation • 8B • Updated Mar 18, 2025 • 17

published a model 10 months ago

apolo-mind/engineer-heavy-500k-barc-llama3.1-8b-ins-fft-transduction_lr1e-5_epoch3

Text Generation • 8B • Updated Mar 18, 2025 • 17

updated a model 10 months ago

apolo-mind/l3.1-8b-inst-fft-induction-barc-heavy-200k-old-200k-lr1e-5-ep3

Text Generation • 8B • Updated Mar 15, 2025 • 6

published a model 10 months ago

apolo-mind/l3.1-8b-inst-fft-induction-barc-heavy-200k-old-200k-lr1e-5-ep3

Text Generation • 8B • Updated Mar 15, 2025 • 6

updated a model 10 months ago

taddeusb90/Llama-3.2-3B-reasoner

Text Generation • 3B • Updated Feb 24, 2025 • 6

published a model 10 months ago

taddeusb90/Llama-3.2-3B-reasoner

Text Generation • 3B • Updated Feb 24, 2025 • 6

liked a model 12 months ago

mitkox/OwnYourAI

Updated Jun 22, 2024 • 66

upvoted a paper about 1 year ago

Large Language Models Can Self-Improve At Web Agent Tasks

Paper • 2405.20309 • Published May 30, 2024 • 2

Taddeus Buica PRO

AI & ML interests

Recent Activity

Organizations

taddeusb90's activity

The Ultra-Scale Playbook