kuleshov-group/caduceus-ph_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 Fill-Mask • 471k • Updated Oct 20, 2025 • 23 • 1
kuleshov-group/caduceus-ph_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 Fill-Mask • 1.93M • Updated Oct 20, 2025 • 51 • 1
kuleshov-group/caduceus-ph_seqlen-131k_d_model-256_n_layer-16 Fill-Mask • 7.73M • Updated Oct 20, 2025 • 1.35k • 6
kuleshov-group/caduceus-ps_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 Fill-Mask • 471k • Updated Oct 20, 2025 • 35 • 1
kuleshov-group/caduceus-ps_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 Fill-Mask • 1.93M • Updated Oct 20, 2025 • 29 • 2
kuleshov-group/caduceus-ps_seqlen-131k_d_model-256_n_layer-16 Fill-Mask • 7.73M • Updated Oct 20, 2025 • 1.62k • 14
kuleshov-group/bd3lm-owt-block_size1024-pretrain Text Generation • 0.2B • Updated Mar 18, 2025 • 591 • 1