kuleshov-group/caduceus-ph_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 • Fill-Mask • 471k • Updated Oct 20, 2025 • 13 • 1
kuleshov-group/caduceus-ph_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 • Fill-Mask • 1.93M • Updated Oct 20, 2025 • 47 • 1
kuleshov-group/caduceus-ph_seqlen-131k_d_model-256_n_layer-16 • Fill-Mask • 7.73M • Updated Oct 20, 2025 • 1.24k • 6
kuleshov-group/caduceus-ps_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 • Fill-Mask • 471k • Updated Oct 20, 2025 • 13 • 1
kuleshov-group/caduceus-ps_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 • Fill-Mask • 1.93M • Updated Oct 20, 2025 • 28 • 2
kuleshov-group/caduceus-ps_seqlen-131k_d_model-256_n_layer-16 • Fill-Mask • 7.73M • Updated Oct 20, 2025 • 1.69k • 14
kuleshov-group/bd3lm-owt-block_size1024-pretrain • Text Generation • 0.2B • Updated Mar 18, 2025 • 532 • 1