Qwen models with custom class for bidirectional attention
Joao Coelho
jmvcoelho
AI & ML interests
None yet
Organizations
models
17
jmvcoelho/apm_sft_1.7b_all_positive_asearcher_rlm_cweb_wikipedia_8H100
Updated
jmvcoelho/apm_sft_1.7b_all_positive_afm_taskcraft_only_serper_8H100
Updated
jmvcoelho/apm_sft_1.7b_correct_and_positive_asearcher_rlm_clueweb_8H100
Updated
jmvcoelho/Qwen2.5-0.5B-bidirectional-attn-mntp
0.5B
•
Updated
•
4
jmvcoelho/Qwen2.5-0.5B-bidirectional-attn
0.5B
•
Updated
•
5
jmvcoelho/ad-classifier-v0.2
Text Classification
•
0.2B
•
Updated
•
7
jmvcoelho/ad-classifier-v0.1
Text Classification
•
0.2B
•
Updated
•
3
jmvcoelho/ad-classifier-v0.0
Text Classification
•
0.2B
•
Updated
•
5
jmvcoelho/GPTNeoX-160m
0.2B
•
Updated
•
8
•
1
jmvcoelho/pythia-160m-1024-marco-docs-bow-contrastive-pretrain
Updated
•
4