Qwen models with custom class for bidirectional attention
Joao Coelho
jmvcoelho
AI & ML interests
None yet
Recent Activity
updated
a model
about 1 month ago
jmvcoelho/apm_sft_1.7b_all_positive_asearcher_rlm_cweb_wikipedia_8H100
updated
a model
about 1 month ago
jmvcoelho/apm_sft_1.7b_all_positive_afm_taskcraft_only_serper_8H100
published
a model
about 1 month ago
jmvcoelho/apm_sft_1.7b_all_positive_afm_taskcraft_only_serper_8H100