You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I'm trying to test baichuan-13b-chat on some datasets and I'm wondering if you could provide some information regarding the following:
The mean score of baichuan-13b-chat on each subcategory of CMMLU - STEM, Humanities, Social Sciences, Others, China Specific. (I only found the overall mean on the website.)
which specific configuration file was used to obtain the OCNLI score shown on the website (FewCLUE_ocnli_fc_gen_f97a97 or CLUE_ocnli_gen_c4cb6c? If the former, dev_few_all.json or test_public.json?)
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hi, I'm trying to test baichuan-13b-chat on some datasets and I'm wondering if you could provide some information regarding the following:
Thank you very much for your assistance.
Beta Was this translation helpful? Give feedback.
All reactions