-
描述该错误1、data/ceval/包含formal_ceval和release_ceval两个文件夹,分别都是什么用途?评测internLM-7b时,使用的是formal_ceval下的数据。使用release_ceval的数据在数据集合配置glm文件夹下,是评测GLM的模型的时候使用release_ceval数据吗? 建议opencompass开个论坛,我们可以到论坛里去交流相关技术问题。 感谢~ 环境信息无。 其他信息No response |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
|
Beta Was this translation helpful? Give feedback.
-
We just activated the Discussion panel for OpenCompass. You are welcome to discuss general topics then. |
Beta Was this translation helpful? Give feedback.
formatl_ceval
is the released version of C-Eval, the same as the one downloaded from C-Eval official .release_ceval
, on the other hand, is legacy data, which should be deleted. Configuration in the glm folder is out-of-date and buggy, please useformatl_ceval
in all scenarios.dev
is used as few shot examples. Scores of 5-shotdev
andval
is reported on the opencompass official site.test
is used only for inference mode. As the answer toval
is not released, the trial to evaluate ontest
will get0
s.qa
in the name is probably a question-answer datasets. You may take TriviaQA & NatualQuestions as examples.