Question: about the ceval dataset and adding new datasets similar to ceval #115

Answered by Leymore
niexufei asked this question in Q&A
  1. formal_ceval is the released version of C-Eval, identical to the data downloaded from the official C-Eval site. release_ceval, on the other hand, is legacy data and should be deleted. The configuration in the glm folder is out of date and buggy; please use formal_ceval in all scenarios.
  2. dev provides the few-shot examples. The scores reported on the OpenCompass official site are 5-shot results on val, with the shots taken from dev. test is used only for inference. Because the answers to test are not released, any attempt to evaluate on test will score 0.
  3. Any dataset with qa in its name is probably a question-answering dataset. You may take TriviaQA and NaturalQuestions as examples.
  4. Thank you for your suggestion on the discourse thing. We …
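
To make the split roles in point 2 concrete, here is a minimal sketch in plain Python. It does not use the OpenCompass API; the rows, column names, and the helpers `format_row`, `build_prompt`, and `accuracy` are all hypothetical and only illustrate the idea: shots come from dev, scoring happens on val, and test can only produce predictions.

```python
# Hypothetical toy rows; the real C-Eval files are larger and may use other columns.
dev = [
    {"question": "1 + 1 = ?", "A": "1", "B": "2", "C": "3", "D": "4", "answer": "B"},
    {"question": "2 * 3 = ?", "A": "6", "B": "5", "C": "8", "D": "9", "answer": "A"},
]
val = [
    {"question": "5 - 2 = ?", "A": "1", "B": "2", "C": "3", "D": "4", "answer": "C"},
]
test = [
    # No "answer" key: the test labels are not released.
    {"question": "4 / 2 = ?", "A": "2", "B": "3", "C": "4", "D": "8"},
]


def format_row(row, with_answer):
    """Render one multiple-choice question, optionally followed by its answer."""
    lines = [row["question"]] + [f"{key}. {row[key]}" for key in "ABCD"]
    lines.append("Answer: " + (row["answer"] if with_answer else ""))
    return "\n".join(lines)


def build_prompt(shots, target):
    """Few-shot prompt: solved examples from dev, then the unsolved target."""
    parts = [format_row(shot, True) for shot in shots]
    parts.append(format_row(target, False))
    return "\n\n".join(parts)


def accuracy(predictions, rows):
    """Accuracy in percent, or None when the split has no gold answers."""
    if any("answer" not in row for row in rows):
        return None
    correct = sum(pred == row["answer"] for pred, row in zip(predictions, rows))
    return 100.0 * correct / len(rows)


prompt = build_prompt(dev, val[0])   # shots from dev, question from val
val_score = accuracy(["C"], val)     # val has answers, so it can be scored
test_score = accuracy(["A"], test)   # test has none, so there is no score
```

This sketch returns `None` for a split without answers instead of a misleading 0; that is a choice made for the illustration and not a description of how OpenCompass behaves.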

Answer selected by niexufei
This discussion was converted from issue #110 on July 27, 2023 09:44.