关于CodeArena的一些问题 #209

buaali · 2024-12-18T07:50:23Z

您好，我部署了一下仓库中CodeArena的评测基准，在运行eval_arena.sh脚本时遇到了一个问题
我注意到该脚本中有三个步骤分别是

python infer_vllm.py生成output.jsonl文件
python judge_models.py生成output.jsonl.judge文件
python judge_models.py生成output.jsonl.judge.metric文件
前两步正常运行并生成了对应的文件，我在第三步时遇到错误：

Traceback (most recent call last):                                                                                                                               
  File "/home/largemodel/others/CodeArena/judge_models.py", line 268, in <module>                                                                                
    main()                                                                                                                                                       
  File "/home/largemodel/others/CodeArena/judge_models.py", line 253, in main                                                                                    
    score = get_scores(objs, tasktype_to_levels)                                                                                                                 
  File "/home/largemodel/others/CodeArena/judge_models.py", line 227, in get_scores                                                                              
    main_classified_scores, sub_classified_scores = calculate_classified_score(objs, tasktype_to_levels)                                                         
  File "/home/largemodel/others/CodeArena/judge_models.py", line 152, in calculate_classified_score                                                              
    task_type = wash_tag(obj["meta"]["parsed"]["task_type"][0])                                                                                                  
KeyError: 'meta'

我在调试过程中发现obj是一个字典，打印其键名如下：
dict_keys(['messages', 'id', 'gpt-4o-2024-05-13_response', 'gpt-4o-2024-08-06_response', 'gpt-4-turbo-2024-04-09_response', 'difficulty', 'level', 'programming_l anguage', 'gpt-4-turbo-2024-04-09_response_len', 'input', 'model', 'response', 'question', 'games', 'if_win', 'if_tie'])
确实不存在meta这个键，请问这个键应该是在哪个部分生成的呢，存储的貌似是代码对应的类型信息吗

The text was updated successfully, but these errors were encountered:

CSJianYang · 2024-12-18T08:01:51Z

We have updated the CodeArena file and included the "meta" key in the updated version of CodeArena. You can redownload the test file from "https://huggingface.co/datasets/CSJianYang/CodeArena"

buaali · 2024-12-18T09:12:13Z

已解决，谢谢~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

关于CodeArena的一些问题 #209

关于CodeArena的一些问题 #209

buaali commented Dec 18, 2024

CSJianYang commented Dec 18, 2024

buaali commented Dec 18, 2024

关于CodeArena的一些问题 #209

关于CodeArena的一些问题 #209

Comments

buaali commented Dec 18, 2024

CSJianYang commented Dec 18, 2024

buaali commented Dec 18, 2024