| 模型 | 模型答案 | 正确答案 |
|---|---|---|
| 金声玉亮2.0 | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| deepseek-r1 | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| gpto1 | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| gpt-4o-2024-08-06 | ``` interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] ``` |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| DeepSeek-Coder-V2-Lite-Instruct | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco the golden compass [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha the golden compass [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget the golden compass [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| Meta-Llama-3.1-405B-Instruct-Turbo | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| Meta-Llama-3.1-70B-Instruct-Turbo | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| Qwen2-72B-Instruct | | interaction_id | query_time | domain | question_type | static_or_dynamic | query | answer | alternative_answers | |----------------|----------------------|----------|------------------------|-------------------|--------------------------------------------|--------------------|---------------------| | ee0b351c-5a00-48e2-9 | 03/19/2024, 23:49:23 | movie | simple | static | in 2008, which movie | the golden compass | [] | | d660a07b-c598-4be9-b | 03/19/2024, 23:34:54 | movie | simple | static | which movie was reco | ratatouille | [] | | 42163b55-9bf6-4412-a | 03/15/2024, 17:05:41 | sports | comparison | static | during the 2022-12 s | true | [] | | 82e66a91-22eb-4806-a | 03/05/2024, 23:19:09 | music | simple_w_condition | static | what is the song that | cold heart | [] | | a91df871-089c-4b91-9 | 03/19/2024, 23:17:23 | movie | simple | static | who directed bridget | beeban kidron | [] | |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| baichuan4 | interaction_id\tquery_time\tdomain\tquestion_type\tstatic_or_dynamic\tquery\tanswer\talternative_answers ee0b351c-5a00-48e2-9\t03/19/2024, 23:49:23\tmovie\tsimple\tstatic\tin 2008, which movie\tthe golden compass\t[] d660a07b-c598-4be9-b\t03/19/2024, 23:34:54\tmovie\tsimple\tstatic\twhich movie was reco\tratatouille\t[] 42163b55-9bf6-4412-a\t03/15/2024, 17:05:41\tsports\tcomparison\tstatic\tduring the 2022-12 s\ttrue\t[] 82e66a91-22eb-4806-a\t03/05/2024, 23:19:09\tmusic\tsimple_w_condition\tstatic\twhat is the song tha\tcold heart\t[] a91df871-089c-4b91-9\t03/19/2024, 23:17:23\tmovie\tsimple\tstatic\twho directed bridget\tbeeban kidron\t[] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| doubao | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| gpto1 | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| kimi | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| minimax | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-0 |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| wenxinyiyan | ``` interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] ``` |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| zhipu | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
| deepseek-v3 | interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |
interaction_id query_time domain question_type static_or_dynamic query answer alternative_answers ee0b351c-5a00-48e2-9 03/19/2024, 23:49:23 movie simple static in 2008, which movie the golden compass [] d660a07b-c598-4be9-b 03/19/2024, 23:34:54 movie simple static which movie was reco ratatouille [] 42163b55-9bf6-4412-a 03/15/2024, 17:05:41 sports comparison static during the 2022-12 s true [] 82e66a91-22eb-4806-a 03/05/2024, 23:19:09 music simple_w_condition static what is the song tha cold heart [] a91df871-089c-4b91-9 03/19/2024, 23:17:23 movie simple static who directed bridget beeban kidron [] |