| 模型 | 模型答案 | 正确答案 |
|---|---|---|
| 金声玉亮2.0 | measure_id | measure_id |
| deepseek-r1 | measure_target |
measure_id |
| gpto1 | measure_id | measure_id |
| gpt-4o-2024-08-06 | measure_id | measure_id |
| DeepSeek-Coder-V2-Lite-Instruct | measure_value_type | measure_id |
| Meta-Llama-3.1-405B-Instruct-Turbo | measure_value | measure_id |
| Meta-Llama-3.1-70B-Instruct-Turbo | id | measure_id |
| Qwen2-72B-Instruct | 'measure_value' | measure_id |
| baichuan4 | 'measure_value' | measure_id |
| doubao | 'id' | measure_id |
| gpto1 | measure_id | measure_id |
| kimi | measure_value | measure_id |
| minimax | measure_value | measure_id |
| wenxinyiyan | measure_value | measure_id |
| zhipu | measure_value | measure_id |
| deepseek-v3 | measure_value | measure_id |