| Model | Model Answer | Correct Answer |
|---|---|---|
| 金声玉亮2.0 | ```python { "app_no": None, "type": None, "app_date": None, "status": None, "fru_interview_scheduled": "qU8fN4BcOE4", "drug_test": "kT8cHJ58B7E", "wav_course": "4MSYlVBQT9Y", "defensive_driving": "Qiz4gNNSkjU", "driver_exam": "BkPad8F1Zfw", "medical_clearance_form": "8QouQFH8JWo" } ``` | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| deepseek-r1 | {'drug_test': 'kT8cHJ58B7E', 'fru_interview_scheduled': 'qU8fN4BcOE4', 'wav_course': '4MSYlVBQT9Y'} | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| gpto1 | ```python {'fru_interview_scheduled': 'qU8fN4BcOE4'} ``` | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| gpt-4o-2024-08-06 | ```python { "fru_interview_scheduled": "qU8fN4BcOE4", "drug_test": "kT8cHJ58B7E", "wav_course": "kT8cHJ58B7E", "defensive_driving": "kT8cHJ58B7E", "driver_exam": "kT8cHJ58B7E", "medical_clearance_form": "kT8cHJ58B7E" } ``` | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| DeepSeek-Coder-V2-Lite-Instruct | ```python { "type": "kT8cHJ58B7E", "app_date": "LAjKEsrx0pI", "status": "qU8fN4BcOE4", "fru_interview_scheduled": "4MSYlVBQT9Y", "drug_test": "qrA0NE/ugMQ", "wav_course": "8QouQFH8JWo", "defensive_driving": "Qiz4gNNSkjU", "driver_exam": "BkPad8F1Zfw" } ``` | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| Meta-Llama-3.1-405B-Instruct-Turbo | {'fru_interview_scheduled': None, 'drug_test': 'kT8cHJ58B7E', 'wav_course': 'LAjKEsrx0pI', 'defensive_driving': 'qU8fN4BcOE4', 'driver_exam': '4MSYlVBQT9Y', 'medical_clearance_form': 'qrA0NE/ugMQ'} | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| Meta-Llama-3.1-70B-Instruct-Turbo | {'app_no': 'kT8cHJ58B7E', 'type': 'LAjKEsrx0pI', 'app_date': 'qU8fN4BcOE4', 'status': '4MSYlVBQT9Y', 'fru_interview_scheduled': 'qrA0NE/ugMQ', 'drug_test': '8QouQFH8JWo', 'wav_course': 'Qiz4gNNSkjU', 'defensive_driving': 'BkPad8F1Zfw', 'driver_exam': None, 'medical_clearance_form': None} | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| Qwen2-72B-Instruct | {'drug_test': 'kT8cHJ58B7E', 'wav_course': 'LAjKEsrx0pI', 'defensive_driving': 'qU8fN4BcOE4', 'driver_exam': '4MSYlVBQT9Y', 'medical_clearance_form': 'qrA0NE/ugMQ'} | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| baichuan4 | { 'app_no': 'kT8cHJ58B7E', 'type': None, 'app_date': None, 'status': None, 'fru_interview_scheduled': None, 'drug_test': 'LAjKEsrx0pI', 'wav_course': 'qU8fN4BcOE4', 'defensive_driving': '4MSYlVBQT9Y', 'driver_exam': 'qrA0NE/ugMQ', 'medical_clearance_form': '8QouQFH8JWo' } | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| doubao | {"drug_test": "qU8fN4BcOE4", "wav_course": "qU8fN4BcOE4", "defensive_driving": "qU8fN4BcOE4", "driver_exam": "qU8fN4BcOE4", "medical_clearance_form": "qU8fN4BcOE4"} | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| kimi | ```python {'app_no': 'kT8cHJ58B7E', 'type': 'LAjKEsrx0pI', 'app_date': 'qU8fN4BcOE4', 'status': '4MSYlVBQT9Y', 'fru_interview_scheduled': 'qrA0NE/ugMQ', 'drug_test': '8QouQFH8JWo', 'wav_course': 'Qiz4gNNSkjU', 'defensive_driving': 'BkPad8F1Zfw', 'driver_exam': 'Complete', 'medical_clearance_form': '15.6466'} ``` | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| minimax | ```python { 'fru_interview_scheduled': 'Not Applicable', 'drug_test': 'Complete', 'wav_course': 'Complete', 'defensive_driving': 'Complete', 'driver_exam': 'Complete', 'medical_clearance_form': 'Complete' } ``` | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| wenxinyiyan | ```python { 'drug_test': 'kT8cHJ58B7E', 'wav_course': 'LAjKEsrx0pI', 'defensive_driving': 'qU8fN4BcOE4', 'driver_exam': '4MSYlVBQT9Y', 'medical_clearance_form': 'qrA0NE/ugMQ' } ``` | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| zhipu | ```python { "drug_test": "kT8cHJ58B7E", "wav_course": "4MSYlVBQT9Y", "defensive_driving": "8QouQFH8JWo", "driver_exam": "Qiz4gNNSkjU", "medical_clearance_form": "BkPad8F1Zfw" } ``` | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |
| deepseek-v3 | {'drug_test': 'kT8cHJ58B7E', 'wav_course': '4MSYlVBQT9Y', 'defensive_driving': 'qrA0NE/ugMQ', 'fru_interview_scheduled': 'LAjKEsrx0pI'} | {"defensive_driving": "kT8cHJ58B7E", "fru_interview_scheduled": "qU8fN4BcOE4", "wav_course": "4MSYlVBQT9Y"} |