| 模型 | 模型答案 | 正确答案 |
|---|---|---|
| 金声玉亮2.0 | ```python {'question_type': 'question_type', 'static_or_dynamic': 'static_or_dynamic', 'answer': 'answer'} ``` |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| deepseek-r1 | { "interaction_id": "eJJm7lex974", "question_type": "PjOW3vib37M", "static_or_dynamic": "8b3ewM26+SI", "page_name": "N63uV44/QbQ", "query": "AUUii56u8tg", "answer": "AUUii56u8tg" } |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| gpto1 | ```python { 'interaction_id': 'eJJm7lex974', 'query_time': '31Z18wvwUiM', 'question_type': 'PjOW3vib37M', 'alternative_answers': 'aONjSdwYYDk', 'page_name': 'N63uV44/QbQ', 'answer': 'AUUii56u8tg' } ``` |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| gpt-4o-2024-08-06 | ```python { "interaction_id": None, "query_time": None, "domain": None, "question_type": "question_type", "static_or_dynamic": None, "query": None, "answer": None, "alternative_answers": None, "split": None, "page_name": None } ``` |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| DeepSeek-Coder-V2-Lite-Instruct | ```python { "interaction_id": "aONjSdwYYDk", "query_time": "PjOW3vib37M", "domain": "N63uV44/QbQ", "question_type": "31Z18wvwUiM", "static_or_dynamic": "eJJm7lex974", "query": "V9rPaOdeODk", "answer": "8b3ewM26+SI", "alternative_answers": "AUUii56u8tg" } ``` |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| Meta-Llama-3.1-405B-Instruct-Turbo | {'question_type': 'question_type', 'static_or_dynamic': 'static_or_dynamic', 'answer': 'answer'} | {"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| Meta-Llama-3.1-70B-Instruct-Turbo | {'interaction_id': 'aONjSdwYYDk', 'query_time': 'PjOW3vib37M', 'domain': 'N63uV44/QbQ', 'question_type': '31Z18wvwUiM', 'static_or_dynamic': 'eJJm7lex974', 'query': 'V9rPaOdeODk', 'answer': '8b3ewM26+SI', 'alternative_answers': 'AUUii56u8tg', 'split': 'PjOW3vib37M', 'page_name': 'N63uV44/QbQ'} | {"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| Qwen2-72B-Instruct | ```python {'interaction_id': 'aONjSdwYYDk', 'query_time': '', 'domain': '', 'question_type': 'PjOW3vib37M', 'static_or_dynamic': 'N63uV44/QbQ', 'query': '31Z18wvwUiM', 'answer': 'eJJm7lex974', 'alternative_answers': 'V9rPaOdeODk', 'split': '8b3ewM26+SI', 'page_name': 'AUUii56u8tg'} ``` However, this mapping does not hold up because not every column in table A can be mapped to a unique column in table B based on the provided data. Here's a more accurate mapping for the columns that can be matched: ```python {'interaction_id': 'aONjSdwYYDk', 'question_type': 'PjOW3vib37M', 'static_or_dynamic': 'N63uV44/QbQ', 'page_name': 'AUUii56u8tg'} ``` But since the task requires each column in A to map to 0 or 1 columns in B, and given the lack of clear correspondence for most columns, a complete and valid mapping cannot be created. Therefore, I will provide a mapping for the columns that have some form of correspondence: ```python { 'interaction_id': 'aONjSdwYYDk', 'question_type': 'PjOW3vib37M', 'static_or_dynamic': 'N63uV44/QbQ', 'page_name': 'AUUii56u8tg' } ``` |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| baichuan4 | { "interaction_id": "aONjSdwYYDk", "query_time": "PjOW3vib37M", "domain": "N63uV44/QbQ", "question_type": "31Z18wvwUiM", "static_or_dynamic": "eJJm7lex974", "query": "V9rPaOdeODk", "answer": "8b3ewM26+SI", "alternative_answers": "AUUii56u8tg" } |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| doubao | { "domain": "N63uV44/QbQ" } |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| gpto1 | ```python { 'interaction_id': 'eJJm7lex974', 'query_time': '31Z18wvwUiM', 'question_type': 'PjOW3vib37M', 'alternative_answers': 'aONjSdwYYDk', 'page_name': 'N63uV44/QbQ', 'answer': 'AUUii56u8tg' } ``` |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| kimi | { "interaction_id": "aONjSdwYYDk", "query_time": "2024-04-23T05:00:01.", "domain": "multi-hop", "question_type": "false_premise", "static_or_dynamic": "multi-hop", "query": "invalid question", "answer": "No", "alternative_answers": [], "split": "1cba1106-7e25-4777-8", "page_name": "Wadishewadi Dam - Wikip" } |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| minimax | ```python { "interaction_id": "aONjSdwYYDk", "query_time": "31Z18wvwUiM", "domain": "N63uV44/QbQ", "question_type": "eJJm7lex974", "static_or_dynamic": "V9rPaOdeODk", "query": "8b3ewM26+SI", "answer": "AUUii56u8tg", "alternative_answers": [], "split": [], "page_name": [] } ``` |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| wenxinyiyan | {'question_type': 'PjOW3vib37M', 'domain': '31Z18wvwUiM', 'split': 'eJJm7lex974'} | {"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| zhipu | ```python { "domain": "PjOW3vib37M", "question_type": "N63uV44/QbQ", "static_or_dynamic": "31Z18wvwUiM", "answer": "8b3ewM26+SI", "split": "AUUii56u8tg" } ``` |
{"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |
| deepseek-v3 | {"interaction_id": "aONjSdwYYDk", "query_time": "PjOW3vib37M", "domain": "N63uV44/QbQ", "question_type": "31Z18wvwUiM", "static_or_dynamic": "eJJm7lex974", "query": "V9rPaOdeODk", "answer": "8b3ewM26+SI", "alternative_answers": "AUUii56u8tg"} | {"question_type": "PjOW3vib37M", "interaction_id": "eJJm7lex974", "page_name": "N63uV44/QbQ", "answer": "AUUii56u8tg", "alternative_answers": "aONjSdwYYDk"} |