Answer:
I would say high. The data sets are completely identical besides column 17
1.6m questions
2.0m answers