Answer:
I would say high. The data sets are completely identical except for column 17.
7.5m questions
10.1m answers