Answer:
I would say high. The data sets are completely identical besides column 17
7.8m questions
10.5m answers