Answer:
It's D
Step-by-step explanation:
A: Is wrong due to the fact that longer time spans give more credibility
B: Multiple group testing is needed
C: C makes it better because multiple groups would create more diversity than just 2 groups, but this is unnecesary
D: We need a control group