Answer:
Inter-rater reliability.
Step-by-step explanation:
Based on the scenario being described within the question it can be said that in this situation Bill and Nancy are interested in the measure's Inter-rater reliability. This term focuses on measuring the level extent in which two or more raters/observers/researchers agree on the on the something. Such as Bill and Nancy are doing by checking the consistency of the results to see if many raters agree with one another.