183k views
4 votes
Which of the following statements is not true about outliers?

1.Using a scatter plot to visualize the relationship between distance and fare would not help us to identify outliers
2.Extreme values of fare like 300 dollars could be considered outliers and we can consider removing them
3.If distance was zero but fare was very high, we could consider removing these because they don't make sense in the context of this problem
4.Extreme values of distance like 70 miles could be considered outliers and we can consider removing them

User Tayeb
by
7.5k points

1 Answer

6 votes

Final answer:

Outliers in data are significantly different observations from the rest of the data. The first, third, and fourth statements are false, while the second statement is true.

Step-by-step explanation:

An outlier is an observation in a data set that is significantly different from other observations. Let's analyze each statement:

  1. The first statement is false. Using a scatter plot can help us visualize the relationship between distance and fare and identify outliers. Outliers may appear as data points that are located far from the general trend of the data.
  2. The second statement is true. Extreme values of fare, such as 300 dollars, could be considered outliers and might be removed from the data set if they are significantly different from the other fares.
  3. The third statement is true. If the distance was zero but the fare was very high, it would be reasonable to consider removing this data point as it doesn't make sense in the context of the problem.
  4. The fourth statement is false. Extreme values of distance, like 70 miles, should not automatically be considered outliers. It depends on the specific scenario and whether these values are significantly different from the rest of the data.

User Jcvandan
by
7.2k points