292,218 views
12 votes
12 votes
which of the following regression best fits the data shown below? A. y= -0.66x^2+ 0.79x + 14.75B. y= -3.07x^2 + 0.41x + 15.65C. y= -2.34x^2+ 2.19x + 16.25D. y= -1.96x^2 + 3.46x + 13.75

which of the following regression best fits the data shown below? A. y= -0.66x^2+ 0.79x-example-1
User Miir
by
2.2k points

1 Answer

9 votes
9 votes
Step-by-step explanation

One way to study how well does a function fit a set of data is by looking at the sum of the square differences. Let's assume that (x',y') is a point in the data set and f(x) is a function that fits the data set. Then the square difference for that data point is:


s=(f(x^(\prime))-y^(\prime))^2

The idea here is to find all the square differences between a function and all the data points and then add them. The function with the smallest sum will be the one that best fits the data.

We can start with the function from option A:


f(x)=-0.66x^2+0.79x+14.75

We first need to find the values of f(x) for all the x-values of the data points and then we are going to use these results to find the square differences:


\begin{gathered} f(0)=14.75 \\ f(1)=14.88 \\ f(2)=13.69 \\ f(3)=11.18 \\ f(4)=7.35 \\ f(5)=2.2 \end{gathered}

Then we find the square differences using these six values of f(x) and the y-values of the data points:


\begin{gathered} (f(0)-16)^2=(14.75-16)^2=1.5625 \\ (f(1)-13)^2=(14.88-13)^2=3.5344 \\ (f(2)-13)^2=(13.69-13)^2=0.4761 \\ (f(3)-12)^2=(11.18-12)^2=0.6724 \\ (f(4)-9)^2=(7.35-9)^2=2.7225 \\ (f(5)-1)^2=(2.2-1)^2=1.44 \end{gathered}

Then we add these 5 square differences and we obtain:


S_A=1.5625+3.5344+0.4761+0.6724+2.7225+1.44=10.4079

And this is the number that we must compare.

Then we continue with option B:


f(x)=-3.07x^2+0.41x+15.65

We find the six values of f(x) using the x-values from the data:


\begin{gathered} f(0)=15.65 \\ f(1)=12.99 \\ f(2)=4.19 \\ f(3)=-10.75 \\ f(4)=-31.83 \\ f(5)=-59.05 \end{gathered}

Now the following step would be to find the square differences and then add them. However we don't need to do that, remember that each square difference is a positive number because is given by the square of another value. This means that if we find that one of the square differences is greater than 10.4079 then we can assure that the sum of the square differences of function B is greater than that of the function in A and therefore B is not the best fit and can be discarded.

We can find the square difference related to f(5):


(f(5)-1)^2=(-59.05-1)^2=3606.0025

3606.0025 is far greater than 10.4079 so we can discard option B since the function from A is a better fit.

Then we continue with C:


f(x)=-2.34x^2+2.19x+16.25

We find the 6 values of f(x) as we did before:


\begin{gathered} f(0)=16.25 \\ f(1)=16.1 \\ f(2)=11.27 \\ f(3)=1.76 \\ f(4)=-12.43 \\ f(5)=-31.3 \end{gathered}

Just like we did in option B we can find the square difference of a value that differs a lot from the one in the data point. We can find the square difference for x=5:


(f(5)-1)^2=(-31.3-1)^2=1043.29

Again this number is far greater than the sum of the square differences from option A so the fit in C is worse than that in A and we can discard this option.

Then we continue with the model in option D:


f(x)=-1.96x^2+3.46x+13.75

We find the 6 values of f(x):


\begin{gathered} f(0)=13.75 \\ f(1)=15.25 \\ f(2)=12.83 \\ f(3)=6.49 \\ f(4)=-3.77 \\ f(5)=-17.95 \end{gathered}

Using the same argument as before we can calculate the square difference of a single point. We find that of x=5:


(f(5)-1)^2=(-17.95-1)^2=359.1025

This value is greater than 10.4079 so the function in option A is a better fit than the function in D.

Answer

Then the answer is option A.

User Jay Dangar
by
3.4k points