216,580 views
43 votes
43 votes
Calculus-based Physics I, can someone explain this to me?

My apologies for the broadness of my question. I especially don't understand the notation being used here, but I know this is about data collection, specifically standard deviation and standard error. I mostly need help with the data collection of multiple variables, the formulae for standard deviation and standard error make no sense to me.

I could also use some examples.

Calculus-based Physics I, can someone explain this to me? My apologies for the broadness-example-1
User Mujuonly
by
3.2k points

1 Answer

23 votes
23 votes

2: For a sample of data
x_1,x_2,\ldots,x_N, the mean of this sample denoted by
\overline x is the sum of the data divided by the number of data points,


\overline x = \frac{x_1+x_2+\cdots+x_N}N = \displaystyle\frac1N\sum_(i=1)^N x_i

As an example, consider
x_1=-1,
x_2=1, and
x_3=3. Then


\overline x = \frac{-1+1+3}3 = 1

3: Standard deviation is a measure of how dispersed a given data sample is relative to the mean. Consult the plot: for a normal distribution, approximately 68% of it lies within 1 standard deviation of the mean, approx. 95% within 2 standard deviations, and approx. 99.7% within 3 standard deviations.

For instance, if the data is pulled from a normally distributed population with mean 0 and standard deviation 1, if you were to randomly select any data from the population, then 68% of the time it will fall in the range (-1, 1); 95% of the time it will fall within (-2, 2); 99.7% of the time it fall within (-3, 3).

To compute the standard deviation for a sample, for each
x_i in
x_1,x_2,\ldots,x_N, you

• take the difference between
x_i and the mean
\overline x

• square this difference

• sum all the squared differences

• divide the sum by N - 1 (for a sample) or N (for a population)

• take the square root

Here the standard deviation is denoted
\sigma^x_(N-1), which I would read as "the sample standard deviation of the data x" - sample because of the N - 1 subscript.

Continuing with the previous example, we'd have


\sigma^x_(N-1) = \displaystyle \sqrt{(\left(-1-1\right)^2+\left(1-1\right)^2+\left(3-1\right)^2)/(3-1)} = \sqrt4 = 2

4: Not much more to say here, the standard error is basically a measure of how accurate a given estimate is about the population based on the sample data. It's analogous to uncertainty in measuring length with a ruler, for instance.

In our example,


\alpha^x = \frac2{\sqrt3}

5: If x, y, and z are random variables, then I suppose ρ is meant to denote a function of these random variables (so that ρ itself is just another random variable). For instance, you could have ρ = x + 3y - 2z. Then
\overline\rho is the sample mean of ρ.

I'm not entirely sure about the notation
x(\overline x,\sigma^x_(N-1),\alpha^x), but I suspect it's just referring to sample x with mean
\overline x and standard deviation
\sigma^x_(N-1) with standard error
\alpha^x.

ρ is just the differential of ρ, essentially capturing how ρ changes with respect to small changes in x, y, and z. The expression you see here follows from the chain rule for differentiation.

The formula you see for
\sigma^\rho_(N-1) is the sample standard deviation of ρ. Think of ∆ρ as a vector with 3 components. Then
\sigma^\rho_(N-1) is the magnitude of this vector.

Similarly,
\alpha^\rho is the standard error for ρ, and corresponds to the magnitude of the vector whose components are the standard errors of x, y, and z.

In order for these statistics to make sense, each of x, y, and z must be samples of the same number of data. Say we take x as before
(x_1=-1,x_2=1,x_3=3), along with
y_1=0,y_2=4,y_3=-2 and
z_1=-3,z_2=\frac12,z_3=10. Suppose ρ = x + 3y - 2z. Then

• the sample means of y and z :


\overline y = \frac{0+4-2}3 = \frac23 \\\\ \overline z = \frac{-3+\frac12+10}3 = \frac52

• the standard deviations of y and z :


\sigma^y_(N-1) = \sqrt{(\left(0-\frac23\right)^2+\left(4-\frac23\right)^2+\left(2-\frac23\right)^2)/(3-1)} = 2√(\frac73) \approx 3.06\\\\ \sigma^z_(N-1) = \sqrt{(\left(-3-\frac52\right)^2+\left(\frac12-\frac52\right)^2+\left(10-\frac52\right)^2)/(3-1)} = \frac{√(181)}2 \approx 6.73

• the values of ρ :


\rho_1 = x_1+3y_1-2z_1 = -1+2*0-2*(-3) = 5 \\\\ \rho_2 = x_2+3y_2-2z_2 = 1+3*4-2*\frac12=12 \\\\ \rho_3 = x_3+3y_3-2z_3 = 3+3*(-2)-2*10 = -23

• the sample mean of ρ :


\overline\rho = \frac{5+12-23}3 = -2

• by the chain rule,


\Delta\rho = \Delta x+3\Delta y-2\Delta z

so the standard deviation of ρ :


\sigma^\rho_(N-1) = \sqrt{\left(\sigma^x_(N-1)\right)^2 + \left(3\sigma^y_(N-1)\right)^2 + \left(-2\sigma^z_(N-1)\right)^2} \\\\\sigma^\rho_(N-1)= \sqrt{2^2 + 9\left(2√(\frac73)\right)^2 + 4\left(\frac{√(181)}2\right)^2} = \frac12\sqrt{\frac{703}3} \approx 7.65

• the standard errors of y and z :


\alpha^y = (2√(\frac73))/(\sqrt3) = \frac23\sqrt7 \approx 1.76 \\\\ \alpha^z = \frac{\frac{√(181)}2}{\sqrt3} = \frac12\sqrt{\frac{181}3} \approx 3.88

• the standard error of ρ :


\alpha^\rho=√(\left(\alpha^x\right)^2+\left(3\alpha^y\right)^2+\left(-2\alpha^z\right)^2)\\\\\alpha^\rho=\sqrt{\left(\frac2{\sqrt3}\right)^2+9\left(\frac23\sqrt7\right)^2+4\left(\frac12\sqrt{\frac{703}3}\right)^2}=√(269)\approx16.40

User Wilmarie
by
3.1k points