- Joined
- Jul 22, 2010
- Messages
- 961
- Reaction score
- 4

Or would we use the n version?
Chad says n-1, Destroyer says n.
This hella threw me off too but I don't know why it is n-1 because:
Since you are taking the average distance the points are away from the mean which is the definition of standard deviation it would make sense you would included the number of things you averaged it in the first place. I don't understand why they have to subtract one from the number of data points. When you are taking an average of something you divide by the number of things. Can someone explain to me in simplistic terms why they do this instead of just telling its the degrees of freedom?
Note: I do know that there is n for populations and n-1 for data sets. I just don't understand why they subtract one from it.
It has to do with the principles of variable uncertainty. This is the reason that degrees of freedom is typically used for a sample (n-k) because each variable can increase the total standard deviation due to uncertainty. However, when the population is used, there is no additional uncertainty involved because the whole population is tested. This is why only n is used (k would be zero because there are no unknown variables in the whole population). If I remember my stats theory correctly, I think this is how it would go lol 😀
Nice bereno that was good, so K would be a factors the could affect the data set. How do we know its just 1 in a random data set though? There could be a whole set of variables that are unaccounted for and could be offsetting the variance in one way or the other?
Well, the easy way to look at it is that if you are taking one sample looking for the mean, or something similar to that, there is only one variable, so k=1. This is why most people think of degrees of freedom as "n-1" because most samples are inclusive of only 1 variable. However, when you start jumping into two-sample tests, tests of varianve, or ANOVA, or Chi squared, etc, you start running into many variables at once, so you need the appropriate value for k.
When in doubt, just I would assume k=1 for most simple setups (n-1)
Hope this helps
sample standard deviation is calculated around the sample mean
if you're given the sample mean and n-1 of the data points, you could simply calculate the nth data point...sample mean times n, minus all n-1 data points you've been given thus far
basically you use n-1 because after you have a given, set sample mean (which you do the moment you start using it to calculate each data point's deviation from that value) you don't have n independent data points anymore, just n-1 independent data points
ive never done n-1 and always use N and get all the problems right. so i dunno if this n-1 is a must.
Bump on this topic please. Has anyone that has taken the DAT encountered having to calculate/determine the variance or standard deviation on the actual exam?
I'm still a bit unclear on whether the ADA would base these questions off of using N to calculate the standard deviation of the sample set, like how Destroyer/Math Destroyer use N, OR if they would deem it correct to use (N-1) like how Chad teaches for the sample set.
For the purposes of the actual exam, when calculating the standard deviation or variance of a data set, such as {3,4,5,6,7}, should we use N, or N-1 in the denominator for the standard deviation equation?
I recall that Chad teaches to use N-1 in his video, but seems like the questions/answers from the DAT Destroyer and Math Destroyer base the correct answers off of using N (as opposed to N-1). (as stated above)
I understand that technically, one is used for a sample set, and the other is used for a population, but which should we use for the DAT under which conditions?
Any definite insight on this?
Average and variance showed up on mine. I used N to determine the variance. Same thing as math destroyer. I swear the QR part was just a clone of the math destroyer but easier.