 mersenneforum.org Standard deviation for twin prime data
 Register FAQ Search Today's Posts Mark Forums Read 2012-01-25, 01:16 #1 roger   Oct 2006 22×5×13 Posts Standard deviation for twin prime data I tried to do a standard deviation list to see how the variance changes as does n, but I can't seem to get it right. It's been a long time since I did this high school math. Maybe I've gone wrong somewhere: I'm using the square root of the variance as the standard deviation. Calculating the range of one standard deviation should be simply the average - std.dev, and average + std.dev, right? Unfortunately, I'm getting lots of answers for std.dev that are well above the average, and so giving ranges of a single standard deviation starting well below zero. I've been doing the calculations of the twin prime search twins in groups of 100 to give a somewhat accurate view of the range of variance and how it changes over time in relation to n. Any suggestions? I've doublechecked my steps, and just don't know why the numbers don't make sense.   2012-01-25, 19:23   #2
ccorn

Apr 2010

22×37 Posts I am not involved in the twin prime search and I am not familiar with its data. However...

Quote:
 Originally Posted by roger Unfortunately, I'm getting lots of answers for std.dev that are well above the average, and so giving ranges of a single standard deviation starting well below zero.
This is generally possible, particularly if your data (i. e. the sampled random variable values) have a lower bound (e. g. zero) dictated by their very nature (e. g. positiveness) but no fundamental upper bound.

Of course, this then means that the probability distribution of your dataset cannot be expected to be direction-independent around its mean value, or around any other value for that matter.

In such cases, it might be more meaningful to compute stats on logarithms of the sampled values instead: Choose one reference value to your liking (it need not occur in the dataset), then divide all sampled values by that reference value, then take some logarithm with convenient base B (e.g. B=2, B=e, or B=10) of the resulting ratios. A negative result then just means that the original sampled value is less than the chosen reference value.
Of course, the interpretation changes accordingly: The arithmetic mean of the logarithms is not the logarithm of the arithmetic mean but the logarithm of the geometric mean, and adding/subtracting one standard deviation of the logarithms means multiplying/dividing the original values by some factor (namely by the base-B exponential of the standard deviation).

Last fiddled with by ccorn on 2012-01-25 at 19:43   2012-01-26, 04:45 #3 roger   Oct 2006 1000001002 Posts Thanks, I'll give it a try!  Thread Tools Show Printable Version Email this Page Similar Threads Thread Thread Starter Forum Replies Last Post hydeer Lone Mersenne Hunters 9 2018-04-03 22:54 cuBerBruce Puzzles 3 2014-12-01 18:15 gd_barnes Twin Prime Search 3 2007-10-12 08:30 MooooMoo Twin Prime Search 2 2006-05-11 23:38 jinydu Puzzles 5 2004-01-10 02:12

All times are UTC. The time now is 11:13.

Sat Oct 31 11:13:59 UTC 2020 up 51 days, 8:24, 2 users, load averages: 1.93, 2.23, 2.24