Letter to the Editor

CHANCE, May 2009

Susanne Aref, Ray Stefani

Article PDF cannot be displayed. You can download it here:

https://link.springer.com/content/pdf/10.1007%2Fs00144-009-0014-1.pdf

Letter to the Editor

Letters to the Editor Dear Editor, I enjoyed the article “‘We All Survived’ and Other Failings of Risk Perception” by Stephanie R. Land very much. [CHANCE,Volume 21, Issue 4] I vaguely recall seeing the email about all the childhood fun and freedom we oldies (I am from the ’50s) had and simply dismissed it as absurd. It was nice to see the graphs in Figure 3 solidly rejecting the myth about how we ALL survived so well without any new-fangled restrictions on our lives. However, there is a slight problem in the graph in Figure 4 on Page 55. In the graph in the original paper (G. K. Singh and S. M. Yu (1996) U.S. childhood mortality, 1950 through 1993: Trends and socioeconomic differentials. American Journal of Public Health 86(4):505–12) both NON-MVA and Homicide were labeled with a solid line, though the NON-MVA should have been dashed. In the copy in CHANCE, the label for Homicide got “dashed,” leading one to look at the wrong curve. Looking closely at the graph in Figure 4, the homicide rate shows a rather steady increase between 1968 and 1992, even though the other rates in Figure 4 decreased and the overall child death rates between 1930 and 2005 (Figure 3 a and b) decreased. That disturbed me. All other causes of child deaths improved except for the one of killing kids. There here is an o the accompanying graph in Singh and Yu for years five through 14 to one for years zero to four reproduced in Figure 4. That graph shows the same trends for the different causes of death. In the graph, there is also a suicide cause, which behaves much like the homicide cause at half the rate. Both the two homicide rates and the suicide rate in the two graphs in Singh and Yu were increasing between 1968 and 1992. So, not only did kids get killed more, they also became so depressed that they committed suicide at a higher and higher rate. rI wondered what happened after 1992. The data on the Materrc/ nal and Child Health Bureau web page, www.mchb.hrsa.gov/mchirc/ de chusa_04/pages/0436cm.htm, show that the three rates for homicide and suicide have all decreased from the 1992 levels. Adding the 2002 rates to the 1968–1992 rates shows the P&I (pneumonia and influenza) cause to have bottomed out, while all other rates keep decreasing. So it continues to be safer and safer to be born now, ratherr than in the fun, free, good-old days … Susanne Aref Aref Consulting Group LLC Stephanie Land responds: I thank the reader for her interesting observations. Regarding the graph, the unfortunate aspect of the original image is that the legend does not distinguish between the dashed line (non-MVA) and solid (homicides). 6 VOL. 22, NO. 2, 2009 Dear Editor, In CHANCE Volume 21, Issue 4, the articles “War, Enmity, and Statistical Tables” by Brian Clauser and "Fisher and the 5% Level" by Stephen Stigler provided insight into the dysfunctional relationship between R. A. Fisher and Karl Pearson. Hidden within those articles was an equally interesting interaction between William Gosset and Karl Pearson. Gosset had the enviable position of brew master at Guinness Brewery, which objected to him publishing his statistical work under his own name, hence his pseudonym "student." I thought I would use the Excel TDIST command to duplicate the probabilities in Clauser's Figure 1, showing a fragment of Gosset's (Student's) table from "The Probable Error of a Mean," Biometrica, 6(1), published March 1908. Gosset's table is parameterized using z = x/s, where x is the difference from the mean and s is the standard deviation of n observations. I assumed that Gosset used the unbiased s2, found by dividing by n–1 when estimating the variance of n independent observations. To find t, as is common practice today, I divided the square root of the unbiased s2 (multiplied z) by the n. With n–1 degrees of freedom, TDIST did not duplicate Gosset's probabilities. For example, in Table 1 with z=.1 and n = 4, then t would be the 4 times .1 or .2 with n–1 = 3 degrees of freedom. The cumulative probability using TDIST is 0.5729, not 0.5633. Also, with z =.5 and n = 6, then t would be the square root of 6 times .5 or 1.225 with n–1 = 5 degrees of freedom. The cumulative probability from TDIST is 0.8624, not 0.8428. I realized that Gosset must have used the biased s2, found by dividing by n; hence, it was necessary to find t by dividing the square root of the biased s2 (multiplying z) by the n−1. With n–1 degrees of freedom, TDIST duplicated Gosset's probabilities. For n = 4 observations, the values in column one are multiplied by the square root of 3 to get t and using 3 degrees of freedom, we get all the values in column 2. Similarly, for the n = 5 column, the values in column one are multiplied by the square root of 4 to get t and using 4 degrees of freedom, we get all the values in column 3. I downloaded a copy of Gosset's 1908 paper, and indeed, on page 3, the variance s2 was found by dividing by n; but why? The answer is contained in "Student's z, t, and s: What If Gosset Had R?" by Hanley, Julien and Moodie in The American Statistician, 62(1), February 2008. Here is what they wrote: "Gosset defined s2 as the sum of squared deviations divided by n, rather than n–1 (suggested in Airy's textbook) that yields an unbiased estimator of s2–a decision influenced by his professor Karl Pearson. Gosset would have preferred to use n–1: he wrote to a Dublin colleague in May 1907, ‘when you only have quite small numbers I think the formula with the divisor of n–1 we used i better.’ b ’ Even E iin 1912 K ill a is Karll P Pearson—still large sample person—remarked to him that it made little difference whether the sum of squares was divided by n or n–1 ‘because only naughty brewers take n so small that the difference is not the order of the probable error’ (Pearson 1939)." True to his pseudonym, Gosset was the dutiful student to his professor, Karl Pearson. It is noteworthy that "Student" effectively parameterized his own t different from today's practice. Ray Stefani California State University, Long Beach Correction In Volume 21, Issue 3, part of the “Children 2–5 year olds” graph for Figure 6 is missing from the article “Healthy for Life: Accounting for Transcription Errors Using Multiple Imputation— Application to a study of childhood obesity.” CHANCE 7 (...truncated)


This is a preview of a remote PDF: https://link.springer.com/content/pdf/10.1007%2Fs00144-009-0014-1.pdf
Article home page: https://link.springer.com/article/10.1007/s00144-009-0014-1

Susanne Aref, Ray Stefani. Letter to the Editor, CHANCE, 2009, pp. 6-7, Volume 22, Issue 2, DOI: 10.1007/s00144-009-0014-1