A Comprehensive Study on Power of Tests for Normality (pdf)

Article PDF cannot be displayed. You can download it here:

https://download.atlantis-press.com/article/125905578.pdf

A Comprehensive Study on Power of Tests for Normality

“JSTA-17-4-7_print” — 2018/12/21 — 12:53 — page 647 — #1 Journal of Statistical Theory and Applications Vol. 17(4), December 2018, pp. 647–660 DOI: 10.2991/jsta.2018.17.4.7; ISSN 1538-7887 https://www.atlantis-press.com/journals/jsta A Comprehensive Study on Power of Tests for Normality Hadi Alizadeh Noughabi1,* 1 Department of Statistics, University of Birjand, Birjand, Iran ARTICLE INFO ABSTRACT Article History Many statistical procedures assume that the underling distribution is normal. In this paper, we consider the popular and powerful tests for normality and investigate the power values of these tests to detect deviations from normality. The family of fourparameter generalized lambda distributions (FMKL) for its high flexibility is considered as alternative distributions. We then compare the power values of normality tests against these alternatives and for different sample sizes. The considered tests are Kolmogorov-Smirnov, Anderson-Darling, Kuiper, Jarque-Bera, Cramer von Mises, Shapiro-Wilk and Vasicek. These tests are popular tests which are commonly used in practice and statistical software. The tests are described and then power values of the tests are compared against FMKL family by Monte Carlo simulation. The results are discussed and interpreted. Finally, we apply some real data examples to show the behavior of the tests in practice. Received 14 Oct 2017 Accepted 17 May 2018 Keywords Test of normality Monte Carlo simulation Power of test The generalized lambda distribution 2000 Mathematics Subject Classification 62G10, 62P20, 62P30 © 2018 The Authors. Published by Atlantis Press SARL. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/). 1. INTRODUCTION Ramberg and Schmeiser [1] introduced the four-parameter generalized lambda distribution (GLD) as Q (u) = 𝜆1 + u𝜆3 − (1 − u)𝜆4 , 𝜆2 where Q(u) is the quantile function, 0 ≤ u ≤, 𝜆1 , 𝜆2 are the location and scale parameters, and 𝜆3 , 𝜆4 are shape parameters jointly related to the strengths of the lower and upper tails, respectively. For its high flexibility it is used in many fields such as modeling financial data. Because of the limitations of the Ramberg and Schmeiser (RS) parameterizations, Freimer et al. [2] proposed a new parameterization called FMKL as Q (u) = 𝜆1 + 1 u𝜆3 − 1 (1 − u)𝜆4 − 1 − , ) 𝜆2 ( 𝜆3 𝜆4 where 0 ≤ u ≤ 1, 𝜆1 , 𝜆2 are the location and scale parameters, respectively. Also 𝜆3 and 𝜆4 determine the shape characteristics and for a symmetric distribution 𝜆3 = 𝜆4 . The five different shapes of the FMKL are: unimodal, U-shaped, J-shaped, S-shaped, and monotone, which may be symmetric and asymmetric with smooth, abrupt, truncated, long, medium or short tails. In many situations, a goodness of fit test about the distribution of the population using observations is necessary. Since the normal distribution is widely used in many statistical procedures and also is the most fundamental distribution, test for the normal hypothesis is indispensable. Moreover, testing normality is one of the most areas of statistical research. For example in statistical modeling the normal assumption of the underlying error distribution must be checked. Therefore, many tests for normality are proposed by authors. A fair of normality tests can be found in the statistical literature. In many situations, a goodness of fit test about the distribution of the population is necessary. Since the normal distribution is the most basic distribution and use widely in statistics, test for the normal hypothesis has been studied by many authors. See for example, D’Agostino and Stephens [3], Huber-Carol et al. [4], Thode [5]. Recently, testing normality has been considered by Alizadeh Noughabi and Arghami [6], Harri and Coble [7], Sanqui et al. [8], Zamanzade and Arghami [9], Marmolejo-Ramos and González-Burgos [10], Joenssen and Vogel [11] and Wang [12]. *Corresponding author. Email: Pdf_Folio:61 “JSTA-17-4-7_print” — 2018/12/21 — 12:53 — page 648 — #2 648 Hadi Alizadeh Noughabi / Journal of Statistical Theory and Applications 17(4) 647–660 In this article, we consider seven popular (like Kolmogorov-Smirnov, Anderson-Darling) and powerfulness (like Shapiro-Wilk, Vasicek) normality tests and compare power values of these tests against the GLD (FMKL) with different parameters. We show that no single test procedure is uniformly more powerful than others. However, the powerful tests can be determined based on type of alternatives. Thus, tests for normality based on type of alternatives are classified. The methodologies of the considered tests are presented in Section 2. Power values of the tests are compared with each other against the FMKL family by Monte Carlo simulation in Section 3. In Section 4, the applicability of the tests in practice is shown by real data. Finally, some conclusions are given in Section 5. 2. TESTS FOR NORMALITY Given a random sample X1 , ..., Xn from a continuous probability distribution F with a density f(x), over the real line and with mean 𝜇 and variance 𝜎2 < ∞, the hypothesis of interest is x−𝜇 2 1 , for some (𝜇, 𝜎) ∈ Θ , x ∈ ℝ H0 ∶ f (x) = f0 (x; 𝜇, 𝜎) = exp {− 12 ( 𝜎 )} √2𝜋𝜎 where 𝜇 and 𝜎 are unspecified and Θ = ℝ × ℝ+ . The alternative to H0 is H1 ∶ f (x) ≠ f0 (x; 𝜇, 𝜎) for any (𝜇, 𝜎) ∈ Θ . In this section, we consider seven popular tests for the above hypothesis. The considered tests are Cramer von Mises [13], KolmogorovSmirnov [14], Anderson-Darling [15], Kuiper [16], Shapiro-Wilk [17], Vasicek [18] and Jarque-Bera [19]. These tests are commonly used in practice and software. For example, the Shapiro-Wilks test is used in SAS software for testing normality. The description of each normality tests is presented in Table 1. From the aforementioned tests, Vasicek’s test, Shapiro-Wilk and Jarque-Bera test are specific in the sense that the null hypothesis is normal, while the rest are suitable for any null family of distributions. For further study about this tests, see D’Agostino and Stephens [3] and references there in. 3. SIMULATION STUDY In this section, Type I error of the tests are obtained and then power values of the tests against flexible FMKL family are computed through a Monte Carlo simulation. Table 1 Tests of normality. Test of normality Test statistic Notations n 1- Cramer von Mises 2- Kolmogorov-Smirnov CH = D+ = max 2 1 2i − 1 + − Zi ; i = 1, ..., n. ( 2n ) 12n ∑ i=1 i i−1 − Zi , D− = max Zi − , i = 1, ..., n {n } { n } D = max(D+ , D− ) V = D+ + D− 3- Kuiper X(i) − X : where Φ is the ( SX ) cdf of standard normal distribution. X(i) − X Zi = Φ : where Φ is the ( SX ) cdf of standard normal distribution. D+ and D− are as above. Zi = Φ [n/2] (∑ i=1 W= 4- Shapiro-Wilk a(n−i+1) (X(n−i+1) − X(i) )2 ) n 2 (X(i) − X) ∑ The coefficients ai are tabulated in Pearson and Hartley [20] i−1 n (2i − 1) {ln(Zi ) + ln(1 − Zn−i+1 )} ∑ 5- Anderson-Darling 6- Vasicek i=1 2 A = −n − KLmn = n exp(H(n, (...truncated)