Talk:Anderson–Darling test

From Wikipedia, the free encyclopedia

Untitled

"Very large sample sizes may reject the assumption of normality with only slight imperfections. But, industrial data with sample sizes of 200 and more, have easily passed the Anderson-Darling test."

This citation comes from text I wrote for the MVPstats help files. MVPstats is a statistical analysis software program. Although this software has evolved, it originally began in 1986 as a simple program to provide computation of the Anderson-Darling test statistic. The citation may be found here: http://mvpprograms.com/help/mvpstats/distributions/NormalityTestingGuidelines

This software was used quite frequently in work we were doing in industrial situations, which is why the reference is to "industrial data." I have personally tested thousands of distributions over the years, and yes, the statement is accurate. Anderson-Darling is one of the more powerful tests for normality. The question one is generally trying to answer in using this or any other test for normality is whether or not the data come from a distribution that can be adequately modeled with a normal distribution. As the citation suggests, with large sample sizes the test may detect slight deviations from the normal/Gaussian distribution even though the model is adequate. And yes, I have also seen much larger sample sizes easily pass the Anderson-Darling test.

Mvpetrovich 17:09, 26 March 2007 (UTC)


According to the Stephens (1974) article cited in the reference section, the actual sample size correction is A^2* = A^2 * (1 + 4/n - 25/n^2) and the 5% statistic for normality is 0.787.

Who is right?

141.211.198.75 (talk) 20:40, 18 February 2008 (UTC) Kayhan Gultekin kayhan aht gmail daht com

I have adjusted the values to agree with one of the later references, which indicates they are taken from later revisions by Stephens. Melcombe (talk) 13:08, 13 February 2009 (UTC)
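For anyone who wants to plug numbers into the correction quoted above, here is a minimal sketch. It assumes the Stephens (1974) form exactly as quoted in this thread, A^2* = A^2 * (1 + 4/n - 25/n^2), with 0.787 as the 5% value. The function name `modified_a2` and the inputs are made up for illustration, and the article's current values follow later revisions, so they may differ.

```python
def modified_a2(a2, n):
    """Sample-size correction as quoted from Stephens (1974) in this thread."""
    return a2 * (1.0 + 4.0 / n - 25.0 / n**2)

# Hypothetical inputs, for illustration only (not from any real data set)
a2 = 0.5
n = 10

a2_star = modified_a2(a2, n)            # 0.5 * (1 + 0.4 - 0.25)
reject_at_5_percent = a2_star > 0.787   # 5% value as quoted above

print(a2_star, reject_at_5_percent)
```

With these made-up inputs the corrected statistic stays below 0.787, so normality would not be rejected at the 5% level under the quoted rule.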

It would probably be a good idea to actually reference the book by Shapiro that is being cited: "How to test for normality and other distributional assumptions". —Preceding unsigned comment added by 24.136.9.55 (talk) 03:20, 2 August 2008 (UTC)

Critical values table

The article is missing the table of critical values. Unfortunately, I don't have access to Stephens (1974). Perhaps someone with access could supply the numbers (assuming it's not a large table)? Many thanks. pgr94 (talk) 14:38, 6 November 2008 (UTC)

The critical value for case 0 at the 1% level does not coincide with the one in the original publication, "A Test of Goodness of Fit" by Anderson and Darling. Has there been any improvement over the years, or is it a typo? An explanation would be nice. — Preceding unsigned comment added by 62.128.6.5 (talk) 15:31, 13 November 2020 (UTC)

Statistic and the integral

The statistic in the sum form and the integral are exactly equal, which can be shown in three pages by computing simple integrals. I see my change making this explicit was reverted; it should be added back. I'm not sure, though, how to go about convincing people other than suggesting they do the exercise themselves :) --Kaba3 (talk) 23:10, 23 February 2011 (UTC)

What you really need to do is to provide a citation for the result in the article itself ... it must have been published somewhere. Melcombe (talk) 17:01, 24 February 2011 (UTC)

@Kaba3 or (Pgr94 ?) You are right - the computing formula for the statistic (wish we had equation numbers) is given in D'Agostino and Stephens on page 101 so there is a citation to show it equals the integral form. I still think it is nicer to introduce the notion of distance first rather than rework the sentence to say "the statistic is given by ..." It helps to know what a statistic measures. I agree the connection should be made explicit. Isn't it possible to make that connection explicit at the computing formula?

(Also found the statistic formula (labeled W-squared) in Anderson-Darling (1954) page 765 equation (2).) Mathstat (talk) 05:57, 26 February 2011 (UTC)
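The equality discussed in this section can also be checked numerically. The sketch below is only an illustration, not the published derivation: it assumes a fully specified standard normal null (mean 0 and standard deviation 1 known, nothing estimated), and all function names are made up here. It compares the computing formula with the integral n * ∫ (F_n(t) - t)^2 / (t (1 - t)) dt on the transformed scale t = F(x), integrating each piece exactly via the partial-fraction antiderivative.

```python
import math
import numpy as np

def a2_sum_form(u):
    """Computing formula, with u_i = F(x_i) under the hypothesised CDF F."""
    u = np.sort(np.asarray(u, dtype=float))
    n = len(u)
    i = np.arange(1, n + 1)
    return -n - np.sum((2 * i - 1) * (np.log(u) + np.log(1.0 - u[::-1]))) / n

def _xlogy(c, y):
    """c * log(y), taken as 0 when c == 0 so that 0 * log(0) causes no trouble."""
    return 0.0 if c == 0.0 else c * math.log(y)

def _antiderivative(c, t):
    # Partial fractions: (c - t)^2 / (t (1 - t)) = -1 + c^2 / t + (1 - c)^2 / (1 - t)
    return -t + _xlogy(c * c, t) - _xlogy((1.0 - c) ** 2, 1.0 - t)

def a2_integral_form(u):
    """n times the integral of (F_n(t) - t)^2 / (t (1 - t)) over [0, 1]."""
    u = np.sort(np.asarray(u, dtype=float))
    n = len(u)
    edges = np.concatenate(([0.0], u, [1.0]))
    total = 0.0
    for k in range(n + 1):
        c = k / n  # the empirical CDF equals k/n between edges[k] and edges[k+1]
        total += _antiderivative(c, edges[k + 1]) - _antiderivative(c, edges[k])
    return n * total

# Example data: 50 standard normal draws, mapped through the standard normal CDF
rng = np.random.default_rng(0)
x = rng.normal(size=50)
u = [0.5 * (1.0 + math.erf(v / math.sqrt(2.0))) for v in x]

print(a2_sum_form(u), a2_integral_form(u))
```

The two printed values should agree up to floating-point rounding, which is consistent with the claim that the sum form and the integral are the same quantity.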

Can someone give a worked example of how to do this?

This page has way too many boring formulas only a math guy would read. I really want some numbers plugged in so I can see what's happening.

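Here's what I did with Python:

import numpy
import scipy.stats

# Draw 10000 samples from a standard normal distribution
x = numpy.random.randn(10000)
# Anderson-Darling test against the normal distribution (the default)
scipy.stats.anderson(x)

This gives the following: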

(0.43368580228707287, array([ 0.576,  0.656,  0.787,  0.918,  1.092]), array([ 15. ,  10. ,   5. ,   2.5,   1. ]))

So I compare the 0.43 (A^2 test statistic) to [ 0.576, 0.656, 0.787, 0.918, 1.092], see that it is lower than all of them, so the hypothesis of normality is not rejected at the 15% significance level. Is this right?

I repeated the above with x=numpy.random.rand(10000) (not a normal distribution) and get the following:

(114.45988296709584, array([ 0.576,  0.656,  0.787,  0.918,  1.092]), array([ 15. ,  10. ,   5. ,   2.5,   1. ]))

So I compare the 114 number to [ 0.576, 0.656, 0.787, 0.918, 1.092], see that it is higher than all of them, and reject the hypothesis of normality at the 1% significance level. Is this right?

Thanks! 128.196.56.50 (talk) 22:10, 9 February 2014 (UTC)
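The comparison described above can be written out as a small helper. This is just a sketch of the decision rule (reject at a given level when the statistic exceeds that level's critical value); the function name `rejected_levels` is made up here, and the numbers are copied from the two outputs posted above.

```python
def rejected_levels(statistic, critical_values, significance_levels):
    """Return the significance levels (in percent) at which normality is rejected."""
    return [level
            for crit, level in zip(critical_values, significance_levels)
            if statistic > crit]

# Critical values and significance levels as printed in the outputs above
critical_values = [0.576, 0.656, 0.787, 0.918, 1.092]
significance_levels = [15.0, 10.0, 5.0, 2.5, 1.0]

normal_case = rejected_levels(0.43368580228707287, critical_values, significance_levels)
uniform_case = rejected_levels(114.45988296709584, critical_values, significance_levels)

print(normal_case)   # → []
print(uniform_case)  # → [15.0, 10.0, 5.0, 2.5, 1.0]
```

This agrees with both readings in the question: for the normal sample, normality is not rejected even at the 15% level, and for the uniform sample it is rejected at every listed level, including 1%.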


In the formula, is it dx or dF(x)? What is n in Eq. 1? What distribution should this statistic follow?

Thanks for the clarification!

How does this mention the single-sample case and the k-sample case but not the two-sample case?

It seems odd to omit it. Indeed the structure of the page looks as if it's been set up specifically to talk about it but then doesn't.

See A. N. Pettitt's "A two-sample Anderson-Darling rank statistic", Biometrika (1976) 63 (1): 161-168.

Glenbarnett (talk) 14:01, 5 September 2016 (UTC)

External links modified

Hello fellow Wikipedians,

I have just modified one external link on Anderson–Darling test. Please take a moment to review my edit. If you have any questions, or need the bot to ignore the links, or the page altogether, please visit this simple FaQ for additional information. I made the following changes:

When you have finished reviewing my changes, please set the checked parameter below to true or failed to let others know (documentation at {{Sourcecheck}}).

This message was posted before February 2018. After February 2018, "External links modified" talk page sections are no longer generated or monitored by InternetArchiveBot. No special action is required regarding these talk page notices, other than regular verification using the archive tool instructions below. Editors have permission to delete these "External links modified" talk page sections if they want to de-clutter talk pages, but see the RfC before doing mass systematic removals. This message is updated dynamically through the template {{source check}} (last update: 18 January 2022).

  • If you have discovered URLs which were erroneously considered dead by the bot, you can report them with this tool.
  • If you found an error with any archives or the URLs themselves, you can fix them with this tool.

Cheers.—InternetArchiveBot 19:33, 12 October 2016 (UTC)