Item-total correlation

From Wikipedia, the free encyclopedia

The item–total correlation is the correlation between a scored item and the total test score. It is an item statistic used in psychometric analysis to diagnose assessment items that fail to indicate the underlying psychological trait so that they can be removed or revised [1].

The item-total correlation in item analysis[edit]

In item analysis, an item–total correlation is usually calculated for each item of a scale or test to diagnose the degree to which assessment items indicate the underlying trait. Assuming that most of the items of an assessment do indicate the underlying trait, each item should have a reasonably strong positive correlation with the total score on that assessment. An important goal of item analysis is to identify and remove or revise items that are not good indicators of the underlying trait[2].

A small or negative item-correlation provides empirical evidence that the item is not measuring the same construct measured by the assessment. Exact values depend on the type of measure, but as a heuristic, a correlation value less than 0.2 indicates that the corresponding item does not correlate very well with the scale overall and, thus, it may be dropped. A negative value indicates that the item may be damaging the overall psychometric reliability of the measure. [3][4] Identifying and removing (or revising) poorly-performing items is a critical way that psychometric analysis can improve the quality of a measure.

When items are scored dichotomously, as in exams with correct and incorrect answers, the item-total correlation may be calculated as either a point-biserial correlation or a biserial correlation. This is considered important because items vary in difficulty and the point-biserial correlation cannot attain its theoretical maxima [+1,-1] unless the proportion correct is 0.50 (50% answering the item correctly). The biserial correlation has a correction that, in theory, avoids this issue[1]. In practice, analysts should choose either the point-biserial or biserial and not try to compare, because the correction of the biserial will always produce a slightly larger magnitude as compared to the point-biserial[5].

The item-reliability index (IRI) is defined as the product of the point-biserial item-total correlation and the item standard deviation. In classical test theory, the IRI indexes the degree to which an item contributes true score variance to the exam observed score variance. In practice, a negative IRI indicates the relative degree which an item damages the reliability estimate and a positive value indicates the relative degree which it contributes towards a high reliability estimate[5].

See also[edit]

References[edit]

  1. ^ a b Henrysson, Sten (1963-06-01). "Correction of item-total correlations in item analysis". Psychometrika. 28 (2): 211–218. doi:10.1007/BF02289618. ISSN 1860-0980. S2CID 120534016.
  2. ^ Churchill, G.A., (1979). "A paradigm for developing better measures of marketing constructs", Journal of Marketing Research, 16(1) pp 64–73, doi:10.1177/002224377901600110, JSTOR 3150876
  3. ^ Everitt, B.S. (2002) The Cambridge Dictionary of Statistics, 2nd Edition, CUP. ISBN 0-521-81099-X
  4. ^ Field, A., (2005). Discovering Statistics Using SPSS. 2nd ed. London: Sage
  5. ^ a b Allen, M.J., & Yen, W. M. (1979) Introduction to Measurement Theory, Wadsworth. ISBN 0-8185-0283-5