Parametric versus Nonparametric Statistics – When to use them and which is more

Transcription

Parametric versus Nonparametric Statistics – When to use them and which is more
Parametric versus
Nonparametric
Statistics – When to use
them and which is more
powerful?
Angela Hebel
Department of Natural Sciences
University of Maryland Eastern Shore
April 5, 2002
Parametric Assumptions
The observations must be independent
 The observations must be drawn from
normally distributed populations
 These populations must have the same
variances
 The means of these normal and
homoscedastic populations must be linear
combinations of effects due to columns
and/or rows*

Nonparametric Assumptions
Observations are independent
 Variable under study has underlying
continuity

Measurement

What are the 4 levels of measurement
discussed in Siegel’s chapter?
1. Nominal or Classificatory Scale
 Gender, ethnic background
2. Ordinal or Ranking Scale
 Hardness of rocks, beauty, military ranks
3. Interval Scale
 Celsius or Fahrenheit
4. Ratio Scale
 Kelvin temperature, speed, height, mass or weight
Nonparametric Methods


There is at least one nonparametric test
equivalent to a parametric test
These tests fall into several categories
1. Tests of differences between groups
(independent samples)
2. Tests of differences between variables
(dependent samples)
3. Tests of relationships between variables
Differences between independent
groups

Two samples –
compare mean value
for some variable of
interest
Parametric
Nonparametric
t-test for
independent
samples
Wald-Wolfowitz
runs test
Mann-Whitney
U test
KolmogorovSmirnov two
sample test
Mann-Whitney U Test
Nonparametric alternative to two-sample
t-test
 Actual measurements not used – ranks of
the measurements used
 Data can be ranked from highest to lowest
or lowest to highest values
 Calculate Mann-Whitney U statistic

U = n1n2 + n1(n1+1) – R1
2
Example of Mann-Whitney U test
Two tailed null hypothesis that there is no
difference between the heights of male
and female students
 Ho: Male and female students are the
same height
 HA: Male and female students are not the
same height

U = n1n2 + n1(n1+1) – R1
2
U=(7)(5) + (7)(8) – 30
2
U = 35 + 28 – 30
U = 33
U’ = n1n2 – U
Heights
of
males
(cm)
Heights
of
females
(cm)
Ranks of
male
heights
Ranks
of
female
heights
193
175
1
7
188
173
2
8
185
168
3
10
183
165
4
11
180
163
5
12
178
6
170
9
n1 = 7
n2 = 5
R1 = 30
U’ = (7)(5) – 33
U’ = 2
U 0.05(2),7,5 = U 0.05(2),5,7 = 30
As 33 > 30, Ho is rejected
Zar, 1996
R2 = 48
Differences between independent
groups

Multiple groups
Parametric
Nonparametric
Analysis of Kruskal-Wallis
variance
analysis of
(ANOVA/
ranks
MANOVA)
Median test
Differences between dependent
groups


Compare two variables
measured in the same
sample
Parametric
t-test for
dependent
samples
If more than two
variables are measured in Repeated
same sample
measures
ANOVA
Nonparametric
Sign test
Wilcoxon’s
matched pairs
test
Friedman’s two
way analysis of
variance
Cochran Q
Relationships between variables
Parametric
Nonparametric
Correlation
coefficient
Spearman R
Kendall Tau
Coefficient Gamma

Two variables of
interest are
categorical
Chi square
Phi coefficient
Fisher exact test
Kendall coefficient of
concordance
Summary Table of Statistical Tests
Level of
Measurement
Sample Characteristics
1
Sample
Categorical
or Nominal
Χ2 or
binomial
Rank or
Ordinal
Parametric
(Interval &
Ratio)
z test
or t test
2 Sample
Correlation
K Sample (i.e., >2)
Independent
Dependent
Independent
Dependent
Χ2
Macnarmar’
s Χ2
Χ2
Cochran’s Q
Mann
Whitney U
Wilcoxin
Matched
Pairs Signed
Ranks
Kruskal Wallis
H
Friendman’s
ANOVA
Spearman’s
rho
t test
between
groups
t test within
groups
1 way ANOVA
between
groups
1 way
ANOVA
(within or
repeated
measure)
Pearson’s r
Factorial (2 way) ANOVA
(Plonskey, 2001)
Advantages of Nonparametric Tests
Probability statements obtained from most
nonparametric statistics are exact
probabilities, regardless of the shape of
the population distribution from which the
random sample was drawn
 If sample sizes as small as N=6 are used,
there is no alternative to using a
nonparametric test

Siegel, 1956
Advantages of Nonparametric Tests
Treat samples made up of observations from
several different populations.
 Can treat data which are inherently in ranks as
well as data whose seemingly numerical scores
have the strength in ranks
 They are available to treat data which are
classificatory
 Easier to learn and apply than parametric tests

Siegel, 1956
Criticisms of Nonparametric
Procedures
Losing precision/wasteful of data
 Low power
 False sense of security
 Lack of software
 Testing distributions only
 Higher-ordered interactions not dealt with

Power of a Test

Statistical power – probability of rejecting
the null hypothesis when it is in fact false
and should be rejected
– Power of parametric tests – calculated from
formula, tables, and graphs based on their
underlying distribution
– Power of nonparametric tests – less
straightforward; calculated using Monte Carlo
simulation methods (Mumby, 2002)
Questions?