Psychology Science, Volume 45, 2003 (2), p. 217-222
Charting the future of Configural Frequency Analysis:
The development of a statistical method
ALEXANDER VON EYE¹, ERWIN LAUTSCH²
1. Configural Frequency Analysis - the past
The first 35 years of the development of Configural Frequency Analysis (CFA; Lienert,
1969) were characterized by a rapid expansion of possibilities. Full of enthusiasm, researchers
developed new designs that allow one to answer increasingly specific questions. The areas of
categorical variable analysis, parametric and non-parametric statistics, significance testing,
modeling, sampling, α-protection, frequentist and Bayesian statistics, and many other
domains were combed with the goal of identifying methods, models, and techniques that could
be adopted for use in CFA. In addition, the advent of CFA triggered the development of new
methods, in particular in the areas of significance testing and α-protection. Table 1 presents a
non-exhaustive time table of CFA-related innovations.
Table 1: CFA-related innovations in the second millennium

Year   Event
1904   first discussion of contingence tables (Pearson)
1922   combination of symptoms beyond expectation (Pfaundler & von Sehr)
1950   first discussion of the concept of configurations (Meehl)
1968   CFA proposed (Lienert)
1969   first discussion of log-linear models (Bishop & Fienberg)
1971   X²-test for CFA proposed (Lienert)
1971   first CFA model for two groups of variables (2-sample CFA; Lienert)
1973   Binomial test proposed for use in CFA (Krauth & Lienert)
1973   α-adjustment proposed (Krauth)
1975   first dedicated CFA software (Roeder)
1988   hierarchical log-linear models proposed as base models for CFA (von Eye)
1989   log-linear quasi-independence models proposed as a new approach to CFA (Victor)
1994   Bayesian CFA proposed (Wood, Sher, & von Eye)
1995   alternative concepts of deviation from independence discussed (von Eye, Spiel, & Rovine)
1998   discussion of the relationship between sampling schemes and the selection of CFA base models (von Eye & Schuster)
2000   use of β-error for evaluation of test performance in CFA (von Weber)
2000   Covariates introduced in CFA (Glück & von Eye)

¹ Prof. Dr. Alexander von Eye, Michigan State University, Department of Psychology, 119 Snyder Hall, East
Lansing, MI 48824-1117; E-mail: [email protected]
² Prof. Dr. Dr. Erwin Lautsch, Universität Kassel, FB 5: Gesellschaftswissenschaften, Nora-Platiel-Str. 1,
D-34127 Kassel; E-mail: [email protected]
These efforts paid off greatly. CFA now belongs to the arsenal of generally accepted
methods of analysis. The method finds applications in all areas of the empirical sciences.
Empirical articles in which data are analyzed using CFA appear in the best journals. Textbooks
on CFA have been published by reputable publishers, computer programs have been published,
and CFA as a method is covered by entries in recent and upcoming encyclopedias. In other
words, CFA as a method for the exploration of cross-classifications is known to be a useful
method that is employed widely (for a brief history of CFA see von Eye & Lautsch, 2000).
2. Configural Frequency Analysis - the future
At least as important as the recognition and the use of a statistical method is its continuous
development. In the history of most methods of statistics, the presentation of a new method is
followed by a period of euphoria. During this period, the basics of the method are established,
and researchers explore fields of application. The possibilities provided by the new method
are charted. Soon, limits become apparent and misuses become known. Researchers learn that
there are optimal data characteristics for the application of a method, but that there are also
conditions under which an application is less promising. For example, data bodies may be too
small or too large, distributional characteristics may not meet requirements, or the questions
asked by researchers cannot be answered using a particular method.
In the case of CFA, the bases have been established, as can be seen from the brief time line
in Table 1. The method finds widespread application. In addition, methodologists are now in a
phase in which the characteristics of elements of CFA are examined under various conditions.
Six fields of research on the method of CFA can currently be distinguished:
1. Simulation studies that center on the behavior of statistical tests that are used to make type/
antitype decisions (von Eye, 2002; in press; von Weber, 2000); more studies are under way
(see below).
2. Studies concerning the dependency structure of tests performed in CFA. First studies
exist (Victor, 1989), in which the authors propose that there be at least 3 or 4 degrees of
freedom for each type/antitype in a cross-classification. More studies on this topic are
being undertaken (see below).
3. Studies concerning the size of tables that can be meaningfully explored using CFA.
Stimulated by a paper by DuMouchel (1999), studies are being undertaken with the goal
of determining the maximum and the minimum size of tables for which CFA is a suitable
method of analysis (see below).
4. The statistical bases of CFA are being expanded. The original approach to CFA is based
on methods for the estimation of expected cell frequencies that reside in what is known
as χ²-analysis. These methods have been put in the context of hierarchical log-linear
modeling (von Eye, 1988), and in the context of the more general log-linear models of
quasi-independence (Victor, 1989; Kieser & Victor, 1999). In addition, CFA has been
reformulated as a method of Bayesian statistics (Wood, Sher, & von Eye, 1994; Gutiérrez-Peña & von Eye, 2000). The earlier approaches used noninformative priors. We are
waiting for these researchers to present Bayesian CFA methods that employ different
concepts of priors.
5. First attempts exist at formulating a new version of Interaction Structure Analysis (ISA;
Lienert & Krauth, 1973) that is based on the General Linear Model instead of the General
Log-linear Model (Bortz, 2002). These attempts are underway, and we look forward to
seeing first written reports.
6. Existing computer programs are continuously being improved. Current foci include
improved procedures of α-protection, the incorporation of estimates of β-errors, and the
automatized determination of continuity corrections (see below). First attempts have also
been made to base the estimation of expected cell frequencies on multivariate distributional
assumptions (von Eye & Gardiner, in preparation).
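To make the elements named in the list above more concrete (expected cell frequencies, type/antitype decisions, and α-protection), the following minimal sketch runs a first order CFA on a small cross-classification. It is an illustration under stated assumptions, not the procedure of any particular CFA program: the base model is total independence of the variables, each cell is tested with a one-sided binomial test, and α is protected with a simple Bonferroni adjustment. The function name first_order_cfa, the helper names, and the example frequencies are hypothetical.

```python
from itertools import product
from math import comb


def binom_pmf(n, k, p):
    """Probability of exactly k successes in n trials with success probability p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def upper_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(binom_pmf(n, i, p) for i in range(k, n + 1))


def lower_tail(n, k, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(binom_pmf(n, i, p) for i in range(0, k + 1))


def first_order_cfa(observed, alpha=0.05):
    """Sketch of a first order CFA (assumed base model: variable independence).

    observed maps configuration tuples, e.g. (1, 2, 1), to observed frequencies.
    Returns {configuration: (observed, expected, tail probability, label)}.
    """
    n = sum(observed.values())
    n_vars = len(next(iter(observed)))

    # Marginal proportions of every category of every variable.
    margins = []
    for v in range(n_vars):
        totals = {}
        for config, count in observed.items():
            totals[config[v]] = totals.get(config[v], 0) + count
        margins.append({cat: t / n for cat, t in totals.items()})

    # Every cell of the full cross-classification is tested, including empty ones.
    cells = list(product(*(sorted(m) for m in margins)))
    alpha_star = alpha / len(cells)  # Bonferroni adjustment (an assumption here)

    results = {}
    for cell in cells:
        # Cell probability under independence: product of marginal proportions.
        p = 1.0
        for v, cat in enumerate(cell):
            p *= margins[v][cat]
        obs = observed.get(cell, 0)
        expected = n * p
        if obs > expected:
            tail = upper_tail(n, obs, p)
            label = "type" if tail < alpha_star else "-"
        else:
            # Cells with obs <= expected are examined as antitype candidates.
            tail = lower_tail(n, obs, p)
            label = "antitype" if tail < alpha_star else "-"
        results[cell] = (obs, expected, tail, label)
    return results


# Hypothetical 2 x 2 x 2 table, invented for illustration only.
table = {
    (1, 1, 1): 40, (1, 1, 2): 5, (1, 2, 1): 5, (1, 2, 2): 10,
    (2, 1, 1): 5, (2, 1, 2): 10, (2, 2, 1): 10, (2, 2, 2): 35,
}

for cell, (obs, exp, tail, label) in first_order_cfa(table).items():
    print(cell, obs, round(exp, 2), f"{tail:.5f}", label)
```

Swapping the binomial test for another test statistic, or the Bonferroni adjustment for a less conservative procedure, changes only the two marked places in the loop, which is where the simulation studies listed above compare alternatives.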
These six topics of further development indicate that CFA has not only found broad fields
of application but also possesses great potential for further development, such that an even
wider range of questions can be answered and tailored solutions can be provided for even
more problems.
3. The topics of the current issue
The current Special Issue reflects the trends described for the development and application
of CFA. The contributions (1) present interesting and innovative applications of CFA, (2)
present new developments of the method of CFA, and (3) discuss CFA in comparison with other
existing methods of data analysis. The contributions are grouped in two sections. The first is
applications of CFA. This section contains seven articles in which existing methods are
employed. The second section proposes developments of the method of CFA. This domain
contains eleven articles that reflect the lines of development highlighted in Section 2.
3.1 Applications of CFA
The first article in this section presents a re-analysis of data that Janke analyzed using
multiple regression methods in the years 1963 - 1966. These were the years immediately
before the first version of CFA was proposed by Lienert (1968). The authors, Janke and Ising,
show CFA-specific results and compare CFA with regression analysis.
The second article, contributed by Ising, employs CFA as a method for the detection of
genetic associations for complex diseases. This article illustrates the usefulness of CFA as an
exploratory method in case-control studies and in family-based association studies.
The third article is authored by Wagner-Menghin. This contribution centers around the
possibility of using CFA for the identification of achievement motivation types from data
collected with the Work Style test battery, a short, computer-assisted test battery.
Bäumler and Stemmler study an interesting socio-genetic hypothesis in the fourth article.
The authors ask whether mate selection in Germany 200 years ago can be retraced from
physical characteristics of athletes in the 20th century. CFA methods are used to confirm this
hypothesis.
In the fifth article, Lautsch and Thöle use data from the Shell Youth Study, 2000, to
classify and explain life concepts in adolescents. CFA is used for both goals of analysis.
On the interface of application and development of a method is the comparison of
statistical methods using empirical data. Two articles are included that address this topic. In
the first of these two, the sixth article of this section, Reuter, Hüppe, Netter, and Hennig
compare CFA with Structural Equation Modeling. The authors conclude
that both methods, while providing congruent findings, yield non-redundant results. In the
second, Lautsch and Plichta compare CFA, correspondence analysis, and latent class analysis.
The authors conclude that these three methods complement each other in the analysis of the
structure of types.
3.2 New Developments of CFA
This section presents new and classical methodological and conceptual developments
of CFA. In the first contribution, Krauth asks whether dichotomization, a popular method
of categorizing continuous scales, is a suitable procedure that can lead to appropriate CFA
applications. Artifacts are pointed out and illustrated.
A topic that is central to the interpretability of CFA results is the dependency of CFA tests.
Krauth shows in the second article, using the base model of first order CFA, that tests in small
tables are dependent. Bounds for the percentage of possible type structures are provided.
Related to this topic is the third article which was contributed by von Weber, Lautsch, and
von Eye. The authors present conceptual and simulation results on the question of whether the
application of the first order or the zero order CFA base models is meaningful in 2 x 2 tables.
Another two simulation studies follow. The first of these articles, also authored by von
Weber, Lautsch, and von Eye, focuses on the performance of CFA tests in tables of varying
sizes. In addition, this study presents a new method for the determination of continuity
corrections that help researchers keep the α-level constant, and the study shows the magnitude
of the β-errors one faces when performing CFA. The last simulation study in this group,
presented by von Eye in the fifth article in this section, focuses on the performance of tests
used for the 2 x 2 tables of interest in 2-sample CFA. This work focuses on relative power and
on the distributional characteristics of the test statistics.
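As an illustration of the 2 x 2 tables that arise in 2-sample CFA, the sketch below cross-classifies each configuration against all remaining configurations combined, separately for two groups, and tests the resulting table. The choice of test is an assumption made for brevity: Pearson's X² without continuity correction, with a Bonferroni-adjusted α. The studies described above compare several tests, and this sketch does not claim to implement any of their recommendations. The function name two_sample_cfa and the example frequencies are hypothetical.

```python
from math import erfc, sqrt


def two_sample_cfa(group_a, group_b, alpha=0.05):
    """Sketch of 2-sample CFA: one 2 x 2 table per configuration.

    group_a and group_b map configurations to observed frequencies.
    Returns {configuration: (X2, p value, discriminates between groups?)}.
    """
    configs = sorted(set(group_a) | set(group_b))
    n_a = sum(group_a.values())
    n_b = sum(group_b.values())
    n = n_a + n_b
    alpha_star = alpha / len(configs)  # Bonferroni adjustment (an assumption here)

    results = {}
    for config in configs:
        # 2 x 2 table: rows = this configuration vs. all others, columns = groups.
        a = group_a.get(config, 0)
        b = group_b.get(config, 0)
        c = n_a - a
        d = n_b - b
        denom = (a + b) * (c + d) * n_a * n_b
        # Pearson X2 for a 2 x 2 table; set to 0 when a margin is empty.
        x2 = n * (a * d - b * c) ** 2 / denom if denom else 0.0
        # Upper tail of the chi-square distribution with 1 degree of freedom.
        p = erfc(sqrt(x2 / 2))
        results[config] = (x2, p, p < alpha_star)
    return results


# Hypothetical frequencies for two groups, invented for illustration only.
group_a = {"11": 30, "12": 10, "21": 10, "22": 10}
group_b = {"11": 10, "12": 10, "21": 10, "22": 30}

for config, (x2, p, flag) in two_sample_cfa(group_a, group_b).items():
    print(config, round(x2, 3), f"{p:.5f}", flag)
```

The p value uses the identity that a chi-square variable with one degree of freedom is a squared standard normal variable, so no statistics library is needed.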
In the sixth article of this section, Lautsch and von Weber propose a new procedure for use
in CFA. This procedure uses Victor’s and Bayesian concepts of CFA. Numerical simulations
show that the procedure performs well in comparison with established procedures.
Critical notes about the coefficient of determination as applied in CFA are presented in the
seventh article, by Betzin and Bollmann-Sdorra.
Stemmler and Bingham take up the topic of how to analyze improvement scores in pre-post designs. In the eighth article of this
section, the authors propose CFA methods for this analysis, specifically CFA methods of group comparisons.
New methods for the analysis of change using CFA are proposed by Stemmler and von
Eye in article nine. The authors propose using marginal homogeneity models and compare the
new approach with methods of Directed CFA and Prediction CFA.
In article ten, Lautsch, von Eye, and von Weber present a comparison of currently actively
developed software programs for CFA.
This section concludes with three articles from the stock of unpublished CFA papers. It is
well known that a large number of articles on CFA exist in draft form but were never pursued
to publication. Three of these articles, authored by Krauth, are presented here. These
articles provide the mathematical foundation of CFA. The first of the three articles deals with
Lancaster’s χ² decomposition model as the basis for Lienert’s Association Structure Analysis.
The second article discusses the bases of methods for α-protection. The third paper provides
an inferential basis for two- and multisample CFA. These three articles are of dual importance.
First, they show the mathematical bases of a method that has been discussed largely from an
applied perspective. Second, these articles are of historical value. They show that from the
beginning of the development of CFA, the mathematical foundation of CFA as a statistical
method was discussed. Current efforts to describe the characteristics of the methods of CFA,
exemplified, for instance, by Krauth’s paper on type structures or by the simulation studies in
this Special Issue, can be viewed as a continuation of the attempts to develop CFA as a method
of defensible mathematical and statistical characteristics.
Thus, this Special Issue reflects the two streams of work that characterize current work in
the domain of CFA. On the one hand, there is a large field of application. On the other hand,
there is continuous development of CFA as a method.
References
1. Bishop, Y.M.M., & Fienberg, S.E. (1969). Incomplete two-dimensional contingency tables.
Biometrics, 25, 119 - 128.
2. Bortz, J. (2002). Interaktionsstrukturanalyse (ISA) bei nicht orthogonalen Kontingenztafeln.
Vortrag auf dem G.A. Lienert Gedächtnissymposium: die Konfigurationsfrequenzanalyse in
Theorie und Anwendung. Wien: Universität.
3. DuMouchel, W. (1999). Bayesian data mining in large frequency tables, with an application
to the FDA spontaneous reporting system. The American Statistician, 53, 177 - 190.
4. Glück, J., & von Eye, A. (2000). Including covariates in Configural Frequency Analysis.
Psychologische Beiträge, 42, 405 - 417.
5. Gutiérrez-Peña, E., & von Eye, A. (2000). A Bayesian approach to Configural Frequency
Analysis. Journal of Mathematical Sociology, 24, 151 - 174.
6. Kieser, M., & Victor, N. (1999). Configural Frequency Analysis (CFA) revisited - A new look
at an old approach. Biometrical Journal, 41, 967 - 983.
7. Krauth, J. (1973). Inferenzstatistischer Nachweis von Typen und Syndromen. In J. Krauth, &
G.A. Lienert. KFA. Die Konfigurationsfrequenzanalyse und ihre Anwendung in Psychologie
und Medizin (pp. 39 - 51). Freiburg: Alber.
8. Krauth, J., & Lienert, G.A. (1973). Nichtparametrischer Nachweis von Syndromen durch
simultane Binomialtests. Biometrische Zeitschrift, 15, 13 - 20.
9. Lienert, G.A. (1969). Die “Konfigurationsfrequenzanalyse” als Klassifikationsmethode in der
klinischen Psychologie. In M. Irle (Ed.), Bericht über den 16. Kongress der Deutschen Gesellschaft für Psychologie in Tübingen 1968 (pp. 244 - 255). Göttingen: Hogrefe.
10. Lienert, G.A. (1971). Die Konfigurationsfrequenzanalyse I. Ein neuer Weg zu Typen und
Syndromen. Zeitschrift für Klinische Psychologie und Psychotherapie, 19, 99 - 115.
11. Lienert, G.A., & Krauth, J. (1973). Die Konfigurationsfrequenzanalyse V. Kontingenz- und
Interaktionsstrukturanalyse multinär skalierter Merkmale. Zeitschrift für Klinische Psychologie und Psychotherapie, 21, 26 - 39.
12. Meehl, P.E. (1950). Configural scoring. Journal of Consulting Psychology, 14, 165 - 171.
13. Pearson, K. (1904). On the theory of contingency and its relation to association and normal
correlation. Drapers’ Company Research Memoirs, Biometric Series I.
14. Roeder, B. (1975). KFA-Programm. Dortmund, Pädagogische Hochschule. Unpublished
software.
15. Victor, N. (1989). An alternative approach to configural frequency analysis. Methodika,
3, 61 - 73.
16. von Eye, A. (1988). The General Linear Model as a framework for models in Configural
Frequency Analysis. Biometrical Journal, 30, 59 - 67.
17. von Eye, A. (2002). The odds favor antitypes - A comparison of tests for the identification of
configural types and antitypes. Methods of Psychological Research - online, 7, 1 - 29.
18. von Eye, A. (in press). A comparison of tests used in 2 x 2 tables and in two-sample CFA.
Psychologische Beiträge.
19. von Eye, A., & Gardiner, J.C. (in preparation). Locating deviations from multivariate normality.
20. von Eye, A., & Lautsch, E. (2000). A brief history of Configural Frequency Analysis. Psychologische Beiträge, 42, 241 - 249.
21. von Eye, A., & Schuster, C. (1998). On the specification of models for Configural Frequency
Analysis - Sampling schemes in Prediction CFA. Methods of Psychological Research - online, 3, 55 - 73.
22. von Eye, A., Spiel, C., & Rovine, M. J. (1995). Concepts of nonindependence in Configural
Frequency Analysis. Journal of Mathematical Sociology, 20, 41 - 54.
23. von Weber, S. (2000). Ein Vergleich der in der KFA verwendeten Tests mittels Simulationsrechnungen. Psychologische Beiträge, 42, 260 - 272.
24. Wood, P. K., Sher, K., & von Eye, A. (1994). Conjugate methods in Configural Frequency
Analysis. Biometrical Journal, 36, 387 - 410.
Acknowledgements. About half of the articles that are included in this Special Issue are
based on the presentations that the authors made at the conference that Lautsch, Lantermann,
and von Eye had organized to commemorate the first anniversary of G.A. Lienert’s death, in
Kassel, May 2002. The other articles are contributions written for this Special Issue. The editors
of this Special Issue are indebted to the authors. Their efforts result in this most attractive
Special Issue which demonstrates clearly that research with and on the method of CFA is most
active and most promising. The editors are also indebted to the G.A. Lienert Foundation for
financial support of the conference in Kassel. Finally, we would like to thank the publisher and
the editor of this journal, W. Pabst and K. Kubinger, respectively, for providing us with the
opportunity to present this exciting issue to the readership of the journal.
Alexander von Eye (East Lansing) and Erwin Lautsch (Kassel)