numerical method. Psychometrika, 29, 115-129.
Kubany, E., Haynes, S. N., Abueg, F. R., Manke, F. P., Brennan,
J. e Stahura, C. (1996). Development and validation of
the Trauma-Related Guilt Inventory. Psychological Assessment, 8, 428-444.
Kuder, G. F. (1934). Kuder Preference Record-Vocational. Chicago: Science Research Associates.
Kuder, G. F. (1939). Manual to the Kuder Preference Record. Chicago: Science Research Associates.
Kuder, G. F. e Richardson, M. W. (1937). The theory of estimation of test reliability. Psychometrika, 2, 151-160.
Kunce, J. T., Cook, W. D. e Miller, D. E. (1975). Random variables and correlational overkill. Educational and Psychological Measurement, 35, 529-534.
Lai, C. T. (1970). A Scholar in Imperial China. Hong Kong:
Kelly & Walsh.
Lance, C. E., Butts, M. M. e Michels, L. C. (2006). The sources
of four commonly reported cutoff criteria: What did they
really say? Organizational Research Methods, 9, 202-220.
Lancia, F. (2004). Strumenti per l’analisi dei testi. Introduzione
all’uso di T-LAB. Milano: Franco Angeli.
Landis, C. e Katz, S. (1934). The validity of certain questions
which purport to measure neurotic tendencies. Journal of
Applied Psychology, 18, 343-356.
Landis, J. R. e Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33, 159-174.
Lang, P. J. (1980). Behavioral treatment and bio-behavioral
assessment: computer applications. In J. B. Sidowski, J. H.
Teoria e tecnica psicometrica - Costruire un test psicologico
Carlo Chiorri
© 2011, The McGraw-Hill Companies srl
Johnson e T. A. Williams (Eds.), Technology in Mental Health
Care Delivery Systems (pp. 119-137). Norwood, NJ: Ablex.
Lang, P. J., Öhman, A. e Vaitl, D. (1988). The International Affective Picture System [photographic slides]. Gainesville, FL:
The Center for Research in Psychophysiology, University
of Florida.
Lauriola, M. (2007). L’analisi non parametrica dei dati. In A.
P. Ercolani (Ed.), Strumenti statistici per la ricerca, la valutazione e la diagnosi in psicologia (pp. 71-147). Milano: Raffaello Cortina.
Lawley, D. (1943). On problems connected with item selection and test construction. Proceedings of the Royal Society of
Edinburgh 61A, 273-287.
Lawley, D. N. (1940). The estimation of factor loadings by the
method of maximum likelihood. Proceedings of the Royal Society of Edinburgh, 60, 64-82.
Lawley, D. N. e Maxwell, A. E. (1971). Factor Analysis as a Statistical Method. London: Butterworth and Co.
Lawshe, C. H. (1975). The quantitative approach to content
validity. Personnel Psychology, 28, 563-575.
Ledermann, W. (1937). On the rank of reduced correlation
matrices in multiple factor analysis. Psychometrika, 2, 85-93.
Lennon, R. T. (1956). Assumptions underlying the use of
content validity. Educational and Psychological Measurement,
16, 294-304.
Levy, P. (1968). Short-form tests: A methodological review.
Psychological Bulletin, 69, 410-416.
Li, H. e Stout, W. (1996). A new procedure for detection of
crossing DIF. Psychometrika, 61, 647-677.
Liebowitz, M. R. (1987). Social phobia. Modern Problems of Pharmacopsychiatry, 22, 141-173.
Likert, R. (1932). A technique for the measurement of attitudes. Archives of Psychology, 140, 1-55.
Linacre, J. M. (1996). DIF in polytomous items. Rasch Measurement Transactions 10:3, 5-20.
Linacre, J. M. (1998). Structure in Rasch residuals: why principal components analysis? Rasch Measurement Transactions, 12, 6-36.
Linacre, J. M. (2000). Computer-Adaptive Testing: A methodology whose time has come. MESA Memorandum n.
69. Chicago: MESA Psychometric Laboratory, University
of Chicago.
Linacre, J. M. (2002). What do Infit and Outfit, Mean-square
and Standardized mean? Rasch Measurement Transactions,
16(2), 8-78.
Linacre, J. M. (2003). A User’s Guide to WINSTEPS: Rasch-Model
Computer Programs. Chicago: Mesa Press.
Linacre, J. M. (2004). Discrimination, guessing and carelessness: Estimating IRT parameters with Rasch. Rasch Measurement Transactions, 18, 959-960.
Linacre, J. M. e Wright, B. D. (1986). Item bias: Mantel-Haenszel
and the Rasch Model (MESA Psychometric Laboratory, Memorandum No. 39). Chicago: University of Chicago, Department of Education.
Lindeman, R.H., Merenda, P.F. e Gold, R.Z. (1980). Introduction to Bivariate and Multidimensional Analysis. Glenview, IL:
Scott, Foresman.
Linton, M. (1982). Transformations of memory in everyday
life. In U. Neisser (Ed.), Memory Observed: Remembering in
Natural Contexts (pp. 77-91). San Francisco: Freeman.
Little, R. J. A. (1988). A test of missing completely at random
for multivariate data with missing values. Journal of the
American Statistical Association, 83, 1198-1202.
Little, T. D., Lindenberger, U. e Nesselroade, J. R. (1999). On
selecting indicators for multivariate measurement and
modeling with latent variables: When "good" indicators
are bad and "bad" indicators are good. Psychological Methods, 4(2), 192-211.
Loevinger, J. (1957) Objective tests as instruments of psychological theory. Psychological Reports, 3, 635-694.
Loevinger, J. (1994). Has psychology lost its conscience? Journal of Personality Assessment, 62, 2-8.
Loftus, E. F. e Marburger, W. (1983). Since the eruption of Mt.
St. Helens, has anyone beaten you up? Memory and Cognition, 11, 114-120.
Loken, E. e Rulison, K. L. (2010). Estimating a four-parameter
item response theory model. British Journal of Mathematical
and Statistical Psychology, 63, 509-525.
Lonner, W. J. (1990). The introductory psychology text and
cross-cultural psychology: Beyond Ekman, Whorf, and
biased I.Q. tests. In D. Keats, D. Monro e L. Mann (Eds.),
Heterogeneity in Cross-Cultural Psychology: Selected Papers from
the Ninth International Conference of the International Association for Cross-Cultural Psychology (pp. 422). Lisse, the Netherlands: Swets and Zeitlinger.
Lord, F. (1952). A Theory of Test Scores (Psychometric Monograph
No. 7). Richmond, VA: Psychometric Corporation.
Lord, F. M. (1959). Tests of the same length do have the same
standard error of measurement. Educational and Psychological Measurement, 19, 233-239.
Lord, F. M. (1968). An analysis of the verbal scholastic aptitude
test using Birnbaum’s three-parameter logistic model.
Educational and Psychological Measurement, 28, 989-1020.
Lord, F. M. (1977). Practical applications of item characteristic
curve theory. Journal of Educational Measurement, 14, 117-138.
Lord, F. M. (1980). Applications of item response theory to
practical testing problems. Hillsdale, NJ: Erlbaum.
Lord, F. M. e Novick, M. R. (1968). Statistical Theories of
Mental Test Scores. Reading: Addison-Wesley.
Lord, F. M. e Wingersky, M. S. (1983). Comparison of IRT
true-score and equipercentile observed-score "equatings."
Applied Psychological Measurement, 8, 453-461.
Lorenzo-Seva, U. (1999). Promin: a method for oblique factor rotation. Multivariate Behavioral Research, 34, 347-356.
Lorenzo-Seva, U. (2003). A factor simplicity index. Psychometrika, 68, 49-60.
Lorenzo-Seva, U. e Ferrando, P. J. (2006). FACTOR: A computer program to fit the exploratory factor analysis model.
Behavior Research Methods, 38, 88-91.
Lorenzo-Seva, U. e ten Berge, J. M. F. (2006). Tucker's congruence coefficient as a meaningful index of factor similarity. Methodology, 2(2), 57-64.
Lorenzo-Seva, U., Timmerman, M. E. e Kiers, H.A.L. (2011).
The Hull method for selecting the number of common factors. Multivariate Behavioral Research, 46(2), 340-364.
Lorge, I. (1937). Gen-like: Halo or reality? Psychological Bulletin, 34, 545-546.
Lucarelli, A. (1993). Psicologia dello sviluppo: le origini. Firenze: Giunti.
Luce, R. D. e Krumhansl, C. (1988). Measurement, scaling,
and psychophysics. In R. C. Atkinson, R. J. Herrnstein, G.
Lindzey e R. D. Luce (Eds.), Stevens’ Handbook of Experimental Psychology (pp. 1-74). New York: Wiley.
Luce, R. D. e Tukey, J. (1964). Simultaneous conjoint measurement: A new type of fundamental measurement. Journal of Mathematical Psychology, 1, 1-27.
Lüdtke, O., Trautwein, U., Nagy, G. e Köller, O. (2004). Eine
Validierungsstudie zum NEO-FFI in einer Stichprobe junger Erwachsener: Effekte des Itemformats, faktorielle Validität und Zusammenhänge mit Schulleistungsindikatoren. Diagnostica, 50, 134-144.
Lyman, H. B. (1971). Test Scores and What They Mean, Second
Edition. Englewood Cliffs, N.J.: Prentice-Hall, Inc.
Lynn, M. R. (1986). Determination and quantification of content validity. Nursing Research, 35, 382-385.
MacCallum, R. C., Roznowski, M. e Necowitz, L. B. (1992).
Model modification in covariance structure analysis: The
problem of capitalization on chance. Psychological Bulletin,
111, 490-504.
MacCallum, R. C., Widaman, K. F., Preacher, K. J. e Hong, S.
(2001). Sample size in factor analysis: The role of model
error. Multivariate Behavioral Research, 36, 611-637.
MacCallum, R. C., Widaman, K. F., Zhang, S. e Hong, S.
(1999). Sample size in factor analysis. Psychological Methods, 4, 84-99.
MacCorquodale, K. e Meehl, P. E. (1948). On a distinction
between hypothetical constructs and intervening variables. Psychological Review, 55, 95-107.
Machiavelli, N. (1513/1967). Il Principe (a cura di Luigi Russo,
XIII ed., Sansoni editore, Firenze 1967).
Mack, N., Woodsong, C., MacQueen, K., Guest, G. e Namey,
E. (2005). Qualitative Research Methods: A Data Collector's
Field Guide. Research Triangle Park, NC: Family Health
Mackintosh, N. J. (1995). Cyril Burt: Fraud or Framed?. Oxford:
Oxford University Press.
Magis, D., Raiche, G., Beland, S. & Gerard, P. (2010). A logistic
regression procedure to detect differential item functioning among
multiple groups. Unpublished manuscript.
Mahalanobis, P. C. (1936). On the generalised distance in statistics. Proceedings of the National Institute of Sciences of India
2 (1), 49-55.
Maher, B. A. e Gottesman, I. I. (2005). Deconstructing, reconstructing, preserving Paul E. Meehl’s legacy of construct
validity. Psychological Assessment, 17, 415-422.
Marascuilo, L. A. e Levin, J. R. (1983). Multivariate Statistics
in the Social Sciences. Monterey, CA: Brooks/Cole.
Marcoulides, G. A. e Hershberger, S. L. (1997). Multivariate
Statistical Methods. A First Course. Mahwah, NJ: Lawrence
Erlbaum Associates.
Mari, L. (2000). Beyond the representational viewpoint: a new
formalization of measurement. Measurement, 27, 71-84.
Marsh, H. W. (1989). Confirmatory factor analyses of multitrait-multimethod data: Many problems and a few solutions. Applied Psychological Measurement, 13, 335-361.
Marsh, H. W. (1993). Multitrait-multimethod analyses: Inferring each trait-method combination with multiple indicators. Applied Measurement in Education, 6(1), 49-81.
Marsh, H. W. (2007). Application of confirmatory factor
analysis and structural equation modeling in sport/exercise psychology. In G. Tenenbaum & R. C. Eklund (Eds.),
Handbook of Sport Psychology (3rd ed., pp. 774-798). Hoboken, NJ: Wiley.
Marsh, H. W. e Hocevar, D. (1988). A new, more powerful
approach to multitrait-multimethod analyses: Application
of second-order confirmatory factor analysis. Journal of
Applied Psychology, 73, 107-117.
Marsh, H. W., Hau, K.-T. e Wen, Z. (2004). In search of golden
rules: Comment on hypothesis testing approaches to setting cutoff values for fit indexes and dangers in overgeneralising Hu and Bentler’s (1999) findings. Structural Equation Modeling, 11, 320-341.
Marsh, H. W., Lüdtke, O., Muthén, B., Asparouhov, T., Morin, A. J., Trautwein, U. e Nagengast, B. (2010). A new
look at the big five factor structure through exploratory
structural equation modeling. Psychological Assessment, 22,
Marsh, H. W., Muthén, B., Asparouhov, T., Lüdtke, O., Robitzsch, A., Morin, J. S. e Trautwein, U. (2009). Exploratory Structural Equation Modeling, Integrating CFA and
EFA: Application to Students’ Evaluations of University
Teaching. Structural Equation Modeling, 16, 439-476.
Martin, R. (2004). The St. Petersburg Paradox. In E. N. Zalta
(Ed.), The Stanford Encyclopedia of Philosophy (Fall 2004 Ed.).
Stanford, California: Stanford University.
Martin-Löf, P. (1973). Statistiska modeller. Anteckningar från seminarier läsåret 1969-70 utarbetade av Rolf Sundberg. 2:a uppl.
Institutet för försäkringsmatematik och matematisk statistik vid Stockholms universitet.
Masin, S. C., Zudini, V. e Antonelli, M. (2009). Early alternative derivations of Fechner’s Law. Journal of the History
of Behavioral Sciences, 45, 56-65.
Masters, G. N. (1982). A Rasch model for partial credit scoring.
Psychometrika, 47, 149-174.
Mattick, R. P. e Clarke, J. C. (1998). Development and validation of measures of social phobia scrutiny fear and social interaction anxiety. Behaviour Research and Therapy,
36, 455-470.
Mayer, J. D. (2000). Spiritual intelligence or spiritual consciousness. The International Journal for the Psychology of Religion, 10, 47-56.
Mayer, J. D., Salovey, P. e Caruso, D. R. (2002). Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT): User’s Manual. Toronto, Canada: Multi-Health Systems.
Mazor, K. M., Clauser, B. E. e Hambleton, R. K. (1994). Identification of non-uniform differential item functioning
using a variation of the Mantel-Haenszel procedure. Educational and Psychological Measurement, 54, 284-291.
McArdle, J. J. (1996). Current directions in structural factor
analysis. Current Directions in Psychological Science, 5, 11-18.
McCall, W. A. (1939). Measurement. New York: Macmillan.
McCormack, T. J. (1922). A critique of mental measurements.
School and Society, 15, 686-692.
McCrae, R. R., Zonderman, A. B., Costa, P. T., Jr., Bond, M.
H. e Paunonen, S. (1996). Evaluating the replicability of
factors in the revised NEO Personality Inventory: Confirmatory factor analysis versus procrustes rotation. Journal
of Personality and Social Psychology, 70, 552-566.
McDonald, R. P. (2005). Semiconfirmatory factor analysis:
The example of anxiety and depression. Structural Equation
Modeling, 12, 163-172.
McGlinchey, J. B., Atkins, D. C. e Jacobson, N. S. (2002). Clinical significance methods: Which one to use and how useful are they? Behavior Therapy, 33, 529-550.
McGraw, K. O. e Wong, S. P. (1996). Forming inferences
about some intraclass correlation coefficients. Psychological
Methods, 1(1), 30-46.
McKinley, J. C. e Hathaway, S. R. (1940). A multiphasic personality schedule (Minnesota): II. A differential study of
hypochondriasis. Journal of Psychology, 10, 255-268.
McKinley, J. C. e Hathaway, S. R. (1942b). A multiphasic personality schedule (Minnesota): IV. Psychasthenia. Journal
of Applied Psychology, 26, 614-624.
McKinley, J. C. e Hathaway, S. R. (1944). A multiphasic personality schedule (Minnesota): V. Hysteria, Hypomania,
and Psychopathic Deviate. Journal of Applied Psychology,
28, 153-174.
McNair, D., Lorr, M. e Droppleman, L.F. (1971). Manual for
the Profile of the Mood States. San Diego: EdITS Educational
and Industrial Testing Service.
Mehl, M. R., Vazire, S., Ramirez-Esparza, N., Slatcher, R. B.
e Pennebaker, J. W. (2007). Are women really more talkative than men? Science, 317, 82.
Mehrabian, A. e Russell, J. A. (1974). An Approach to Environmental Psychology. Cambridge, MA: MIT.
Meier, S. T. (1994). The Chronic Crisis in Psychological Measurement and Assessment: A Historical Survey. San Diego, CA:
Academic Press.
Mellenbergh, G. J. (1982). Contingency table models for assessing item bias. Journal of Educational Statistics, 7, 105-118.
Meng, X. L., Rosenthal, R. e Rubin, D. B. (1992). Comparing
correlated correlation coefficients. Psychological Bulletin,
111, 172-175.
Menon, G., Raghubir, P. e Schwarz, N. (1995). Behavioral frequency judgments: An accessibility-diagnosticity framework. Journal of Consumer Research, 22, 212-228.
Meredith, W. (1964). Notes on factorial invariance. Psychometrika, 29, 177-185.
Meredith, W. (1964). Rotation to achieve factorial invariance. Psychometrika, 29, 187-206.
Meredith, W. e Tisak, J. (1990). Latent curve analysis. Psychometrika, 55, 107-122.
Merkel, W. T. e Wiener, R. L. (1987). A reconsideration of the
Willoughby Personality Schedule with psychiatric inpatients.
Journal of Behavioral and Experimental Psychiatry, 18, 13-18.
Mesmer-Magnus, J., Viswesvaran, C., Deshpande, S. e Joseph, J. (2006). Social desirability: the role of over-claiming, self-esteem, and emotional intelligence. Psychology Science, 48, 336-356.
Messick, S. (1962). Response style and content measures from
personality inventories. Educational and Psychological Measurement, 22, 41-56.
Messick, S. (1975). The standard problem: Meaning and values in measurement and education. American Psychologist,
30, 955-966.
Messick, S. (1988). The once and future issues of validity. Assessing the meaning and consequences of measurement.
In H. Wainer e H. Braun (Eds.), Test Validity (pp. 33-45).
Hillsdale, NJ: Lawrence Erlbaum.
Messick, S. (1993). Validity. In R. L. Linn (Ed.), Educational
Measurement (2nd ed., pp. 13-104). Phoenix: American
Council on Education and Oryx Press.
Messick, S. (1995). Validity of psychological assessment. American Psychologist, 50, 741-749.
Miceli, R. (2004). Questionari e test, dati e modelli. In R. Miceli
(Ed.), Numeri, dati, trappole (pp. 53-105). Roma: Carocci.
Michell, J. (1993). The origins of the representational theory
of measurement: Helmholtz, Hölder and Russell. Studies
in History and Philosophy of Science, 24, 185-206.
Michell, J. (1997). Quantitative science and the definition
of measurement in psychology. British Journal of Psychology,
88, 355-383.
Michell, J. (1990). An Introduction to the Logic of Psychological
Measurement. Hillsdale, NJ: Erlbaum.
Miele, F. (2002). Intelligence, Race, And Genetics: Conversations
with Arthur R. Jensen. Oxford: Westview Press.
Milgram, S. (1963). Behavioral study of obedience. Journal of
Abnormal and Social Psychology, 67, 371-378.
Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review, 63, 81-97.
Miller, T. R. e Spray, J. A. (1993). Logistic discriminant function
analysis for DIF identification of polytomously scored
items. Journal of Educational Measurement, 30, 107-122.
Miner, J. B. (1922). An aid to the analysis of vocational interests. Journal of Educational Research, 5, 311-323.
Minton, H. L. (1988). Lewis M. Terman: Pioneer in Psychological
Testing. New York: University Press.
Montebarocci, O., Codispoti, M., Baldaro, B. e Rossi, N.
(2002). La validazione italiana di uno strumento di misura
del narcisismo: il Narcissistic Personality Inventory. Ricerche di Psicologia, 25(2), 6-30.
Moore, B. V. (1921). Personnel selection of graduate engineers. Psychological Monographs, 30, 318.
Morgan, S. L. e Winship, C. (2007). Counterfactuals and Causal
Inference: Methods and Principles for Social Research. Cambridge, UK: Cambridge University Press.
Mosier, C. I. (1939). Determining a simple structure when loadings for certain tests are known. Psychometrika, 4, 149-162.
Mosier, C. I. (1940). Psychophysics and mental test theory:
Fundamental postulates and elementary theorems. Psychological Review, 47, 355-366.
Mosier, C. I. (1941). Psychophysics and mental test theory.
II. The constant process. Psychological Review, 48, 235-249.
Mosier, C. I. (1947). A critical examination of the concepts
of face validity. Educational and Psychological Measurement,
7, 191-205.
Moustaki, I. (2001). A review of exploratory factor analysis
for ordinal categorical data. In R. Cudeck, S. du Toit e D.
Sörbom (Eds.), Structural Equation Modeling: Present and Future (pp. 461-480). Chicago, IL: SSI.
Müller-Lyer, F. C. (1889). Optische Urteilstäuschungen. Archiv für Physiologie, Suppl., 263-270.
Mumpower, D. L. (1964). The fallacy of the short form. Journal of Clinical Psychology, 20, 111-113.
Mundy-Castle, A. C. (1974). Social and technological intelligence in Western and non-Western cultures. Universitas, 4, 46-52.
Murphy, K. R. e Davidshofer, C. O. (1994). Psychological Testing: Principles and Applications (3rd ed.). Englewood Cliffs,
NJ: Prentice-Hall.
Murstein, B. L. (1963). Theory and Research in Projective Techniques. New York: Wiley.
Mushquash, C. e O'Connor, B. P. (2006). SPSS and SAS programs for generalizability theory analyses. Behavior Research Methods, 38(3), 542-547.
Muthén, B. (1984). A general structural equation model with
dichotomous, ordered categorical, and continuous latent
variable indicators. Psychometrika, 49, 115-132.
Muthén, B. e Kaplan, D. (1985). A comparison of some methodologies for the factor analysis of non-normal Likert
variables. British Journal of the Mathematical and Statistical
Psychology, 38, 171-189.
Muthén, B. e Kaplan, D. (1992). A comparison of some methodologies for the factor analysis of non-normal Likert
variables: A note on the size of the model. British Journal
of Mathematical and Statistical Psychology, 45, 19-30.
Muthén, L. K. e Muthén, B. O. (1998-2007). Mplus User’s Guide (5th ed.). Los Angeles, CA: Muthén & Muthén.
Myers, I. B. (1962). Manual: the Myers-Briggs Type Indicator.
Princeton: Educational Testing Service.
Narens, L. (1996). A theory of ratio magnitude estimation.
Journal of Mathematical Psychology, 40, 109-129.
Nasser, F., Benson, J. e Wisenbaker, J. (2002). The performance of regression-based variations of the visual scree
for determining the number of common factors. Educational and Psychological Measurement, 62, 397-419.
Nelson, R. A. e Ruby, L. (1993). Physiological units in the SI.
Metrologia, 30, 55-60.
Neuhaus, J. O. e Wrigley, C. (1954). The quartimax method:
an analytical approach to orthogonal simple structure. British Journal of Statistical Psychology, 7, 187-191.
Nevo, B. (1985). Face validity revisited. Journal of Educational
Measurement, 22, 287-293.
Newcombe, R.G. (1998). Two-sided confidence intervals for
the single proportion: Comparison of seven methods. Statistics in Medicine, 17, 857-872.
Nichols, D. S. (2001). Essentials of MMPI-2 Assessment. New
York: John Wiley & Sons, Inc.
Nieder, A. e Miller, E. K. (2003). Coding of cognitive magnitude: Compressed scaling of numerical information in
the primate prefrontal cortex. Neuron, 37, 149-157.
Norman, W. T. (1963). Toward an adequate taxonomy of personality attributes: Replicated factor structure in peer nomination personality ratings. Journal of Abnormal and Social
Psychology, 66, 574-583.
Norman, W. T. (1967). 2800 Personality Trait Descriptors: Normative Operating Characteristics for a University Population. Ann
Arbor: Department of Psychology, University of Michigan.
Norusis, M. J. (2005). SPSS 13.0 Statistical Procedures Companion. Chicago: SPSS, Inc.
Nunes, T., Schliemann, A. D. e Carraher, D. W. (1993). Street
Mathematics and School Mathematics. New York: Cambridge
University Press.
Nunnally, J. C. (1967). Psychometric Theory. New York:
McGraw Hill.
Nunnally, J. C. e Bernstein, I. H. (1994). Psychometric Theory
(3rd ed.). New York: McGraw Hill.
Ochner, C. N., Gray, J. A. e Brickner, K. (2009). The development and initial validation of a new measure of male
body dissatisfaction. Eating Behaviors, 10, 197-201.
O’Connor, B. P. (2000). SPSS and SAS programs for determining the number of components using parallel analysis
and Velicer’s MAP test. Behavior Research Methods, Instrumentation, and Computers, 32, 396-402.
Oishi, S., Diener, E., Scollon, C. N. e Biswas-Diener, R. (2004).
Cross-situational consistency of affective experiences
across cultures. Journal of Personality and Social Psychology,
86, 460-472.
Osgood, D. W., McMorris, B. J. e Potenza, M. T. (2002). Analyzing multiple-item measures of crime and deviance I:
Item response theory scaling. Journal of Quantitative Criminology, 18, 267-296.
Osterlind, S. J. (1989). Constructing Test Items. Hingham, MA:
Ostini, R. e Nering, M. L. (2006). Polytomous Item Response
Theory Models. Sage University Paper Series on Quantitative
Applications in the Social Sciences, Series no. 07-144. Thousand
Oaks, CA: Sage.
Ozer, D. (1989). Construct validity in personality assessment.
In D. M. Buss e N. Cantor (Eds.), Personality Psychology: Recent Trends and Emerging Directions (pp. 224-234). New
York: Springer-Verlag.
Ozer, D. J. e Reise, S. P. (1994). Personality assessment. Annual Review of Psychology, 45, 357-388.
Pace, C. R. (1994). College Student Experiences Questionnaire (3rd
ed.). Bloomington: Indiana University, Center for Postsecondary Research and Planning.
Packard, V. (1957). The Hidden Persuaders. New York: David
McKay Company (trad. it. a cura di Carlo Fruttero, I persuasori occulti, Milano: Il Saggiatore, 1968).
Palmer, E. M., Horowitz, T. S., Torralba, A. e Wolfe, J. M.
(2011). What are the shapes of response time distributions
in visual search? Journal of Experimental Psychology: Human
Perception and Performance, 37, 58-71.
Parducci, A. (1965). Category judgment: a range-frequency
model. Psychological Review, 72, 402-418.
Pareek, U. e Rao, T. V. (1980). Cross-cultural surveys and interviewing. In H. C. Triandis & J. E. Berry (Eds.), Handbook
of Cross-Cultural Psychology, Vol. 2: Methodology (pp. 127-180). Boston: Allyn & Bacon.
Paterson, D. G., Eliot, R. M., Anderson, L. D., Toops, H. A. e
Heidbreder, E. (Eds.), Minnesota Mechanical Ability Tests.
Minneapolis: University of Minnesota Press.
Patterson, G. R. (1993). Orderly change in a stable world: The
antisocial trait as a chimera. Journal of Consulting and Clinical Psychology, 61, 911-919.
Paulhus, D. L. (1984). Two-component models of socially desirable responding. Journal of Personality and Social Psychology, 46, 598-609.
Paulhus, D. L. (2002). Socially desirable responding: The evolution of a construct. In H. I. Braun, D. N. Jackson e D. E.
Wiley (Eds.), The Role of Constructs in Psychological and Educational Measurement (pp. 49-69). Mahwah, NJ: Erlbaum.
Paulhus, D. L. e Bruce, N. (1990). Validation of the OCQ: An
initial study. Presented at the meeting of Canadian Psychological
Association, Ottawa.
Paulhus, D. L. e John, O. P. (1998). Egoistic and moralistic
bias in self-perceptions: The interplay of self-deceptive
styles with basic traits and motives. Journal of Personality,
66, 1024-1060.
Paulhus, D. L. e Reid, D. B. (1991). Enhancement and denial
in socially desirable responding. Journal of Personality and
Social Psychology, 60, 307-317.
Pearson, K. (1900). Mathematical contributions to the theory
of evolution. VII. On the correlation of characters not
quantitatively measurable. Philosophical Transactions of the
Royal Society of London, Series A, 195, 1-47.
Pearson, K. (1895). Contributions to the mathematical theory
of evolution. Philosophical Transactions of the Royal Society of
London, 91, 343-358.
Pearson, K. (1901). On lines and planes of closest fit to systems
of points in space. Philosophical Magazine 2(6), 559-572.
Pedrabissi, L. e Santinello, M. (1997). I test psicologici. Teorie e
tecniche. Bologna: Il Mulino.
Penfield, R. D. (2001). Assessing differential item functioning among multiple groups: a comparison of three Mantel-Haenszel procedures. Applied Measurement in Education, 14, 235-259.
Penfield, R. D. (2003). Application of the Breslow-Day test
of trend in odds ratio heterogeneity to the detection of
nonuniform DIF. Alberta Journal of Educational Research,
49, 231-243.
Peng, C.-Y. J., Harwell, M., Liou, S.-M. e Ehman, L. H. (2006).
Advances in missing data methods and implications for
educational research. In S. Sawilowsky (Ed.), Real Data
Analysis (pp. 31-78). Greenwich, CT: Information Age.
Pett, M. A., Lackey, N. R. e Sullivan, J. J. (2003). Making Sense
of Factor Analysis: The Use of Factor Analysis for Instrument Development in Health Care Research. London: Sage.
Phillips, D. L. e Clancy, K. J. (1972). Some effects of social
desirability in survey studies. American Journal of Sociology,
77, 921-940.
Piaget, J. (1926). La représentation du monde chez l’enfant. Paris: Alcan.
Piazza, T. (1980). The analysis of attitude items. American Journal of Sociology, 86, 584-603.
Pick, A. (1891). Ueber primäre chronische Demenz (so. Dementia praecox) im jugendlichen Alter. Prager Medicinische
Wochenschrift, 16, 312-315.
Pigott, T. D. (2001). A review of methods for missing data.
Educational Research and Evaluation, 7, 353-383.
Pilkonis, P. A., Kim, Y., Proietti, J. M. e Barkham, M. (1996).
Scales for personality disorders developed from the Inventory of Interpersonal Problems. Journal of Personality Disorders, 10, 355-369.
Pincus, A. L., Ansell, E. B., Pimentel, C. A., Cain, N. M.,
Wright, A. G. C. e Levy, K. N. (2009). Initial construction
and validation of the Pathological Narcissism Inventory.
Psychological Assessment, 21, 365-379.
Pintner, R. e Paterson, D. G. (1917). A Scale of Performance
Tests. New York: Appleton.
Podsakoff, P.M., MacKenzie, S.B., Lee, J.Y. e Podsakoff, N.P.
(2003). Common method biases in behavioral research: a
critical review of the literature and recommended remedies. Journal of Applied Psychology, 88(5), 879-903.
Pointer, M. R. (2003). New directions – Soft metrology requirements for support from mathematics statistics and software. NPL
Report CMSC 20/03.
Popham, W. J. (1978). Criterion-Referenced Measurement. Englewood Cliffs NJ: Prentice-Hall.
Popham, W. J. (1994). The instructional consequences of criterion-referenced clarity. Educational Measurement: Issues
and Practice, 13, 15-18, 30.
Porteus, S. D. (1915a). Motor Intellectual Tests for Mental
Defectives. Journal of Experimental Pedagogy, 3, 127-135.
Porteus, S. D. (1915b). Mental Tests for Feeble-Minded: A
New Series. Journal of Psycho-Aesthenics, 19, 200-213.
Preacher, K. J. e MacCallum, R. C. (2003). Repairing Tom
Swift’s electric factor analysis machine. Understanding Statistics, 2, 13-32.
Pressey, S. L. e Pressey, L. W. (1919). Cross-Out Test, with
suggestions as to a group scale of the emotions. Journal of
Applied Psychology, 3, 138-150.
Preston, C. C. e Colman, A. M. (2000). Optimal number of
response categories in rating scales: reliability, validity, discriminating power, and respondent preferences. Acta Psychologica, 104, 1-15.
Prezza, M., Trombaccia, F. R. e Armento, L. (1997). La scala
dell'autostima di Rosenberg: traduzione e validazione italiana. Bollettino di Psicologia Applicata, 223, 35-44.
Purghé, F. (1997). Metodi di psicofisica e di scaling unidimensionale. Torino: Bollati-Boringhieri.
Pyle, W. H. (1913). Examination of School Children. New York:
Raîche, G. (2005). Critical eigenvalue sizes in standardized
residual principal components analysis. Rasch Measurement
Transactions, 19, 10-12.
Raîche, G., Riopel, M. e Blais, J.-G. (2006). Non graphical
solutions for the Cattell’s scree test. Proceedings of the International Meeting of the Psychometric Society, Montréal, June
Raju, N. S. (1988). The area between two item characteristic
curves. Psychometrika, 54, 495-502.
Raju, N. S. (1990). Determining the significance of estimated
signed and unsigned areas between two item response
functions. Applied Psychological Measurement, 14, 197-207.
Rammstedt, B. e John, O. P. (2007). Measuring personality
in one minute or less: A 10-item short version of the Big
Five Inventory in English and German. Journal of Research
in Personality, 41, 203-212.
Rao, C. R. (1964). The use and interpretation of principal component analysis in applied research. Sankhya A, 26, 329-358.
Rasch, G. (1960). Probabilistic Models for Some Intelligence and
Attainment Tests. Copenhagen: Danish Institute for Educational Research.
Raskin, R. e Hall, C. S. (1979). A narcissistic personality inventory. Psychological Reports, 45, 590.
Raykov, T. e Little, T. D. (1999). A note on Procrustean rotation in exploratory factor analysis: A computer intensive
approach to goodness-of-fit evaluation. Educational and
Psychological Measurement, 59, 47-57.
Raykov, T. (1997). Estimation of composite reliability for
congeneric measures. Applied Psychological Measurement,
21, 173-184.
Ream, M. J. (1924). Ability to Sell: Its Relation to Certain Aspects
of Personality and Experience. Baltimore: Williams and Wilkins.
Recommendations for Getting the Most From Your Analysis.
Practical Assessment, Research & Evaluation, 10(7), Available
Reed, J. (1987). Robert M. Yerkes and the mental testing movement. In M. M. Sokal (Ed.), Psychological Testing and American Society: 1890-1930 (pp. 75-94). New Brunswick: Rutgers University Press.
Reise, S. P. e Waller, N. G. (2003). How many IRT parameters
does it take to model psychopathology items? Psychological
Methods, 8(2), 164-184.
Reiser, B. J., Black, J. B. e Kalamarides, P. (1986). Strategic
memory search processes. In D. C. Rubin (Ed.), Autobiographical Memory (pp. 100-121). New York: Cambridge
University Press.
Reitan, R. M. e Wolfson, D. (1993). The Halstead-Reitan Neuropsychological Test Battery: Theory and Clinical Interpretation
(2nd ed.). Tucson, AZ: Neuropsychology Press.
Remmers, H. H. (1929). The measurement of interest differences between students of engineering and agriculture.
Journal of Applied Psychology, 13, 105-119.
Rennie, L. J. (1982). Detecting a response set to Likert-style
attitude items with the rating model. Educational Research
Perspectives, 9, 114-118.
Revelle, W. (1979). Hierarchical cluster-analysis and the internal structure of tests. Multivariate Behavioral Research,
14(1), 57-74.
Revelle, W. e Zinbarg, R. E. (2009). Coefficients alpha, beta,
omega and the glb: comments on Sijtsma. Psychometrika,
74(1), 145-154.
Reynolds-Keefer, L., Johnson, R., Dickenson, T. e McFadden,
L. (2009). Validity issues in the use of pictorial Likert scales. Studies in Learning, Evaluation, Innovation and Development, 6, 15-24.
Richardson, J. T. E. (2005). Knox’s cube imitation test: A historical review and an experimental analysis. Brain and
Cognition, 59, 183-213.
Richardson, L. F. e Ross, J. S. (1930). Loudness and telephone
current. Journal of General Psychology, 3, 288-306.
Richardson, M. W. (1936). The relation between the difficulty
and the differential validity of a test. Psychometrika, 1, 33-49.
Roberts, D. M. (2000). Face validity: Is there a place for this
in measurement? Shiken: The JALT Testing and Evaluation
SIG Newsletter, 4(2), 5-6.
Roberts, J. K. (1998). Thurstone’s Method of Equal-Appearing Intervals in Measuring Attitudes: An Old Method That is Not Forgotten (Report No. ED 426 085). Texas A&M University
(ERIC Document Reproduction Service No. TM 029 313).
Robins, R. W., Hendin, H. M. e Trzesniewski, K. H. (2001).
Measuring global self-esteem: construct validation of a
single-item measure and the Rosenberg Self-Esteem Scale. Personality and Social Psychology Bulletin, 27(2), 151-161.
Roff, M. (1935). Some properties of the communality in multiple factor theory. Psychometrika, 1(2), 1-6.
Rogers, H. J. e Swaminathan, H. (1993). A comparison of logistic regression and the Mantel-Haenszel procedures for
detecting differential item functioning. Applied Psychological Measurement, 17, 105-116.
Rogers, H. J. e Swaminathan, H. (1994, April). Logistic Regression Procedures for Detecting DIF in non-Dichotomous Item Responses. Paper presented at the Annual Meeting of the
American Educational Research Association, New Orleans.
Rogosa, D., Floden, R. e Willett, J. B. (1984). Assessing the
stability of teacher behavior. Journal of Educational Psychology, 76, 1000-1027.
Rosanoff, A. (1920). Manual of Psychiatry. New York: John Wiley & Sons, Inc.
Rosenberg, M. (1965). Society and the Adolescent Self-Image.
Princeton: Princeton University Press.
Rosenthal, R., Rosnow, R. L. e Rubin, D. B. (2000). Contrasts
and Effect Sizes in Behavioral Research: A Correlational Approach. New York: Cambridge University Press.
Ross, M. (1989). The relation of implicit theories to the construction of personal histories. Psychological Review, 96,
Rossi, G. B. (2006). An attempt to interpret some problems
in measurement science on the basis of Kuhn’s theory of
paradigms. Measurement, 39, 512-521.
Rossi, G. B. (2007). Measurability. Measurement, 40, 545-562.
Rossi, G. B., Crenna, F. e Codda, M. (2003). Measurement of
quantities depending upon perception by jury-test methods. Measurement, 34, 57-66.
Roth, P. L. (1994). Missing data: A conceptual review for applied psychologists. Personnel Psychology, 47, 537-570.
Rothman, A. J., Haddock, G. e Schwarz, N. (2001). How many
partners is too many? Shaping perceptions of personal vulnerability. Journal of Applied Social Psychology, 31, 2195-2214.
Rowley, G. (1989). Assessing error in behavioral data: Problems of sequencing. Journal of Educational Measurement,
26, 273-284.
Rubin, D. B. (1976). Inference and missing data. Biometrika,
63, 581-592.
Rulison, K. L. e Loken, E. (2009). I've fallen and I can't get
up: Can high ability students recover from early mistakes
in Computer Adaptive Testing? Applied Psychological Measurement, 33, 83-101.
Rulon, P. J. (1939). A simplified procedure for determining
the reliability of a test with two split halves. Harvard Educational Review, 9, 99-103.
Rulon, P. J. (1946). On the validity of educational tests. Harvard Educational Review, 16, 290-296.
Rundquist, E. A. e Sletto, R. F. (1936). Personality in the Depression. Minneapolis: University of Minnesota Press.
Russell, B. (1897). On the relations of number and quantity.
Mind, 6, 326-341.
Russell, J. A. e Carroll, J. M. (1999). On the bipolarity of positive and negative affect. Psychological Bulletin, 125, 3-30.
Sackeim, H. A. e Gur, R. C. (1978). Self-deception, other-deception and consciousness. In G. E. Schwartz e D. Shapiro
(Eds.), Consciousness and Self-regulation: Advances in Research
(Vol. 2; pp. 139-197). New York: Plenum Press.
Samejima, F. (1969). Estimation of Latent Ability Using a Response
Pattern of Graded Scores (Psychometric Monograph No. 17). Richmond, VA: Psychometric Society.
Samelson, F. (1987). Was early mental testing (a) Racist Inspired (b) Objective Science (c) A Technology for Democracy (d) The Origin of the Multiple-Choice Exams (e)
None of the Above (Mark the RIGHT Answer). In M. M.
Sokal (Ed.), Psychological Testing and American Society: 1890-1930 (pp. 113-127). New Brunswick: Rutgers University Press.
Sands, W., Waters, B. K. e McBride, J. R. (Eds.) (1997). Computerized Adaptive Testing: From Inquiry to Operation. Washington, DC: American Psychological Association.
Sartori, R. (2003). La valutazione della personalità tramite
test. Quaderni Dipav, 6, 159-178.
Sartori, R. (2010). Face validity in personality tests: Psychometric instruments and projective techniques in comparison. Quality and Quantity, 44, 749-759.
Sartori, R. e Pasini, M. (2007). Quality and quantity in test
validity: How can we be sure that psychological tests measure what they have to? Quality and Quantity, 41, 359-374.
Sass, D. A. e Schmitt, T. A. (2010). A comparative investigation of rotation criteria within exploratory factor analysis.
Multivariate Behavioral Research, 45, 73-103.
Satorra, A. e Bentler, P. M. (1988b). Scaling Corrections for Statistics in Covariance Structure Analysis (UCLA Statistics Series
#2). Los Angeles, CA: University of California.
Satorra, A. e Bentler, P. M. (2001). A scaled difference chi-square test statistic for moment structure analysis. Psychometrika, 66(4), 507-514.
Saucier, G. (1994). Mini-markers: A brief version of Goldberg’s unipolar Big-Five markers. Journal of Personality Assessment, 63, 506-516.
Saunders, D. R. (1953). An analytic method for rotation to
orthogonal simple structure. Research Bulletin, 53-10, Princeton, NJ: Educational Testing Service.
Saunders, D. R. (1961). The rationale for an oblimax method of transformation in factor analysis. Psychometrika,
26, 317-324.
Savardi, U. (Ed.) (2009). The Perception and Cognition of Contraries. Milano: McGraw-Hill.
Savardi, U. e Bianchi I. (2009). The spatial path to contrariety.
In U. Savardi (Ed.). The Perception and Cognition of Contraries
(pp. 63-92). Milano: McGraw-Hill.
Savardi, U., Bianchi, I. e Burro, R. (2009). From opposites to
dimensions: filling in the gaps. In U. Savardi (Ed.). The
Perception and Cognition of Contraries (pp. 275-294). Milano:
McGraw-Hill.
Schacter, D. L. (1999). The seven sins of memory. Insights
from psychology and cognitive neuroscience. American
Psychologist, 54, 182-203.
Schafer, J. L. (1999). Multiple imputation: A primer. Statistical
Methods in Medical Research, 8, 3-15.
Schermelleh-Engel, K., Moosbrugger, H. e Müller, H. (2003).
Evaluation of the fit of structural equation models: Test
of significance and descriptive goodness-of-fit measures.
Methods of Psychological Research Online, 8, 23-74.
Schlomer, G. L., Bauman, S. e Card, N. A. (2010). Best practices for missing data management in counseling psychology. Journal of Counseling Psychology, 57(1), 1-10.
Schmid, J. e Leiman, J. M. (1957). The development of hierarchical factor solutions. Psychometrika, 22, 53-61.
Schmitt, N. (1996). Uses and abuses of coefficient alpha. Psychological Assessment, 8, 350-353.
Schoemaker, P. J. H. (1982). The expected utility model: Its
variants, purposes, evidence and limitations. Journal of Economic Literature, 20, 529-563.
Schönemann, P. H. (1966). A generalized solution of the
orthogonal Procrustes problem. Psychometrika, 31, 1-10.
Schriesheim, C. A. ed Eisenbach, R. J. (1995). An exploratory and confirmatory factor-analytic investigation of
item wording effects on the obtained factor structures of
survey questionnaire measures. Journal of Management,
21, 1177-1193.
Schriesheim, C. A. e Hill, K. D. (1981). Controlling acquiescence response bias by item reversals: The effect on
questionnaire validity. Educational and Psychological Measurement, 41, 1101-1114.
Schriesheim, C. A., Eisenbach, R. J. e Hill, K. D. (1991). The
effect of negation and polar opposite item reversals on questionnaire reliability and validity: An experimental investigation. Educational and Psychological Measurement, 51, 67-78.
Schutte, N. S., Malouff, J. M., Hall, L. E., Haggerty, D. J., Cooper, J. T., Golden, C. J. et al. (1998). Development and
validation of a measure of emotional intelligence. Personality and Individual Differences, 25, 167-177.
Schwarz, G. E. (1978). Estimating the dimension of a model.
Annals of Statistics, 6(2), 461-464.
Schwarz, N. (1999). Self-reports. How the questions shape
the answers. American Psychologist, 54, 93-105.
Schwarz, N. e Oyserman, D. (2001). Asking questions about
behavior: cognition, communication, and questionnaire
construction. American Journal of Evaluation, 22, 127-160.
Schwarz, N. e Scheuring, B. (1992). Selbstberichtete Verhaltens- und Symptomhäufigkeiten: Was Befragte aus Antwortvorgaben des Fragebogens lernen. Zeitschrift für Klinische
Psychologie, 22, 197-208.
Schwarz, N., Hippler, H. J. e Noelle-Neumann, E. (1994). Retrospective reports: The impact of response alternatives.
In N. Schwarz e S. Sudman (Eds.), Autobiographical Memory
and the Validity of Retrospective Reports (pp. 187-202). New
York: Springer Verlag.
Schwarz, N., Knauper, B., Hippler, H. J., Noelle-Neumann, E.
e Clark, F. (1991). Rating scales: Numeric values may
change the meaning of scale labels. Public Opinion Quarterly, 55, 570-582.
Scott, W. D. (1903). The Psychology of Advertising in Theory and
Practice. Boston: Small, Maynard & Co.
Sechrest, L. (1963). Incremental validity: A recommendation.
Educational and Psychological Measurement, 23, 155-158.
Seguin, E. (1846). Traitement moral, hygiène et éducation des idiots
et des autres enfants arriérés. Paris: Baillière.
Seguin, E. (1866). Idiocy: and its Treatment by the Physiological
Method. New York: William Wood & Co.
Shapiro, A. e Ten Berge, J. M. F. (2002). Statistical inference of
minimum rank factor analysis. Psychometrika, 67(1), 79-94.
Shapiro, S. E., Lasarev, M. R. e McCauley, L. (2002). Factor
analysis of Gulf War illness: What does it add to our understanding of possible health effects of deployment? American Journal of Epidemiology, 156, 578-585.
Shapiro, S. S. e Wilk, M. B. (1965). An analysis of variance test
for normality (complete samples). Biometrika, 52, 591-611.
Sharp, S. E. (1899). Individual psychology: A study in psychological method. American Journal of Psychology, 10, 329-391.
Shavelson, R. J., Webb, N. e Rowley, G. L. (1989). Generalizability theory. American Psychologist, 44(6), 922-932.
Shealy, R. e Stout, W. F. (1993). A model-based standardization approach that separates true bias/DIF from group differences and detects test bias/DTF as well as item bias/DIF.
Psychometrika, 58, 159-194.
Shepard, R. N. (1962). The analysis of proximities: Multidimensional scaling with an unknown distance function. I.
Psychometrika, 27, 125-140.
Shepard, R. N. (1980). Multidimensional scaling, tree-fitting,
and clustering. Science, 210, 390-398.
Shepard, R. N. (1987). Toward a universal law of generalization for psychological science. Science, 237, 1317-1323.
Shepard, R. N., Romney, A. K. e Nerlove, S. (Eds.) (1972).
Multidimensional Scaling: Theory and Applications in the Behavioral Sciences. New York: Seminar Press.
Shrout, P. E. e Fleiss, J. L. (1979). Intraclass correlations: Uses
in assessing rater reliability. Psychological Bulletin, 86, 420-428.
Sijtsma, K. (2009). On the use, the misuse, and the very limited usefulness of Cronbach’s alpha. Psychometrika, 74,
Silverstein, A. B. (1990). Short forms of individual intelligence tests. Psychological Assessment, 2, 3-11.
Sireci, S. G. (1998). The construct of content validity. Social
Indicators Research, 45, 83-177.
Siu, M. K. (2004). Official Curriculum in Mathematics in Ancient China: How Did Candidates Study for the Examination?. In F. Lianghuo, W. Ngai-Ying, C. Jinfa e L. Shiqi
(Eds.), How Chinese Learn Mathematics: Perspective From Insiders (Vol. I) (pp. 157-185). Singapore: World Scientific
Publishing, Co.
Sloan, J. A., Aaronson, N., Cappelleri, J. C., Fairclough, D.
L., Varricchio, C. and the Clinical Significance Consensus
Meeting Group (2002). Assessing the clinical significance
of single items relative to summated scores. Mayo Clinic
Proceedings, 77, 479-487.
Slocum-Gori, S. L. e Zumbo, B. D. (2010). Assessing the unidimensionality of psychological scales: Using multiple criteria from factor analysis. Social Indicators Research, DOI
Smirnov, N. V. (1939). On the estimation of the discrepancy
between empirical curves of distribution for two independent samples. Bulletin Moscow University (Math), II, 3-16.
Smith, G. T. e McCarthy, D. M. (1995). Methodological considerations in the refinement of clinical assessment instruments. Psychological Assessment, 7, 300-308.
Smith, E. V., Jr. (2002). Detecting and evaluating the impact
of multidimensionality using item fit statistics and principal component analysis of residuals. Journal of Applied Measurement, 3, 205-231.
Smith, G. T., McCarthy, D. M. e Anderson, K. G. (2000). On
the sins of short-form development. Psychological Assessment, 12, 102-111.
Smith, R. M., Linacre, J. M. e Smith, Jr., E.V. (2003). Guidelines
for Manuscripts. Journal of Applied Measurement, 4, 198-204.
Snyder, M. (1974). Self-monitoring of expressive behavior.
Journal of Personality and Social Psychology, 30, 526-537.
Snyder, M. e Ickes, W. (1985). Personality and social behavior. In G. Lindzey ed E. Aronson (Eds.), Handbook of Social Psychology, Vol. 2, 3d ed. (pp. 883-948). New York:
Random House.
Sokal, M. M. (1982). James McKeen Cattell and the future of
anthropometric mental testing. In W. R. Woodward e
M. G. Ash (Eds.), The Problematic Science: Psychology in Nineteenth-Century Thought (pp. 322-345). New York: Praeger.
Somerfield, M. e Curbow, B. (1992). Methodological issues
and research strategies in the study of coping with cancer.
Social Science & Medicine, 34, 1203-1216.
Soto, C. J. e John, O. P. (2009). Ten facet scales for the Big
Five Inventory: Convergence with NEO PI-R facets, self-peer agreement, and discriminant validity. Journal of Research in Personality, 43, 84-90.
Spearman, C. (1904a). General intelligence objectively determined and measured. American Journal of Psychology,
15, 201-293.
Spearman, C. (1904b). The proof and measurement of association between two things. The American Journal of Psychology, 15(1), 72-101.
Spearman, C. (1910). Correlation calculated from faulty data.
British Journal of Psychology, 3(3), 271-295.
Spearman, C. E. (1927). The Abilities of Man, their Nature and
Measurement. New York: Macmillan.
Spector, P. E. (1992). Summated Rating Scale Construction: An
Introduction (Sage University Paper Series on Quantitative Applications in the Social Sciences, Series no. 07-82). Thousand
Oaks, CA: Sage.
Spector, P. E. (1976). Choosing response categories for summated rating scales. Journal of Applied Psychology, 61, 374-375.
Speer, D. C. e Greenbaum, P. E. (1995). Five methods for computing significant individual client change and improvement
rates: support for an individual growth curve approach. Journal of Consulting and Clinical Psychology, 63, 1044-1048.
Spence, K. W. (1960). Behavior Theory and Learning. Englewood Cliffs, NJ: Prentice-Hall.
Spieler, D. H., Balota, D. A. e Faust, M. E. (1996). Stroop performance in healthy younger and older adults and in individuals with dementia of the Alzheimer’s type. Journal
of Experimental Psychology: Human Perception and Performance, 22, 461-479.
Spirrison, C. L. (1994). Factorial hue and cry: Comments on
Jane Loevinger’s “Has Psychology lost its conscience?”
Journal of Personality Assessment, 63, 579-583.
Sprague, E. K. (1914). Mental examination of immigrants.
The Survey, 31, 466-468.
Steiger, J. H. (1979). The relationship between external variables and common factors. Psychometrika, 44, 93-97.
Steiger, J. H. e Lind, J. C. (1980, May). Statistically based tests
for the number of common factors. Paper presented at the annual meeting of the Psychometric Society, Iowa City, IA.
Stendhal (1830/1998). Le rouge et le noir (Italian translation: Il rosso e il
nero. Colognola ai Colli [VR]: Demetra).
Stern, W. (1912). Die psychologischen Methoden der Intelligenzprüfung und deren Anwendung an Schulkindern 5. Berlin:
Kongreß für Experimentelle Psychologie.
Sternberg, R. J. (1987). Teorie dell’intelligenza. Milano: Bompiani.
Stevens, S. S. (1935a). The operational definition of psychological terms. Psychological Review, 42, 517-527.
Stevens, S. S. (1935b). The operational basis of psychology.
American Journal of Psychology, 47, 323-330.
Stevens, S. S. (1936a). Psychology: The propaedeutic science.
Philosophy of Science, 3, 90-103.
Stevens, S. S. (1936b). A scale for the measurement of a
psychological magnitude: Loudness. Psychological Review,
43, 405-416.
Stevens, S. S. (1946). On the theory of scales and measurement. Science, 103, 667-680.
Stevens, S. S. e Davies, H. (1938). Hearing. Its Psychology and
Physiology. New York: John Wiley and Sons, Inc.
Stocking, M. L. e Lord, F. M. (1983). Developing a common
metric in item response theory. Applied Psychological Measurement, 7, 201-210.
Strauss, M. E. e Smith, G. T. (2009). Construct validity: Advances in theory and methodology. Annual Review of Clinical Psychology, 5, 1-25.
Strelau, J. (1987). Emotion as a key concept in temperament
research. Journal of Research in Personality, 21, 510-528.
Strong, E. K., Jr. (1927). Vocational Interest Blank. Stanford:
Stanford University Press.
Stuive, I., Kiers, H. A. L. e Timmerman, M. E. (2009). A comparison of methods for adjusting incorrect assignments of
items to subtests: Oblique multiple group method versus
confirmatory common factor method. Educational and Psychological Measurement, 69, 948-965.
Suen, H. K. (1990). Principles of Test Theories. Hillsdale, NJ: Erlbaum.
Suler, J. (2004). The online disinhibition effect. CyberPsychology and Behavior, 7, 321-326.
Swaminathan, H. e Rogers, H. J. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement, 27, 361-370.
Swets, J. A. (1961). Is there a sensory threshold? Science,
134, 168-177.
Swets, J. A., Dawes, R. M. e Monahan, J. (2000). Psychological science can improve diagnostic decisions. Psychological Science in the Public Interest, 1(1), 1-26.
Swets, J. A., Tanner, W. P., Jr. e Birdsall, T. G. (1961). Decision
processes in perception. Psychological Review, 68, 301-340.
Sydenham, P. H. (1979). Measuring Instruments: Tools of Knowledge and Control. London: Peter Peregrinus Ltd in association with the Science Museum.
Sydenham, P. H. (2003). Relationship between measurement,
knowledge and advancement. Measurement, 34, 3-16.
Sylvester, R. H. (1918). The Form Board Tests. Psychological
Monographs, 15(4), Whole No. 65.
Symonds, P. M. (1924). On the loss of reliability in ratings
due to coarseness of the scale. Journal of Experimental Psychology, 7, 456-461.
Tanaka, J. S. (1987). How big is big enough? Sample size and
goodness-of-fit in structural equation models with latent
variables. Child Development, 58, 134-146.
Tanner, W. P., Jr. e Swets, J. A. (1954). A decision-making
theory of visual detection. Psychological Review, 61, 401-409.
Tanzer, N. K. e Sim, C. O. E. (1999). Adapting instruments
for use in multiple languages and cultures: A review of
the ITC Guidelines for test adaptations. European Journal
of Psychological Assessment, 15, 258-269.
Taras, V. e Kline, T. (2010). Scale validation via quantifying
item validity using the Dm index. Psychological Reports,
107, 535-546.
Tataryn, D. J., Wood, J. M. e Gorsuch, R. L. (1999). Setting
the value of k in promax: A Monte Carlo study. Educational
and Psychological Measurement, 59, 384-391.
Tavares, H. R., de Andrade, D. F. e Pereira, C. A. (2004). Detection of determinant genes and diagnostic via item response theory. Genetics and Molecular Biology, 27, 679-685.
Ten Berge, J. M. F. (1998). Some recent developments in factor analysis and the search for proper communalities. In
A. Rizzi, M. Vichi e H. H. Bock (Eds), Advances in Data Science and Classification (pp. 325-334). Heidelberg: Springer.
Ten Berge, J. M. F. e Hofstee, W. K. B. (1999). Coefficients
alpha and reliabilities of unrotated and rotated components. Psychometrika, 64(1), 83-90.
Ten Berge, J. M. F. e Sočan, G. (2004). The greatest lower
bound to the reliability of a test and the hypothesis of unidimensionality. Psychometrika, 69(4), 613-625.
Ten Berge, J. M. F. e Zegers, F. E. (1978). A series of lower
bounds to the reliability of a test. Psychometrika, 43(4), 575-579.
Ten Berge, J., Krijnen, W., Wansbeek, T. e Shapiro, A. (1999).
Some new results on correlation-preserving factor scores
prediction methods. Linear Algebra and its Applications,
289, 311-318.
Teng, S. (1942-43). Chinese influence on the Western examination system. Harvard Journal of Asiatic Studies, 7, 267-312.
Terman, L. M. e Childs, H. G. (1912a). A tentative revision
and extension of the Binet-Simon measuring scale of Intelligence. Journal of Educational Psychology, 3(2), 61-74.
Terman, L. M. e Childs, H. G. (1912b). A tentative revision
and extension of the Binet-Simon measuring scale of intelligence. Part II. Supplementary tests. 1. Generalization
test: interpretation of fables. Journal of Educational Psychology, 3(3), 133-143.
Terman, L. M. e Childs, H. G. (1912c). A tentative revision
and extension of the Binet-Simon measuring scale of intelligence. Part II. Supplementary tests – continued. Journal of Educational Psychology, 3(4), 198-208.
Terman, L. M. e Childs, H. G. (1912d). A tentative revision
and extension of the Binet-Simon measuring scale of intelligence. Part III. Summary and criticisms. Journal of Educational Psychology, 3(5), 277-289.
Terman, L. M. (1917). A trial of mental and pedagogical tests
in a civil service examination for policemen and firemen.
Journal of Applied Psychology, 1, 9-16.
Terman, L. M. (1906). Genius and stupidity: a study of some
of the intellectual processes of seven “bright” and seven
“stupid” boys. Pedagogical Seminary, 13, 307-373.
Terman, L. M. (1915). The mental hygiene of exceptional
children. Pedagogical Seminary, 22, 529-537.
Terman, L. M. (1916). The Measurement of Intelligence. Boston:
Houghton Mifflin.
Terman, L. M. (1917). The intelligence quotient of Francis Galton in childhood. American Journal of Psychology, 28, 209-215.
Terman, L. M. (1922). A new approach to the study of genius.
Psychological Review, 29(4), 310-318.
Terr, L. (1988). Editor’s note. Child sexual abuse: Why the
controversy? Journal of the American Academy of Child and
Adolescent Psychiatry, 27, 788.
Thissen, D., Steinberg, L. e Wainer, H. (1993). Detection of
differential item functioning using the parameters of item
response models. In P. Holland e H. Wainer (Eds.), Differential Item Functioning (pp. 67-114). Hillsdale, NJ: Erlbaum.
Thompson, B. (2004). Exploratory and Confirmatory Factor Analysis. Washington, DC: American Psychological Association.
Thomson, G. H. (1934). Hotelling’s method modified to give
Spearman’s g. Journal of Educational Psychology, 25, 366-374.
Thomson, G. H. (1940). Weighting for battery reliability and
prediction. Psychometrika, 5, 335-345.
Thomson, W. (1889). Popular Lectures and Addresses (Vol. I).
New York: McMillan & Co.
Thordarson, D. S., Radomsky, A. S., Rachman, S., Shafran,
R., Sawchuck, C. N. e Hakstian, A. R. (2004). The Vancouver Obsessional Compulsive Inventory (VOCI). Behaviour Research and Therapy, 42, 1289-1314.
Thorndike, E. L. (1904). An Introduction to the Theory of Mental
and Social Measurements. New York: Science Press.
Thorndike, E. L. (1918). The nature, purposes, and general
methods of measurement of educational products. In S. A.
Courtis (Ed.), The Measurement of Educational Products (17th
Yearbook of the National Society for the Study of Education, Pt. 2. pp. 16-24). Bloomington, IL: Public School.
Thorndike, E. L. (1931). Measurement of Intelligence. Columbia
University, New York: Bureau of Publishers.
Thorndike, E. L. (1935). Adult Interests. New York: Macmillan.
Thorndike, E. L., Bregman, E. O., Tilton, J. W. e Woodyard,
E. (1928). Adult Learning. New York: Macmillan.
Thorndike, R. L. e Hagen, E. (1977). Measurement and Evaluation in Education and Psychology (4th ed.). New York: Wiley.
Thurstone, L. L. (1927). A law of comparative judgment. Psychological Review, 34, 273-286.
Thurstone, L. L. (1927a). The method of paired comparisons
for social values. Journal of Abnormal and Social Psychology,
21(4), 384-400.
Thurstone, L. L. (1928). Attitudes can be measured. American
Journal of Sociology, 33, 529-554.
Thurstone, L. L. (1931a). The indifference function. Journal
of Social Psychology, 2, 139-167.
Thurstone, L. L. (1931b). Multiple factor analysis. Psychological
Review, 38, 406-427.
Thurstone, L. L. (1932). The Reliability and Validity of Tests. Ann
Arbor, Michigan: Edwards Brothers.
Thurstone, L. L. (1933a). A Simplified Multiple Factor Method
and an Outline of the Computations. Chicago: University of
Chicago Bookstore.
Thurstone, L. L. (1933b). The Theory of Multiple Factors. Chicago: University of Chicago Bookstore.
Thurstone, L. L. (1934). The vectors of mind. Psychological Review, 41, 1-32.
Thurstone, L. L. (1935). The Vectors of Mind. Chicago: University of Chicago Press.
Thurstone, L. L. (1935b). Multiple Factor Analysis. Chicago:
University of Chicago Press.
Thurstone, L. L. (1938). Primary Mental Abilities. Chicago: University of Chicago Press.
Thurstone, L. L. (1945). A multiple group method of factoring
the correlation matrix. Psychometrika, 10, 73-78.
Thurstone, L. L. (1947). Multiple Factor Analysis (2nd ed.). Chicago: University of Chicago Press.
Thurstone, L. L. (1949). Note about the multiple group method. Psychometrika, 14, 43-45.
Thurstone, L. L. (1959). The Measurement of Values. Chicago:
University of Chicago Press.
Thurstone, L. L. e Thurstone, T. G. (1930). A neurotic inventory. Journal of Social Psychology, 1, 3-30.
Thurstone, L. L. e Thurstone, T. G. (1941). Factorial Studies of
Intelligence. Chicago: University of Chicago Press.
Tiegs, E. W., Clark, W. W. e Thorpe, L. P. (1941). The California Test of Personality. Journal of Educational Research,
35, 102-108.
Timmerman, M. E. e Lorenzo-Seva, U. (2011). Dimensionality assessment of ordered polytomous items with parallel
analysis. Psychological Methods, 16, 89-92.
Tingey, R. C., Lambert, M. L., Burlingame, G. M. e Hansen,
N. B. (1996a). Assessing clinical significance: Proposed extensions to the method. Psychotherapy Research, 6, 109-123.
Tinsley, H. E. A. e Weiss, D. J. (1975). Interrater reliability
and agreement of subjective judgements. Journal of Counseling Psychology, 22, 358-376.
Tomlinson-Keasey, C. e Little, T. D. (1990). Predicting educational attainment, occupational achievement, intellectual skill, and personal adjustment among gifted men and
women. Journal of Educational Psychology, 82, 442-455.
Torgerson, W. S. (1952). Multidimensional scaling: I. Theory
and method. Psychometrika, 17, 401-419.
Trull, T. J. (1991). Discriminant validity of the MMPI-Borderline Personality Disorder Scale. Psychological Assessment,
3, 232-238.
Tucker, L. R. (1951). A method for synthesis of factor analysis
studies. Personnel Research Section Report No. 984. Washington, DC: Department of the Army.
Tucker, L. R. (1955). The objective definition of simple structure in linear factor analysis. Psychometrika, 20, 209-225.
Tucker, L. R. e Lewis, C. (1973). A reliability coefficient for maximum likelihood factor analysis. Psychometrika, 38, 1-10.
Tupes, E. C. e Christal, R. E. (1961). Recurrent Personality Factors
Based on Trait Ratings (Technical Report No. ASD-TR-61-97). Lackland Air Force Base, TX: U.S. Air Force.
Turner, S. P. (1979). The concept of face validity. Quality and
Quantity, 13, 85-90.
Tutz, G. (1990). Sequential Item Response Models with an
Ordered Response. British Journal of Mathematical and Statistical Psychology, 43, 39-55.
Tutz, G. (1997). Sequential Models for Ordered Responses.
In W. J. Van der Linden e R. K. Hambleton (Eds.), Handbook of Modern Item Response Theory (pp. 139-152). New York: Springer-Verlag.
Tversky, A. e Kahneman, D. (1973). Availability: A heuristic
for judging frequency and probability. Cognitive Psychology,
5, 207-232.
Ubbiali, A., Chiorri, C. e Donati, D. (in press). The Italian
version of the Inventory of Interpersonal Problems Personality Disorders Scales (IIP-47): Psychometric properties
and clinical usefulness as a screening measure. Journal of
Personality Disorders.
Urban, F. M. (1908). The Application of Statistical Methods to Problems of Psychophysics. Philadelphia: Psychological Clinic Press.
Urban, W. J. (1989). The black scholar and intelligence testing: The case of Horace Mann Bond. Journal of the History of the Behavioral Sciences, 25, 323-334.
Van der Ark, A., Croon, M. A. e Sijtsma, K. (2005). New Developments in Categorical Data Analysis for the Social and Behavioral Sciences.
Van de Vijver, F. J. R. e Poortinga, Y. H. (2005). Conceptual and methodological issues in adapting tests. In R. K. Hambleton, P. F. Merenda e C. D. Spielberger (Eds.), Adapting Educational and Psychological Tests for Cross-Cultural Assessment (pp. 39-63). Mahwah, NJ: Lawrence Erlbaum Associates, Inc.
Van den Broeck, J., Argeseanu Cunningham, S., Eeckels, R.
e Herbst, K. (2005). Data cleaning: Detecting, diagnosing,
and editing data abnormalities. PLoS Medicine, 2(10), e267.
Van der Linden, W. J. e Glas, C. A. W. (2000). Computerized
Adaptive Testing. Theory and Practice. Dordrecht, The Netherlands: Kluwer Academic Publishers.
Vannucci, M. (2008). Quando la memoria ci inganna. La psicologia delle false memorie. Roma: Carocci.
Velicer, W. F. e Fava, J. L. (1998). Effects of variable and subject sampling on factor pattern recovery. Psychological Methods, 3, 231-251.
Velicer, W. F. (1976). Determining the number of factors from
the matrix of partial correlations. Psychometrika, 41, 321-327.
Velicer, W. F., Eaton, C. A. e Fava, J. L. (2000). Construct explication through factor or component analysis: A review
and evaluation of alternative procedures for determining
the number of factors or components. In R. D. Goffin ed
E. Helmes (Eds.), Problems and Solutions in Human Assessment: Honoring Douglas N. Jackson at Seventy (pp. 41-71).
Boston: Kluwer.
Velicer, W. F. e Jackson, D. N. (1990a). Component analysis vs.
common factor analysis: Some further observations. Multivariate Behavioral Research, 25, 95-112.
Velicer, W. F. e Jackson, D. N. (1990b). Component analysis vs.
common factor analysis: Some issues in selecting an appropriate procedure. Multivariate Behavioral Research, 25, 1-2.
Vernon, P. E. e Parry, J. A. (1949). Personnel Selection in the
British Forces. London: London University Press.
Vernon, P. E. (1960). Intelligence and Attainment Tests. London: University of London Press.
Viteles, M. S. (1921). Tests in industry. Journal of Applied Psychology, 5, 57-63.
Von Davier, A. A. (2011). Statistical Models for Test Equating,
Scaling, and Linking. New York: Springer.
von Kries, J. (1882). Über die Messung intensiver Grössen
und über das sogenannte psychophysische Gesetz. Vierteljahrsschrift für wissenschaftliche Philosophie, 6, 257-294.
von Mayrhauser, R. (1989). Making intelligence functional:
Walter Dill Scott and applied psychological testing in
World War I. Journal of the History of the Behavioral Sciences,
25, 60-72.
Wagenaar, W. A. (1986). My memory: A study of autobiographical memory over six years. Cognitive Psychology, 18,
Wagner, D. A. (1981). Culture and memory development. In
H. C. Triandis e A. Heron (Eds.), Handbook of Cross-Cultural
Psychology (Vol. 4, pp. 187-232). Boston: Allyn & Bacon.
Wainer, H. (Ed.). (2000). Computerized Adaptive Testing. A Primer. Mahwah, NJ: Lawrence Erlbaum Associates.
Wainer, H., Sireci, S. G. e Thissen, D. (1991). Differential testlet functioning: Definitions and detection. Journal of Educational Measurement, 28, 197-219.
Waller, N. G. e Reise, S. P. (2010). Measuring psychopathology with non-standard IRT models: Fitting the four parameter model to the MMPI. In S. Embretson e J. S. Roberts (Eds.), New Directions in Psychological Measurement with
Model-Based Approaches (pp. 147-173). Washington, DC:
American Psychological Association.
Walsh, W. B. (1995). Tests and Assessment. New York: Prentice-Hall.
Wanous, J. P., Reichers, A. e Hudy, M. J. (1997). Overall job
satisfaction: How good are single-item measures? Journal
of Applied Psychology, 82, 247-252.
Wason, P. C. (1960). On the failure to eliminate hypotheses
in a conceptual task. Quarterly Journal of Experimental Psychology, 12, 129-140.
Wasserman, G. S., Felsten, G. ed Easland, G. S. (1979). The
psychophysical function: harmonizing Fechner and Stevens. Science, 204, 85-87.
Watson, D. (1982). The actor and the observer: How are their
perceptions of causality divergent? Psychological Bulletin,
92, 682-700.
Watson, D., Clark, L. A. e Tellegen, A. (1988). Development
and validation of brief measures of positive and negative
affect: The PANAS scales. Journal of Personality and Social
Psychology, 54, 1063-1070.
Weber, E. H. (1834). De pulsu, resorptione, auditu et tactu. Annotationes anatomicae et physiologicae. Leipzig: Koehler.
Wechsler, D. (1939). The Measurement of Adult Intelligence. Baltimore: Williams & Wilkins.
Weems, G. H. e Onwuegbuzie, A. J. (2001). The impact of
midpoint responses and reverse coding on survey data.
Measurement and Evaluation in Counseling and Development,
34, 166-176.
Weierstrass, K. (1868). Zur Theorie der quadratischen und
bilinearen Formen. Monatsberichte der Akademie der Wissenschaften zu Berlin, Werke 1 (Berlin 1894), 233-246.
Weijters, B., Geuens, M. e Schillewaert, N. (2009). The proximity effect: The role of inter-item distance on reverse-item bias. International Journal of Research in Marketing, 26,
Weir, J. P. (2005). Quantifying test-retest reliability using the
intraclass correlation coefficient and the SEM. Journal of
Strength and Conditioning Research, 19(1), 231-240.
Weiss, R. L. e Heyman, R. E. (1990). Observation of marital
interaction. In F. D. Fincham e T. N. Bradbury (Eds.), The
Psychology of Marriage: Basic Issues and Applications (pp. 87-117). New York: Guilford.
Wells, F. L. (1914). The systematic observation of the personality in its relation to the hygiene of the mind. Psychological Review, 21, 295-333.
Werner, O. e Campbell, D. T. (1970). Translating, working
through interpreters, and problems of decentering. In R.
Naroll e R. Cohen (Eds.), A Handbook of Method in Cultural
Anthropology (pp. 398-420). New York: Columbia University Press.
Westen, D. e Rosenthal, R. (2003). Quantifying construct validity: Two simple measures. Journal of Personality and Social
Psychology, 84, 608-618.
Westland, J. C. (2010). Lower bounds on sample size in structural equation modeling. Electronic Commerce Research and
Applications, 9, 476-487.
Whitten, W. B. e Leonard, J. M. (1981). Directed search
through autobiographical memory. Memory and Cognition,
9, 566-579.
Widaman, K. F. e Thompson, J. S. (2004). On Specifying the
Null Model for Incremental Fit Indices in Structural Equation Modeling. Psychological Methods, 8(1), 16-37.
Wiegersma, S. (1982). Sequential response bias in randomized response sequences: A computer simulation. Acta Psychologica, 52, 249-256.
Wiggins, J. S. (1964). Convergences among stylistic response
measures from objective personality tests. Educational and
Psychological Measurement, 24, 551-562.
Wiggins, J. S. (1959). Interrelationships among MMPI measures of dissimulation under standard and social desirability
instructions. Journal of Consulting Psychology, 23, 419-427.
Wiley, D. E. (1973). The identification problem for structural
equation models with unmeasured variables. In A. S. Goldberger e O. D. Duncan (Eds.), Structural Equation Models
in the Social Sciences (pp. 69-83). New York: Academic Press.
Willoughby, R. R. (1932). Some properties of the Thurstone
Personality Schedule and a suggested revision. Journal of
Social Psychology, 3, 401-424.
Wilson, D. e Winkelstein, M. L. (2005). Wong’s Essentials of Pediatric Nursing. St. Louis: Elsevier.
Wilson, E. B. (1927). Probable inference, the law of succession, and statistical inference. Journal of the American Statistical Association, 22, 209-212.
Wilson, E. B. (1928a). Review of “The abilities of man, their
nature and measurement” by C. Spearman. Science, 67,
Wilson, E. B. (1928b). On hierarchical correlation systems.
Proceedings of the National Academy of Sciences, 14, 283-291.
Wilson, E. B. e Hilferty, M. M. (1931). The distribution of chi-square. Proceedings of the National Academy of Sciences of the
United States of America, 17, 684-688.
Wilson, G. D. (1975). Manual for the Wilson-Patterson Attitude
Inventory. Windsor, UK: NFER/Nelson.
Wilson, G. D. (1985). The “catchphrase” approach to attitude
measurement. Personality and Individual Differences, 6, 31-37.
Wilson, G. D. e Patterson, J. R. (1968). A new measure of
conservatism. British Journal of Social and Clinical Psychology,
7, 264-269.
Winkler, J. D., Kanouse, D. E. e Ware, J. E., Jr. (1982). Controlling for acquiescence response set in scale development. Journal of Applied Psychology, 67, 555-561.
Wissler, C. (1901). The correlation of mental and physical
tests. Psychological Review Monograph Supplements, 3 (6).
Witkin, H. A. (1962). Psychological Differentiation: Studies of Development. New York: Wiley.
Wolpe, J. (1958). Psychotherapy by Reciprocal Inhibition. Stanford, CA: Stanford University Press.
Wong, Y. K. (1935). Application of orthogonalization processes to the theory of least squares. Annals of Mathematical
Statistics, 6, 53-75.
Woodhouse, B. e Jackson, P. H. (1977). Lower bounds for the
reliability of the total score on a test composed of nonhomogeneous items: II. A search procedure to locate the
greatest lower bound. Psychometrika, 42(4), 579-591.
Woodworth, R. S. (1919). Examination of emotional fitness
for war. Psychological Bulletin, 15, 59-60.
Woodworth, R. S. (1920). Personal Data Sheet. Chicago: Stoelting.
Woodworth, R. S. (1930). Autobiography of Robert S. Woodworth. In C. Murchison (Ed.), History of Psychology in Autobiography (Vol. 2, pp. 359-380). Worcester, MA: Clark
University Press.
Woodworth, R. S. e Wells, F. L. (1911). Association tests. Psychological Monographs, 13(5), 1-85.
Wothke, W. (1995). Covariance components analysis of the
multitrait-multimethod matrix. In P. E. Shrout e S. T. Fiske (Eds.), Personality Research, Methods, and Theory: A Festschrift Honoring Donald W. Fiske (pp. 125-144). Hillsdale,
NJ: Erlbaum.
Wrenn, B., Stevens, R. E. e Loudon, D. L. (2007). Marketing
Research: Texts and Cases (2nd ed.). Binghamton, NY: Haworth Press.
Wright, B. D. e Masters, G. N. (1982). Rating Scale Analysis.
Chicago: MESA Press.
Wright, B. D. e Linacre, J. M. (1994). Reasonable mean-square fit values. Rasch Measurement Transactions, 8(3), 370.
Wright, D. B. (2000). Conventional factor analysis vs. Rasch
residual factor analysis. Rasch Measurement Transactions,
14(2), 753.
Wright, S. (1921). Correlation and causation. Journal of Agricultural Research, 20, 557-585.
Wundt, W. (1874). Grundzüge der physiologischen Psychologie.
Leipzig: Wilhelm Engelmann.
Yamaguchi, J. (1997). Positive vs. negative wording. Rasch
Measurement Transactions, 11, 567.
Yerkes, R. M. (Ed.). (1921). Psychological Examining in the United States Army. Memoirs of the National Academy of Sciences,
Volume 15. Washington: Government Printing Office.
Yerkes, R. M., Bridges, J. W. e Hardwick, R. S. (1915). A
Point Scale for Measuring Mental Ability. Baltimore: Warwick & York.
Yoakum, C. S. (1922). Basic experiments in vocational guidance. Journal of Personnel Research, 1, 18-34.
Yoakum, C. S. e Yerkes, R. M. (1920). Army Mental Tests. New
York: Henry Holt and Company.
Zammuner, V. L. (1998). Tecniche dell’intervista e del questionario. Bologna: Il Mulino.
Zammuner, V. L. (2006). I focus group. Bologna: Il Mulino.
Zhang, H. (1988). Psychological measurement in China. International Journal of Psychology, 23, 101-117.
Zimmerman, M., Rothschild, L. e Chelminski, I. (2005). The prevalence of DSM-IV personality disorders in psychiatric outpatients. American Journal of Psychiatry, 162, 1911-1918.
Zumbo, B. D., Gadermann, A. M. e Zeisser, C. (2007). Ordinal
versions of coefficients alpha and theta for Likert rating
scales. Journal of Modern Applied Statistical Methods, 6, 21-29.
Zwick, R. (1990). When do item response function and Mantel-Haenszel definitions of differential item functioning
coincide? Journal of Educational Statistics, 15, 185-197.
Zwick, R., Donoghue, J. R. e Grima, A. (1993). Assessment of
differential item functioning for performance tasks. Journal of Educational Measurement, 30, 233-251.
Zwick, W. R. e Velicer, W. F. (1982). Factors influencing four
rules for determining the number of components to retain.
Multivariate Behavioral Research, 17, 253-269.
Zwick, W. R. e Velicer, W. F. (1986). Comparison of five rules
for determining the number of components to retain. Psychological Bulletin, 99, 432-442.