Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Sign In to gain access to subscriptions and/or personal tools.
Language Testing
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Kim, M.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Detecting DIF across the different language groups in a speaking test

Mikyung Kim

University of California, Los Angeles, kimmi{at}ucla.edu

The investigation of differential item functioning (DIF) is crucial in language proficiency tests in which test-takers with diverse backgrounds are involved, because DIF items pose a considerable threat to the validity of tests. To date, DIF analysis in language testing has been conducted mainly for multiple-choice items. However, examining DIF with polytomous response items such as in writing and speaking tests should also be taken into consideration for validating tests. This study aims to investigate DIF across two different broad language groupings, Asian and European, in a speaking test in which the test-takers’ responses are rated polytomously. Data in this study were collected from 1038 nonnative speakers of English from France, Hong Kong, Japan, Spain, Switzerland and Thailand who took the SPEAK test in 1988 (see Educational Testing Service, 1985). The methods used for DIF analysis were the likelihood ratio test and the logistic regression procedure. The primary scoring categories of interest in this study were ‘grammar’, ‘pronunciation’ and ‘fluency.’ The results showed that ‘grammar’ and ‘pronunciation’ functioned differentially across the two groups. A content analysis of the DIF items suggested that the types and the numbers of scoring scales might influence the test validity. The study provides methodological information on differences between two approaches to DIF analysis and suggestions for future research.

Language Testing, Vol. 18, No. 1, 89-114 (2001)
DOI: 10.1177/026553220101800104


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Applied Psychological MeasurementHome page
C. M. Woods
Empirical Selection of Anchors for Tests of Differential Item Functioning
Applied Psychological Measurement, January 1, 2009; 33(1): 42 - 57.
[Abstract] [PDF]


Home page
Educational and Psychological MeasurementHome page
B. F. French and S. J. Maller
Iterative Purification and Effect Size Use With Logistic Regression for Differential Item Functioning Detection
Educational and Psychological Measurement, June 1, 2007; 67(3): 373 - 393.
[Abstract] [PDF]


Home page
Language TestingHome page
A. Huhta, P. Kalaja, and A. Pitkanen-Huhta
Discursive construction of a high-stakes test: the many faces of a test-taker
Language Testing, July 1, 2006; 23(3): 326 - 350.
[Abstract] [PDF]


Home page
Language TestingHome page
H. Uiterwijk and T. Vallen
Linguistic sources of item bias for second generation immigrants in Dutch tests
Language Testing, April 1, 2005; 22(2): 211 - 234.
[Abstract] [PDF]


Home page
Language TestingHome page
S.-K. Shin
Did they take the same test? Examinee language proficiency and the structure of language tests
Language Testing, January 1, 2005; 22(1): 31 - 57.
[Abstract] [PDF]


Home page
Educational and Psychological MeasurementHome page
A. M. Fidalgo, D. Ferreres, and J. MuNiz
Utility of the Mantel-Haenszel Procedure for Detecting Differential Item Functioning in Small Samples
Educational and Psychological Measurement, December 1, 2004; 64(6): 925 - 936.
[Abstract] [PDF]