Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Sign In to gain access to subscriptions and/or personal tools.
Language Testing
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Chalhoub-Deville, M.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Deriving oral assessment scales across different tests and rater groups

Micheline Chalhoub-Deville

Ohio State University

The purpose of this study is to derive the criteria/dimensions underlying learners' L2 oral ability scores across three tests: an oral interview, a narration and a read-aloud. A stimulus tape of 18 speech samples was presented to three native speaker rater groups for evaluation. The rater groups included teachers of Arabic as a foreign language in the USA, nonteaching Arabs residing in the USA for at least one year and nonteaching Arabs living in their home country (Lebanon). Each of the raters provided a holistic score for every speech sample. Holistic scores were analysed using the INDSCAL multidimensional scaling model. Results showed that the nonmetric three-dimensional solution provided a good fit to the data. Both regression and speech sample analyses were employed to identify those dimensions. Additionally, subject weights indicated that the three rater groups were emphasizing the three dimensions differentially, thus demon strating that native speaker groups with varied backgrounds perceive the L2 oral construct differently. The study contends that researchers might need to reconsider employing generic component scales. A research approach that derives scales empirically according to the given tests and audiences, and according to the purpose of assessment, is recommended. Finally, replicating this study using other languages, L2 oral ability levels, tests and rater groups is suggested.

Language Testing, Vol. 12, No. 1, 16-33 (1995)
DOI: 10.1177/026553229501200102


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Language TestingHome page
Y.-H. Kim
An investigation into native and non-native teachers' judgments of oral English performance: A mixed methods approach
Language Testing, April 1, 2009; 26(2): 187 - 217.
[Abstract] [PDF]


Home page
Language TestingHome page
L. Llosa
Validating a standards-based classroom assessment of English proficiency: A multitrait-multimethod approach
Language Testing, October 1, 2007; 24(4): 489 - 515.
[Abstract] [PDF]


Home page
Language TestingHome page
A. Cumming, L. Grant, P. Mulcahy-Ernt, and D. E. Powers
A teacher-verification study of speaking and writing prototype tasks for a new TOEFL
Language Testing, April 1, 2004; 21(2): 107 - 145.
[Abstract] [PDF]


Home page
Language TestingHome page
M. Chalhoub-Deville
Second language interaction: current perspectives and future trends
Language Testing, October 1, 2003; 20(4): 369 - 383.
[Abstract] [PDF]


Home page
Language TestingHome page
G. Fulcher and R. M. Reiter
Task difficulty in speaking tests
Language Testing, July 1, 2003; 20(3): 321 - 344.
[Abstract] [PDF]


Home page
Language TestingHome page
B. O'Sullivan, C. J. Weir, and N. Saville
Using observation checklists to validate speaking-test tasks
Language Testing, January 1, 2002; 19(1): 33 - 56.
[Abstract] [PDF]


Home page
Second Language ResearchHome page
D. Douglas
Performance consistency in second language acquisition and language testing research: a conceptual gap
Second Language Research, October 1, 2001; 17(4): 442 - 456.
[Abstract] [PDF]


Home page
Language TestingHome page
L. F. Bachman
Modern language testing at the turn of the century: assuring that what we count counts
Language Testing, January 1, 2000; 17(1): 1 - 42.
[Abstract] [PDF]


Home page
Language TestingHome page
J. A. Upshur and C. E. Turner
Systematic effects in the rating of second-language speaking ability: test method and learner discourse
Language Testing, January 1, 1999; 16(1): 82 - 111.
[Abstract] [PDF]


Home page
Language TestingHome page
G. Brindley
Outcomes-based assessment and reporting in language learning programmes: a review of the issues
Language Testing, January 1, 1998; 15(1): 45 - 85.
[Abstract] [PDF]


Home page
Language TestingHome page
M. Chalhoub-Deville
Theoretical models, assessment frameworks and test construction
Language Testing, March 1, 1997; 14(1): 3 - 22.
[Abstract] [PDF]