Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

CiteULike is a free service for managing and discovering scholarly references - click here to get started.

Sign In to gain access to subscriptions and/or personal tools.
Language Testing
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Lumley, T.
Right arrow Articles by McNamara, T.F.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Rater characteristics and rater bias: implications for training

Tom Lumley

NLLIA Lanauaae Testing Centre, University of Melbourne

T.F. McNamara

NLLIA Lanauaae Testing Centre, University of Melbourne

Recent developments in multifaceted Rasch measurement (Linacre, 1989) have made possible new kinds of investigation of aspects (or 'facets') of performance assessments. Relevant characteristics of such facets (for exam ple, the relative harshness of individual raters, the relative difficulty of test tasks) are modelled and reflected in the resulting person ability measures.

In addition, bias analyses, that is interactions between elements of any facet, can also be analysed. (For the facet 'person', an element is an individual candidate; for the facet 'rater', an element is an individual judge, and so on.) This permits investigation of the way a particular aspect of the test situation (type of candidate, choice of prompt, etc.) may elicit a consistently biased pattern of responses from a rater.

The purpose of the research is to investigate the use of these analytical techniques in rater training for the speaking subtest of the Occupational English Test (OET), a specific-purpose ESL performance test for health professionals. The test involves a role-play based, profession-specific inter action, involving some degree of choice of role-play material. Data are presented from two rater training sessions separated by an 18-month interval and a subsequent operational test administration session. The analysis is used to establish 1) consistency of rater characteristics over different occasions; and 2) rater bias in relation to occasion of rating. The study thus addresses the question of the stability of rater characteristics, which has practical implications in terms of the accreditation of raters and the requirements of data analysis following test administration sessions. It also has research implications concerning the role of multifaceted Rasch measurement in understanding rater behaviour in performance assessment contexts.

Language Testing, Vol. 12, No. 1, 54-71 (1995)
DOI: 10.1177/026553229501200104


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Language TestingHome page
J. S. Johnson and G. S. Lim
The influence of rater language background on writing performance assessment
Language Testing, October 1, 2009; 26(4): 485 - 505.
[Abstract] [PDF]


Home page
Language TestingHome page
S. Matsuno
Self-, peer-, and teacher-assessments in Japanese university EFL writing classrooms
Language Testing, January 1, 2009; 26(1): 075 - 100.
[Abstract] [PDF]


Home page
Language TestingHome page
E. Schaefer
Rater bias patterns in an EFL writing assessment
Language Testing, October 1, 2008; 25(4): 465 - 493.
[Abstract] [PDF]


Home page
Language TestingHome page
H. Saito
EFL classroom peer assessment: Training effects on rating and commenting
Language Testing, October 1, 2008; 25(4): 553 - 581.
[Abstract] [PDF]


Home page
Language TestingHome page
T. Eckes
Rater types in writing performance assessments: A classification approach to rater variability
Language Testing, April 1, 2008; 25(2): 155 - 185.
[Abstract] [PDF]


Home page
Language TestingHome page
L. Llosa
Validating a standards-based classroom assessment of English proficiency: A multitrait-multimethod approach
Language Testing, October 1, 2007; 24(4): 489 - 515.
[Abstract] [PDF]


Home page
Language TestingHome page
M. East
Bilingual dictionaries in tests of L2 writing proficiency: do they make a difference?
Language Testing, July 1, 2007; 24(3): 331 - 353.
[Abstract] [PDF]


Home page
Language TestingHome page
Y. Sawaki
Construct validation of analytic rating scales in a speaking assessment: Reporting a score profile and a composite
Language Testing, July 1, 2007; 24(3): 355 - 390.
[Abstract] [PDF]


Home page
Language TestingHome page
C. Elder, G. Barkhuizen, U. Knoch, and J. von Randow
Evaluating rater responses to an online training program for L2 writing assessment
Language Testing, January 1, 2007; 24(1): 37 - 64.
[Abstract] [PDF]


Home page
Assessment for Effective InterventionHome page
B. Garrett, E. Towles, H. Kleinert, and J. Kearns
Portfolios in Large-Scale Alternate Assessment Systems: Frameworks for Reliability
Assessment for Effective Intervention, January 1, 2003; 28(2): 17 - 27.
[Abstract] [PDF]


Home page
Language TestingHome page
W. J. Bonk and G. J. Ockey
A many-facet Rasch analysis of the second language group oral discussion task
Language Testing, January 1, 2003; 20(1): 89 - 110.
[Abstract] [PDF]


Home page
Language TestingHome page
K. Kondo-Brown
A FACETS analysis of rater bias in measuring Japanese second language writing performance
Language Testing, January 1, 2002; 19(1): 3 - 31.
[Abstract] [PDF]


Home page
Language TestingHome page
L. F. Bachman
Modern language testing at the turn of the century: assuring that what we count counts
Language Testing, January 1, 2000; 17(1): 1 - 42.
[Abstract] [PDF]


Home page
Language TestingHome page
J. A. Upshur and C. E. Turner
Systematic effects in the rating of second-language speaking ability: test method and learner discourse
Language Testing, January 1, 1999; 16(1): 82 - 111.
[Abstract] [PDF]


Home page
Language TestingHome page
B. K. Lynch and T. F. McNamara
Using G-theory and Many-facet Rasch measurement in the development of performance assessments of the ESL speaking skills of immigrants
Language Testing, April 1, 1998; 15(2): 158 - 180.
[Abstract] [PDF]


Home page
Language TestingHome page
G. Brindley
Outcomes-based assessment and reporting in language learning programmes: a review of the issues
Language Testing, January 1, 1998; 15(1): 45 - 85.
[Abstract] [PDF]


Home page
Language TestingHome page
R. Schoonen, M. Vergeer, and M. Eiting
The assessment of writing ability: expert readers versus lay readers
Language Testing, July 1, 1997; 14(2): 157 - 184.
[Abstract] [PDF]