Abstract
The use of experts to judge performance assessments is desirable because ratings of performances, carried out by experts in the content domain of the examination, are often considered to be the "gold standard." However, one drawback of using experts to rate performances is the high cost involved. A more economical alternative for scoring performance assessments is analytic scoring, which typically involves assigning points to individual traits present in the performance and summing them to arrive at a single score. This strategy is less costly, but may lack the richness of holistic scoring. This study investigates the use of regression-based techniques to predict expert judgments on a written performance task from a combination of analytic scores. Potentially, this yields scores that approximate the richness of holistic ratings while maintaining the cost-effectiveness of analytic scoring. Results show that a substantial proportion of variance in expert judgments can be explained by the analytic scores, but that decisions based on actual expert judgments and on the predicted expert judgments were not sufficiently consistent to warrant substituting one score for the other.
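The regression-based approach described above can be sketched as an ordinary least-squares fit of holistic expert ratings on a set of analytic subscores, with the proportion of explained variance (R-squared) as the key summary. The data below are purely illustrative placeholders, not values from the study, and the three-trait structure is an assumption made for the example.

```python
import numpy as np

# Hypothetical data: each row holds an examinee's analytic subscores
# (points awarded for three individual traits in a written exercise);
# 'holistic' holds the corresponding expert holistic rating.
# All numbers are illustrative, not taken from the study.
analytic = np.array([
    [3, 2, 4],
    [1, 1, 2],
    [4, 3, 5],
    [2, 2, 3],
    [5, 4, 5],
    [2, 1, 1],
], dtype=float)
holistic = np.array([7.0, 3.0, 9.0, 5.0, 10.0, 3.0])

# Fit OLS regression: holistic ~ b0 + b1*trait1 + b2*trait2 + b3*trait3
X = np.column_stack([np.ones(len(analytic)), analytic])
coef, _, _, _ = np.linalg.lstsq(X, holistic, rcond=None)

# Predicted holistic ratings from the analytic scores
predicted = X @ coef

# Proportion of variance in expert judgments explained by the
# analytic scores (R-squared), the quantity the abstract calls
# "substantial"
ss_res = np.sum((holistic - predicted) ** 2)
ss_tot = np.sum((holistic - holistic.mean()) ** 2)
r_squared = 1.0 - ss_res / ss_tot
print(f"R-squared: {r_squared:.3f}")
```

A high R-squared alone does not settle the substitution question: the study's negative conclusion rests on decision consistency (whether pass/fail classifications agree across the actual and predicted scores), which must be checked separately from variance explained.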
Slater, S.C., Boulet, J.R. Predicting Holistic Ratings of Written Performance Assessments from Analytic Scoring. Adv Health Sci Educ Theory Pract 6, 103–119 (2001). https://doi.org/10.1023/A:1011478224834