Assessing the inter-rater reliability of a system-wide teacher evaluation observation instrument: moving beyond the kappa paradox

Jimenez, Albert Manuel

Assessing the inter-rater reliability of a system-wide teacher evaluation observation instrument: moving beyond the kappa paradox

Jimenez, Albert Manuel

2014

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Files

Abstract

The purpose of this study was to investigate the processes and procedures impacting the validation and the establishment of inter-rater reliability of observation instruments used in a teacher evaluation context. A newly-created observation instrument used in an extensive teacher evaluation program in a southeastern school district served as the object of investigation in this research. Inter-rater reliability coefficients of the instrument were assessed as part of the validation of the evaluation system. Additionally, two methods of inter-rater reliability that correct for chance agreement were examined to determine if the Gwet AC1 statistic, which is often used in a medical context but rarely in an education one, outperformed the typically provided kappa statistic. The inter-rater reliability coefficients for all videos and all items combined were in an acceptable range. This was also the case for most individual standards as well. Gwets AC1 statistic regularly outperformed the kappa statistic as a chance-corrected measure of inter-rater reliability. This finding held for all teachers combined, for the highest-rated teacher, and for the lowest-rated teacher, suggesting that Gwets AC1 statistic shows promise for future inter-rater reliability studies in a teacher evaluation context. While Gwets AC1 statistic outperformed kappa for the lowest-rated teacher, what was clear is the inter-rater reliability coefficients for the lowest-rated teacher suggests that consistently and accurately identifying poorly performing teachers is elusive. Additionally, this finding suggests the possibility that standards by which teachers are traditionally assessed enabling accurate identification for poor performing teachers may be underdeveloped. Further research in this area is warranted.

Details

Record ID

11273

Record Created

2024-12-05

Title

Assessing the inter-rater reliability of a system-wide teacher evaluation observation instrument: moving beyond the kappa paradox

Author

Jimenez, Albert Manuel

Contributor

Zepeda, Sally Advisor
Cohen, Allan Committee Member
Gregg, Noel Committee Member

College or School

Mary Frances Early College of Education

Department

Lifelong Education, Administration and Policy

Date

2014

Publisher

University of Georgia

Content Type

Dissertation

Language

English

Dissertation/ Thesis Note

Doctoral

Degree Type

Doctor of Philosophy (PHD)

Name of Granting Institution

University of Georgia, Spring 2014

Year Degree Granted

2014

Keywords

Classroom Observation Instruments; Teacher Evaluation; Teacher Evaluation System Creation; Validity; Inter-rater Reliability; Gwet's AC1 Statistic & Teacher Evaluation

Record Appears in

College, School, or Unit > Mary Frances Early College of Education > Lifelong Education, Administration and Policy
Electronic Theses and Dissertations > Doctoral Dissertation
All Resources
Doctoral

System Control Number

9949334565702959

PDF

Statistics

Download Full History