Inter-rater Reliability in a Clinical Performance Examination Using Multiple Standardized Patients for the Same Case

Ko, Jinkyung; Yoon, Tai Young; Park, Jaehyun; Ko, Jinkyung; Yoon, Tai Young; Park, Jaehyun

doi:2008.20.1.61

Korean J Med Educ > Volume 20(1); 2008 > Article

Original Article

Korean Journal of Medical Education 2008;20(1): 61-72. doi: https://doi.org/10.3946/kjme.2008.20.1.61

진료수행시험에서 동일한 사례를 연기한 다수의 표준화 환자 간 채점결과 신뢰도

고진경, 윤태영, 박재현

경희대학교 의과대학 의학교육학교실

Inter-rater Reliability in a Clinical Performance Examination Using Multiple Standardized Patients for the Same Case

Jinkyung Ko, Tai Young Yoon, Jaehyun Park

Department of Medical Education, School of Medicine, Kyung Hee University, Seoul, Korea.

Corresponding Author: Jinkyung Ko, Tel: 02)961-9102, Fax: 02)969-0792, Email: michkay@khu.ac.kr

ABSTRACT

PURPOSE: The "standardization" of standardized patients (SP) is one of the most crucial factors for a successful clinical performance examination (CPX). This study aimed to examine the inter-rater reliability among SPs who portrayed the same case during a CPX. METHODS: The context was a CPX conducted under the supervision of CPX Seoul-Gyeonggi Consortium in K medical school in August 2007. K medical school ran 12 stations consisting of duplicated sets of 6 cases. In total, thirty SPs participated with 5 SPs acting each of the 6 cases. The SPs evaluated the student's performances in addition to portraying the cases. ANCOVA (analysis of covariance) was used to compare scores rated by the different SPs. The dependent variables were the case scores and the 4 subcomponent (history taking, physical examination, Clinical courtesy, and Patient-physician interaction) scores for each case; the independent variable was the SPs; and the covariate was the CPX total score. RESULTS: The Headache and Cough stations showed an acceptable level of reliability. Otherwise, Weight Loss and Facial Flushing failed to show consistent scores in all 4 subcomponents. Diarrhea and Lt. hemiparesis showed partial consistency. In terms of the subcomponents, the physical exam scores were most consistent and the patient-physician interaction scores were most inconsistent. CONCLUSION: This study tested the level of "standardization" of one set of CPX cases with mixed results. The authors hope that our results will contribute to quality assurance of CPX.

Keywords: CPX;Standardized Patient;Inter-rater Reliability;Analysis of Covariance;Assessment