<p>Psychometric Properties of the Persian Questionnaire for Evaluation of Clinical Teaching at Outpatient Settings</p>

Maryam Baradaran Binazir; Mohammad Barzegar; Fariba Heidari

doi:10.2147/AMEP.S261350

Back to Journals » Advances in Medical Education and Practice » Volume 11

Original Research

Psychometric Properties of the Persian Questionnaire for Evaluation of Clinical Teaching at Outpatient Settings

Authors Baradaran Binazir M, Barzegar M, Heidari F

Received 6 May 2020

Accepted for publication 30 July 2020

Published 17 August 2020 Volume 2020:11 Pages 549—555

DOI https://doi.org/10.2147/AMEP.S261350

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 2

Editor who approved publication: Dr Md Anwarul Azim Majumder

Download Article [PDF]

Maryam Baradaran Binazir,¹ Mohammad Barzegar,² Fariba Heidari³

¹Community and Family Medicine Department, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran; ²Medical Education Research Center, Health Management and Safety Promotion Research Institute, Tabriz University of Medical Sciences, Tabriz, Iran; ³Social Determinants of Health Research Center, Health Management and Safety Promotion Research Institute, Tabriz University of Medical Sciences, Tabriz, Iran

Correspondence: Maryam Baradaran Binazir Email [email protected]

Purpose: To assess the validity and reliability of the Persian Outpatient Clinical Teaching Evaluation (OCTE) questionnaire, in order to evaluate clinical teaching for undergraduate medical students.
Methods: This was a methodological study conducted in Tabriz University of Medical Sciences in 2018. To assess the validity of the questionnaire, ten academic staff who had received master’s degrees in medical education were chosen by convenience sampling. The first draft of the Persian questionnaire was derived from the Ministry of Health’s clinical teaching standards booklet. The questionnaire consisted of just 15 obligatory items in the first draft of the questionnaire. Subsequently, the questionnaire was modified to be used in teaching outpatient clinics. Content validity indices were calculated. Subsequently, the modified questionnaires were given to 92 academic staff in Imam-Reza Hospital’s outpatient clinics. We applied principle component analysis (PCA) and Varimax rotation for exploratory factor analysis (EFA). In order to confirm the EFA, confirmatory factor analysis (CFA) was used. Cronbach’s alpha method, the intra-class correlation coefficient (ICC) and test–retest reliability were used to assess the reliability.
Results: The overall content validity ratio (CVR), content validity index (CVI), and impact score (IS) were 0.78, 0.79, and 3.26, respectively. Out of 92 academic staff, 85 participated. The Keizer-Meyer-Olkin (KMO) measure of sampling adequacy for data was 0.726 (p-value=0.0001). After the EFA, the 10-item questionnaire loaded on three factors. The internal consistency of the questionnaire was established by Cronbach’s alpha of 0.89 and the ICC was 0.94. Furthermore, there was a strong correlation with a 2-week retest interval.
Conclusion: The study demonstrated that the Persian clinical teaching questionnaire had optimal psychometric properties. Thus, application of this questionnaire at outpatient settings can be helpful.

Keywords: validity, reliability, education, medical, undergraduate, evaluation

Introduction

Clinical teaching at outpatient settings is a critical part of medical students’ training and its evaluation is important.¹ One of the common types of clinical teaching evaluations is using a questionnaire to grade teachers by students at the end of clinical rotations.² Recently, medical educators have increasingly applied these questionnaires. Almost all of the questionnaires introduced in Western settings and their generalizability is limited for other countries and even universities.³ In order to solve these challenges, a study described the development and content validation of an instrument for evaluating clinical teaching addressed to an East Asian setting.⁴ An other study about the evaluation of clinical teaching pointed out that assessment tools should be compatible with the setting of organization, favorable to teachers, easy to conduct, and appropriate to all levels of educators.⁵ Accordingly, to improve the quality of clinical teaching, the first step is to use valid and reliable instruments in the clinical teaching environment. Then by implementing interventions during a dynamic process, more successful clinical teaching could be expected.⁶

In recent years, several questionnaires have been developed and validated for evaluation of clinical teaching, such as the Dundee Ready Education Environment Measure (DREEM) and the Service Quality Measurement Questionnaire (SERVQUAL). Evidence demonstrated that all these questionnaires were more appropriate for inpatient settings.^7,8 Although the Persian version of the Ambulatory Care Learning Education Environment Measure (ACLEEM) questionnaire was developed by Parvizi et al⁹ in 2016 for evaluation of clinical teaching in outpatient settings, it was designed for residency medical clinics. To date, a valid and reliable Persian questionnaire to evaluate the clinical teaching for undergraduate medical students at the outpatient settings is lacking.

The Ministry of health in Iran developed the standards for clinical teaching because there was a need to improve the clinical teaching for undergraduate medical students.¹⁰ However, for the evaluation of clinical teaching, there were the basic necessities for the instrument to be valid, and reliable.¹¹

A limited number of studies have validated the Persian instrument to evaluate clinical teaching for undergraduate medical students in outpatient settings. For example, a 10-item questionnaire was found to be valid for the evaluation of clinical teaching capacities in outpatient settings. Although this tool was reviewed for content validity by the authors, no other psychometric properties were investigated.¹² Other studies validated the Persian questionnaire to evaluate the journal clubs and the morning reports in the teaching hospitals.^13,14

Evaluation is the important factor in the performing and improving of educational programs.⁹ Teachers, students, and health care systems, along with patients, can be supposed to gain from improvement of clinical teaching. So, the aim of this study was to assess the validity and the reliability of the questionnaire derived from the clinical teaching standards booklet of Iran’s Ministry of Health to evaluate clinical teaching in outpatient settings.

Methods

Study Design

This was a methodological study conducted in Tabriz University of Medical Sciences in Iran, from October to December, 2018. We investigated the validity and the reliability of the questionnaire to evaluate the clinical teaching at the outpatient settings.

The Questionnaire

The first draft of the Persian questionnaire was derived from the Ministry of Health’s clinical teaching standards booklet.¹⁰ The Educational Deputy of the Ministry of Health has conducted a project called “Determination of Clinical Medical Education Standards” in 2015, in order to develop the clinical teaching standards booklet. The first step in the project was situation analysis of clinical education in Iranian medical universities. In the next step, medical education experts reviewed the literature and summarized the reports and the first draft of the clinical teaching standards booklet was developed. These standards were related to the learning environment in the medical education process including empowering the teacher on clinical teaching methods, supervising the students, location and equipment for clinical teaching, frequency and duration of clinical teaching, the combination and number of patients, object-based teaching, teaching content, effective educational practices, night shifts for medical interns, ethics, evidence-based practice, medical recording and documentation, and evaluation.¹⁵ The booklet consisted of outpatient clinics, teaching round, grand round, morning reports, journal reports, and essentials of clinical skills’ standards.¹⁰

The clinical teaching standards at the outpatient settings consisted of 27 items, 15 obligatory and 12 preferred items. This study was conducted based on only the obligatory items. As the original and validated questionnaire both were in Persian, no translation was performed in the present study. Subsequently, the questionnaire was modified to be used in the teaching outpatient clinics.

Validity and Reliability

Both qualitative and quantitative methods were performed to investigate the validity of the questionnaire. To do so, ten academic staff with master’s degrees in medical education were chosen by convenience sampling, hereafter referred to as our expert panel. In qualitative method, in terms of face validity, the experts were requested to write their opinions about the items (the location of the items, the structure, the ambiguity, and the difficulty of the items) and the scaling. In the quantitative method, the CVR and the CVI of the items and the whole questionnaire were calculated. Additionally, the impact score (IS) was applied to assess the face validity of the questionnaire. To find out the threshold point for CVR, Lawshe’s table was applied indicating a CVR of 0.62 as the lowest required value for each item for ten experts.¹⁶

According to the content validity, we asked experts to read the questionnaire and evaluateeach item based on a three-point grading including essential, beneficial but unessential, and unessential. In order to determine the CVI, the simplicity (absolutely simple, simple, fairly complex, complex), the relevance (irrelevant, fairly relevant, relevant, absolutely relevant), and the clarity (absolutely clear, clear, fairly clear, unclear) of the items were independently examined based on a four-point scale. The CVI for each item was calculated by dividing the number of experts who graded the items as scores three and four for each factor (simplicity, relevance, and clarity) divided by the total number of experts. Accordingly, a CVI more than 0.78 pointed out a sufficient content validity. In case of a CVI less than 0.78, the item was modified or omitted.¹⁷

The IS (frequency (%) × importance) was used to reveal the percentage of the experts, who identified the item as important or quite important. The IS scoring was based on a 5-point grading. The items were considered appropriate if they had an impact score of 1.5 or higher in terms of face validity.

In order to test the convergence and the discriminant validity, the questionnaire was delivered to the academic staff in Imam-Reza Hospital’s outpatient clinics, an educational and referral center and the largest outpatient clinics in Northwest of Iran. The advised sample size for factor analysis was supposed to be ten subjects per item.¹⁸ Therefore, we requested all academic staff who worked at outpatient clinics in Imam-Reza Hospital, to fill out the questionnaire consisting of 52 internal medicine specialists, nine infectious disease specialists, six neurologists, eight general surgeons, ten urologists, and seven otorhinolaryngologists. The checklist was based on a 5-point scale including always, often, sometimes, rarely, and never that scored 5 to 1, respectively. The demographic data of raters including age, sex, work experience and academic degree were also collected.

The discriminant validity was measured using the EFA. The KMO measure of sampling adequacy and Bartlett’s test of sphericity were applied to determine whether factor analysis could be appropriate for our data considering proportion of variances in the variables. The KMO value equivalent to or more than 0.70 was regarded as adequate for factor analysis. A significant Bartlett’s test of sphericity was regarded as acceptable. Various mathematical ways were applied to specify which items fitted together but the most ordinary one was the principle component analysis (PCA). This analysis allowed us to recognize the components, and defined the related variance accounted for each component and their agreement. Each component had a quality score (Eigen value). Exclusive components with high Eigenvalues were expected to suggest a real factor. Furthermore, if an item had more than one factor loading (cross loadings), the Varimax rotation was applied to rearrange the factor loadings such that each item measured absolutely one component. Additionally, the CFA was tested using Lisrel 8.80 software.

In terms of convergence validity, Cronbach’s alpha coefficient was calculated to examine the internal consistency of the questionnaire. Furthermore, in order to assess the repeatability of the questionnaire, the participants responded to the questionnaire with a 2-week interval and the ICC was calculated. The ICC values of 0.40 or more were considered satisfactory.¹⁹ For testing the reliability, the concordance rate of participant’s answers between the two times of filling out the questionnaire were compared by Kappa coefficient.²⁰ The data were analyzed using SPSS software, Version 16.

Ethical Considerations

Our study was in accordance with the Declaration of Helsinki. We respected the autonomy of participants and protected their anonymity. Furthermore, all participants provided written informed consent to participate in this study. This research project was approved by the Ethics Committee of Tabriz University of Medical Sciences (IR.TBZMED.REC.810).

Results

The expert panel consisted of ten academic members of the Tabriz University of Medical Science with a mean age (SD) of 48.12 (4.13) years and all were men. To assess the construct validity and reliability, the questionnaire was sent to 92 academic staff, of which 85 had a mean (SD) age of 44.36 (5.41) years and agreed to participate (response rate was 92%), and 39 (45%) were women and 46 (54%) were men with an average of 16 years work experience.

In terms of face validity, the experts modified just one item to make it relevant, easy, and clear. The lowest and highest CVR were 0.51 and 0.95, respectively. Total CVR for whole questionnaire (mean CVRs of all items) was 0.78. The lowest and the highest CVI (mean of CVIs for the relevance, the clarity and the simplicity criteria) were 0.51 and 0.95, respectively. Total CVI (mean CVIs of all items) was 0.79. Additionally, total IS score of the questionnaire was 3.26. The final decision to modify, adopt or omit each item is shown in Table 1. The 10-item questionnaire was considered acceptable regarding the CVI, CVR, and IS.

Table 1 The Content and Face Validity of Each of the Final Items of the Questionnaire

The KMO measure of sampling adequacy for data was 0.726 (p-value= 0.0001). The results demonstrated that the data is appropriate for factor analysis and samples are adequate. The PCA extracted ten components and only three had an initial Eigen value higher than one. Rotated component matrix analysis for the questionnaire is shown in Table 2. Accordingly, these items were classified into three domains. The major components derived from three factors, had an initial Eigen value higher than one. Preparation (factor 1) with three items accounted for 34% of total variance. Scheduling (factor 2) with two items explained 23% of variance. Finally, implementation (factor 3) explained 14% of variance. As a result, exploratory factor analysis summed up the 10 questions of this questionnaire in three areas which explained 73% of the entire variance. We applied a confirmatory factor analysis model to establish the model of exploratory factor analysis (Figure 1).

Table 2 Exploratory Factors and Explained Variance After Rotation of the Questionnaire

Figure 1 Confirmatory factor model for outpatient clinical teaching.

To test the reliability, we examined the test–retest correlation coefficients between the first and the second time completing the questionnaire by Kappa coefficient. There was a strong correlation between 2-week tests (Table 3). The response rate for the ICC analysis was 100% (N= 85). The ICC and Cronbach’s alpha were 0.94 and 0.89, respectively. These showed high internal consistency for the questionnaire. Cronbach’s alpha was 0.88 (n = 3 items), 0.89 (n = 2 items), and 0.97 (n = 5 items) for the preparation, scheduling, and implementation domain, respectively.

Table 3 The Test–Retest Correlation Coefficients Between the First and the Second Time Completing the Questionnaire

Discussion

Based on a literature review there were a few relevant questionnaires available up to the time of performing the current study.⁴ Moreover, clinical teaching quality at the outpatient settings in Iranian medical universities has not been addressed sufficiently.¹² Hence, in order to evaluate the clinical teaching quality at the outpatient settings, it was necessary to develop a valid and reliable questionnaire.

In this study, the Persian version of the questionnaire evaluating clinical teaching in outpatient settings was developed. According to the findings, the OCTE questionnaire had acceptable psychometric properties. Thus, medical faculties and departments in all Persian speaking countries can apply this tool in evaluating clinical teaching quality at outpatient clinics.

Our questionnaire was derived from Iran’s Ministry of Health clinical teaching standards booklet. This booklet was based on the tips on developing an ambulatory teaching setting in a new venue. It included preparation, delivery and evaluation domains.²¹ For this reason, we observed similar structure compared to our questionnaire.

In order to address possible biases in scoring the relevancy, clarity, simplicity, and necessity, we requested the experts of the medical education to provide their views on each item for the evaluation of the teaching in outpatient clinics. The experts had consensus on ten selected items, but their views on best format for the response scale differed from Yes/No answers to a 5-point Likert scale. They suggested an odd number of scoring on a response scale, and they recommended adding “sometimes” in response options to allow participants to be neutral and easily express their opinions. It was similar to some studies demonstrating that investigators may have preferred a 5-point Likert scale.¹⁸

We calculated the CVI and the CVR because the content validity is important in determining the concept of measurements. Although these are robust indices, they could not adequately support the construct validity of an instrument.¹⁹ So, we applied a construct validity test to solve this problem. The questionnaire was found to have favorable internal validity and reliability. After EFA, the 10-item questionnaire loaded on three factors. The percentage of total variance of 72.8 was acceptable. The derived three factors in the questionnaire appeared to be logical and rational. Five items loaded on the implementation of the clinical teaching at the outpatient setting domain, three items remained in the preparation domain, and finally, scheduling domain covered two other items.

The questionnaire had great internal consistency, as measured by Cronbach’s alpha. It was 0.89, which was consistent with the results of the study by Tayyebi et al¹² in 2015. Favorable stability of the questionnaire might be a result of stability of academic staff’s views about the clinical teaching situation during the 2- week interval. Additionally, all the participants took part in the second test, leading to stable estimation. Accordingly, the questionnaire had strong test re-test reliability, which was assessed by the Kappa coefficient.

A study has assessed the validity and reliability of the Persian version of WATCH questionnaire according to the viewpoints of medical students.²² However, we used the experts’ opinions to validate the questionnaire, and it evaluated different domains of clinical teaching. Another study has evaluated the validity and the reliability of the Maastricht Clinical Teaching Questionnaire (MCTQ) in Bahrain. The MCTQ was a suitable tool to evaluate teachers’ skill during undergraduate clinical rotations. It was according to the cognitive apprenticeship model.²³ Furthermore, Nation et al⁶ investigated the reliability and validity of a new clinical teaching assessment instrument (CTAI). Although MCTQ and the CTAI were valid and reliable instruments to evaluate clinical teachers’ skills, they were not designed to evaluate other aspects of educational environments in the outpatient settings such as policies and governance structures. The Persian version of ACLEEM questionnaire was a valid and reliable instrument for Iranian residents to evaluate residency outpatient settings. There was a large difference in structure compared to our questionnaire. Maybe the difference between residency and undergraduate ambulatory settings led to this mismatch.

Although this study had some strengths such as performing both qualitative and quantitative methods to evaluate validity and reliability of the questionnaire, there were also limitations. First, the medical education experts and academic staff were mainly considering the local priorities of teaching in outpatient settings, which may have impacted on content validity. Second, medical education experts and academic staff were selected by convenience sampling, so they may not be representative sample of all academic staff’s community. Third, age, gender and prior experience were some factors that could affect academic staff’s views toward clinical teaching in outpatient settings. We propose that future studies test the validity and reliability of the questionnaire in other universities and even other countries to assess generalization of this questionnaire.

Conclusion

The study findings demonstrated that the Persian OCTE questionnaire had optimal psychometric properties. Hence, application of this questionnaire in outpatient settings can be helpful in the evaluation of clinical teaching facilities and strengths by faculties and departments.

Acknowledgments

The present study was a dissertation by the first author for obtaining a master degree in medical education which was approved by Tabriz University of Medical Sciences in 2018 and was conducted in the Medical Education Research Center. We appreciate the considerable contributions by the academic staff at Tabriz University of Medical Sciences.

Author Contributions

All authors made substantial contributions to conception and design, acquisition of data, or analysis and interpretation of data; took part in drafting the article or revising it critically for important intellectual content; gave final approval of the version to be published; and agree to be accountable for all aspects of the work.

Disclosure

The authors report that they have no conflicts of interest.

References

1. Janicik RW, Fletcher KE. Teaching at the bedside: a new model. Med Teach. 2003;25(2):127–130.

2. Dent JA. AMEE guide no 26: clinical teaching in ambulatory care settings: making the most of learning opportunities with outpatients. Med Teach. 2007;5(4):302–315.

3. Peters M, Ten Cate O. Bedside teaching in medical education: a literature review. Perspect Med Educ. 2014;3(2):76–88. doi:10.1007/s40037-013-0083-y

4. Kikukawa M, Stalmeijer RE, Emura S, Roff S, Scherpbier AJ. An instrument for evaluating clinical teaching in Japan: content validity and cultural sensitivity. BMC Med Educ. 2014;14(1):179–186.

5. Melvyn MJ, Bashir N, Purushotham N, Friel R, Rosenthal J. Universities and primary care organizations working together to recruit GPs: a qualitative evaluation of the Enfield Clinical Teaching Fellow Programme. BJGP Open. 2018;2(1):43–51.

6. Nation JG, Carmichael E, Fidler H, Violato C. The development of an instrument to assess clinical teaching with linkage to CanMEDS roles: a psychometric analysis. Med Teach. 2011;33(6):290–296.

7. Patil AA, Chaudhari VL. Students’ perception of the educational environment in medical college: a study based on DREEM questionnaire. Korean J Med Educ. 2016;28(3):281.

8. Karbakhsh A, Khalooei A. The quality of educational services for internship and apprenticeship courses at the community medicine department of Kerman University of Medical Sciences, from the trainees’ view point. Stride J Dev Med Educ. 2018;15(1):e57542.

9. Parvizi MM, Amini M, Dehghani MR, Jafari P, Parvizi Z. Psychometric properties of the Persian version of the ambulatory care learning educational environment measure (ACLEEM) questionnaire, Shiraz, Iran. Adv Med Educ Pract. 2016;7:559–566.

10. Vakili Z, Momen HM, Moraveji SA, Abdi F, Yavari M. Evaluation of educational departments of Kashan Shahid Beheshti Teaching Hospital. Iran J Med Educ. 2017;17(1):43–53.

11. Yazdi-Feyzabadi V, Gozashti MH, Komsari S, Mohammadtaghizadeh S, Amiresmaili M. Quality assessment of clinical education services in teaching hospitals located in Kerman, Iran. ElectronPhysician. 2015;7(7):14–27.

12. Tayyebi S, Hosseini SH, Noori S, Hosseini SM, Derakhshanfar H. Evaluation of clinical education in pediatric wards of hospitals affiliated to Shahid Beheshti University of Medical Sciences according to the Ministry of Health Standards in 2015. J Mil Med. 2017;19(1):63–71.

13. Yadollahi Farsani S, Mahdian M, Ahmadi S, Hosseini MA. The evaluation of clinical journal clubs in educational hospitals of Kashan University of Medical Sciences based on national standards: lecturers versus residents viewpoints. J Adv Med Educ. 2018;1(2):35–39.

14. Moharari RS, Soleymani HA, Nejati A, Rezaeefar A, Khashayar P, Meysamie AP. Evaluation of morning report in an emergency medicine department. Emerg Med J. 2010;27(1):32–36. doi:10.1136/emj.2008.067256

15. Yazdani S, Arab M, Hosseini F, et al. Evaluation of the structure of morning report sessions of the wards of type one educational hospitals and comparison with announced standards of the Ministry of Health and Medical Education. Qom Univ Med Sci J. 2013;7(Suppl 1):43–50.

16. Yahkforoshha A, Shirazi M, Yousefzadeh N, et al. Psychometric properties of the communication skills attitude scale (CSAS) measure in a sample of Iranian medical students. J Adv Med Educ Prof. 2018;6(1):14–20.

17. Pezeshki MZ, Shadman A, Alizadeh M, Hakimi S, Heidari F. Validity and reliability of the questionnaire for assessing women’s reproductive history in Azar cohort study. J Caring Sci. 2017;6(2):183–190. doi:10.15171/jcs.2017.018

18. Vackova D, Chen CK, Lui JN, Johnston JM. A validation study of public health knowledge,skills, social responsibility and applied learning. Int J Med Educ. 2018;9(1):175–181.

19. Mulder J, Fox JP. Bayes factor testing of multiple intraclass correlations. Bayesian Anal. 2019;14(2):521–552. doi:10.1214/18-BA1115

20. Moosavian SP, Feizi A, Esmaillzadeh A, Brett NR, Bellissimo N, Azadbakht L. Developing and assessing the validity and reliability of an Iranian food security questionnaire. Arch Iran Med. 2019;22(1):11–23.

21. Yaghmaie F. Content validity and its estimation. J Med Educ. 2009;3(1):870–876.

22. Nejatdarabi H, Amini M, Yarahmadi J, Bikineh P, Kaveh M, Gholampoor H. Validity and reliability of the Persian version of WATCH questionnaire in assessing the clinical learning environment. Biomed Res. 2018;29(17):50–55. doi:10.4066/biomedicalresearch.29-18-1023

23. Al Ansari A, Tabbara KS. Evaluating the reliability and validity of the Maastricht clinical teaching questionnaire in Bahrain. Oman Med J. 2019;34(5):427–433. doi:10.5001/omj.2019.78

Creative Commons License © 2020 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.

Download Article [PDF]