Back to Journals » Journal of Multidisciplinary Healthcare » Volume 17

The Application of ChatGPT in Medicine: A Scoping Review and Bibliometric Analysis

Authors Wu J, Ma Y, Wang J, Xiao M

Received 14 February 2024

Accepted for publication 25 March 2024

Published 18 April 2024 Volume 2024:17 Pages 1681—1692

DOI https://doi.org/10.2147/JMDH.S463128

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 3

Editor who approved publication: Dr Scott Fraser



Jie Wu,1 Yingzhuo Ma,1 Jun Wang,1 Mingzhao Xiao2

1Department of Nursing, the First Affiliated Hospital of Chongqing Medical University, Chongqing, People’s Republic of China; 2Department of Urology, the First Affiliated Hospital of Chongqing Medical University, Chongqing, People’s Republic of China

Correspondence: Mingzhao Xiao, Department of Urology, the First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, People’s Republic of China, Email [email protected] Jun Wang, Department of Nursing, the First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, People’s Republic of China, Email [email protected]

Purpose: ChatGPT has a wide range of applications in the medical field. Therefore, this review aims to define the key issues and provide a comprehensive view of the literature based on the application of ChatGPT in medicine.
Methods: This scope follows Arksey and O’Malley’s five-stage framework. A comprehensive literature search of publications (30 November 2022 to 16 August 2023) was conducted. Six databases were searched and relevant references were systematically catalogued. Attention was focused on the general characteristics of the articles, their fields of application, and the advantages and disadvantages of using ChatGPT. Descriptive statistics and narrative synthesis methods were used for data analysis.
Results: Of the 3426 studies, 247 met the criteria for inclusion in this review. The majority of articles (31.17%) were from the United States. Editorials (43.32%) ranked first, followed by experimental studys (11.74%). The potential applications of ChatGPT in medicine are varied, with the largest number of studies (45.75%) exploring clinical practice, including assisting with clinical decision support and providing disease information and medical advice. This was followed by medical education (27.13%) and scientific research (16.19%). Particularly noteworthy in the discipline statistics were radiology, surgery and dentistry at the top of the list. However, ChatGPT in medicine also faces issues of data privacy, inaccuracy and plagiarism.
Conclusion: The application of ChatGPT in medicine focuses on different disciplines and general application scenarios. ChatGPT has a paradoxical nature: it offers significant advantages, but at the same time raises great concerns about its application in healthcare settings. Therefore, it is imperative to develop theoretical frameworks that not only address its widespread use in healthcare but also facilitate a comprehensive assessment. In addition, these frameworks should contribute to the development of strict and effective guidelines and regulatory measures.

Keywords: ChatGPT, large language model, medicine, nursing, scoping review, bibliometric analysis

Introduction

ChatGPT (Chat Generative Pre-trained Transformer), an advanced AI-driven language model, has been developed by OpenAI. It is grounded in the Transformer neural network architecture, particularly the GPT-3.5 variant. This model undergoes extensive pre-training, assimilating statistical laws and discernible patterns from vast datasets. As a result, it exhibits the capability to autonomously generate responses to queries, participate in interactive dialogues with users, and compose various forms of text, including emails and academic papers.1 Following its public release on November 30, 2022, ChatGPT has rapidly garnered immense popularity, evidenced by millions of user registrations in a brief timeframe and exceeding 100 million active users within two months. This growth trajectory establishes ChatGPT as the fastest-growing consumer application in history, underscoring its exceptional appeal.2 Mohammad Fraiwan’s survey elucidates ChatGPT’s extensive applications across diverse domains such as automation technology, computer science, journalism, ethics, and medicine. Concurrently, comparable language models, notably Google’s Bard and Meta’s LLaMA, have emerged, seeking to establish their presence in the market. The proliferating adoption of such models, including ChatGPT, is poised to exert a profound influence across various sectors. Consequently, ChatGPT has captivated global scholarly interest, stimulating discourse on its current utility and prospective research trajectories.

ChatGPT is emerging as a transformative force in various sectors, with a notable impact in the realm of medicine. The burgeoning corpus of research on ChatGPT’s medical applications is expanding. Survey findings reveal that current studies in this area predominantly concentrate on (1) Medical Education, such as assessing ChatGPT’s efficacy in medical examinations,3,4 facilitating case-based learning,5 and enhancing communication skills;6 (2) Clinical practice, including optimizing clinical decisions,7 generating reports8 and providing personalized guidance to patients.9 (3) Scientific Research, For example, helping researchers to collect and analyze medical literature,10 writing abstracts11 and exploring the accuracy of information generated by ChatGPT. Despite the considerable advantages and substantial contributions of ChatGPT to medical advancements, concerns linger regarding the reliability of information sources, ethical implications in medicine, and the potential for academic misconduct.

The majority of current ChatGPT literature in the medical field consists of preprints, reviews, or letters to the editor that cover various aspects of medicine. However, it is unclear what these researches specifically examine. Current reviews on ChatGPT in the medical field are relatively homogeneous. Blanco-Gonzalez’s review described the advantages of ChatGPT in drug discovery, while Aydın and Karaarslan’s review focused on its benefits in knowledge compilation and representation.12 Malik Sallam13 also reviewed the use of ChatGPT in medicine, analyzing the advantages and disadvantages of its use in three areas: medical education, research and practice, but included a high proportion of preprints and lacked analysis of the external characteristics of the literature. Daniel Gödded14 used SWOT to review ChatGPT in the medical literature, which analyzed its application advantages and limitations from the perspective of medicine as a whole, but the search database was limited to PubMed and to the theoretical level of medicine, without further analysis of the use of ChatGPT in clinical practice. Therefore, the current reviews were limited to specific fields; however, there is no review that explores the current state of ChatGPT research in medicine from a global perspective. In addition, as an emerging technology, research on ChatGPT was scattered and fragmented without a clear theoretical structure. Therefore, it is important for researchers to synthesize the existing literature to gain a comprehensive understanding of the current state of ChatGPT research and future trends.

A scoping review is an established methodology for exploring the design and execution of research within a specific thematic area.15 This approach is instrumental in disseminating knowledge by delineating the extent, characteristics, and lacunae within existing literature, whilst ensuring the robustness and rigorous quality of the research.16 Consequently, it is extensively utilized to synthesize evidence pertaining to a given subject, especially in the context of burgeoning fields or in assessing the research trajectory of a particular topic.17 In light of the aforementioned rationale and our research objectives, a scoping review emerges as the most suitable methodology for our study. Bibliometrics, a method of quantitative analysis employing mathematical and statistical techniques, is utilized to scrutinize various facets of literature, encompassing the volume of publications, authorship, affiliations, publication year, and key terms.18 The primary aim of this approach is to elucidate the current research landscape and discern emerging trends within a study domain. Of late, ChatGPT has piqued significant interest in the medical sector, leading to an uptick in related research endeavors. Nonetheless, there exists a notable deficiency in comprehensive reports delineating the current state and prospective developments of ChatGPT research in this realm. To bridge this gap, we have undertaken bibliometric analysis, offering insights that could contribute to the ongoing evolution of this field.

Methods

The scoping review used an approach based on the 2005 Arksey and O’Malley19 framework. The review was conducted in two steps: a scoping review and bibliometric analysis. First, the framework was adopted for a scoping review. Then, a bibliometric analysis was conducted to analyze the key topic domains, and future study trends using retrieved scoping review results. No quality evaluation was carried out because the aim was to broaden the scope of included literature, consistent with the overall objectives and methodology of scope evaluation, and critical evaluation was not a necessary part of the scope evaluation framework. At the same time, in order to more accurately reflect the research trends and applications of ChatGPT in related professional fields. Priority was therefore given to databases focused on medicine, health, engineering, and computer science.

Step1 Scoping Review

Stage1 Identifying the Research Question

The overall research objective of this study was to draw up the current status and trends of ChatGPT research in the medical field.

To accomplish the purpose of this study, the following research questions were identified:

  1. What are the key characteristics of publications, such as country and type of publications on the application of ChatGPT in medicine?
  2. Regarding the use of ChatGPT in medicine, the purpose of these researches in the literature?
  3. To which secondary medical disciplines does the existing literature on ChatGPT predominantly pertain?
  4. What are the advantages and disadvantages of using ChatGPT in medicine?

Stage2 Identifying Relevant Studies

Six electronic databases, including PubMed, CINAHL, Embase, Web of Science, IEEE Xplore, and ACM Digital Library were systematically searched from November 30, 2022, to August 16, 2023. The search strategies combined the subject terms and free words, consisting of terms of ChatGPT, medicine, and nursing. The search strategy of each database is shown in Supplementary Material S1. Relevant references of articles and reviews included in this review were manually searched to ensure that all relevant primary studies were contained.

Stage3 Study Selection

The review considered the studies in line with the following inclusion and exclusion criteria. Inclusion criteria: (a) Studies published in English; (b) literature publication year from November 30, 2022, to August 16, 2023; (c) Studies relevant to ChatGPT or aiming to apply it in the medicine field; (d) All peer-reviewed studies, with no restrictions on the type of literature (eg, papers, conference abstracts).

Exclusion criteria: (a) Preprints; (b) Studies involving animals; (c) Literature without full content.

Stage4 Charting the Data

Data were systematically collated from each paper utilizing a structured Microsoft Excel data-extraction sheet. Subsequently, formal data extraction was meticulously conducted by two researchers. The primary information extracted encompassed various dimensions: (a) characteristics of the articles, including author, country of origin, and journal name; (b) types of studies, comprising editorials, cross-sectional, longitudinal, qualitative, experimental studies, reviews, among others; (c) central theme of the articles, such as clinical practice, medical education, or scientific research; (d) secondary disciplines employed in the studies; and (e) advantages and disadvantages noted.

Stage5 Collating, Summarizing, and Reporting the Results

Frequencies and percentages were utilized to describe key characteristics of included publications.

We performed qualitative analyses of the results of the scoping review. The qualitative analysis, for its part, consisted of using content analysis to categorize the study objectives, to group them by category.20

Step2 Bibliometric Analysis

VOSviewer version 1.6.19 was performed to draw a network of keywords co-occurrences, and a network of co-authors. In these visualization images, researchers or keywords are depicted as dots, the size of each dot corresponds to the frequency of co-occurrences, and the connecting lines between nodes indicate interrelationships. The thicker the lines, the more connections. Additionally, NVivo version 12 played a pivotal role in analyzing the term frequency within the titles of the included literature, thereby enriching our understanding of the research topic.

Results

A total of 3426 articles were retrieved from the six databases and 1328 duplicates were removed. A total of 247 articles were screened after careful selection of titles and abstracts, as well as a thorough evaluation of the full text. This included one article that was found in a separate search. Detailed information on these included articles can be found in Supplementary Material S2. Figure 1 presents the process of the article screening and eligibility.

Figure 1 PRISMA flowchart.

Key Characteristics and Bibliometric Properties of the Included Literature

Table 1 presents the characteristics of publications. Of the 247 publications, The United States (n = 77, 31.17%) contributed the most articles, followed by China (n = 24, 9.71%), India (n = 16, 6.48%), and the United Kingdom (n = 13, 5.26%). Most publications (n = 113, 45.75%) pertained to clinical practice, followed by medical education (n = 67, 27.16%) and scientific research (n = 67, 27.13%). Publications that were difficult to classify were “other” (n =27, 10.93%). About 41.30% of the retrieved documents were editorial/ letter to editor (n=107), whereas 11.74% were experimental studies and 7.69% employed review methods. The top three journals were ANNALS OF BIOMEDICAL ENGINEERING, CUREUS, and AESTHETIC SURGERY JOURNAL, accounting for 21.25% of the total.

Table 1 Summary of Key Characteristics of Included Publications

Network of Researchers and Co-Authors

Of 247 articles, 948 authors were included, ranging from one author to 5 authors, According to Price’s law,21 the core author’s publication volume Mp = 0.749 × (NPmax)1/2, the calculation Mp ≈ 1.67, that was, the author who published more than 2 papers was the core author, and there were a total of 64 authors, Figure 2C and 2D presents the network of co-authors with a large set of connected authors consisting of 36 items. In this network, a total of 3 clusters occurred (cluster 1 = 13 authors, cluster 2 = 12 authors, and cluster 3 = 11 authors). These three clusters generated co-authorship advantages dominated by Abouammoh, Noura A, Al-Eyadhy, Ayman, Abdulmajeed, and Naif.

Figure 2 Authors and keywords networks analysis using the density visualization module of VOSviewer software. (A and B), networks of keywords; (C and D), network of authors.

Network of Keywords

In this study, a total of 610 keywords were identified, with “artificial intelligence”, “ChatGPT”, “chatbot”, and “medical education” emerging as the most frequently occurring terms (Figure 2A and 2B). By setting a threshold of at least three occurrences, 597 keywords were selected for constructing the keyword network. This network was comprised of seven distinct clusters (as depicted in Figure 2A and 2B): Cluster 1 (in red) and Cluster 2 (in green) predominantly focused on the application of ChatGPT in medicine. Cluster 3 (in blue) concentrated on clinical practice, medical education, scientific research, and the role of OpenAI in medicine. Cluster 4 (in yellow) was mainly related to secondary disciplines within medicine, while Cluster 5 (in purple) focused on applications in medical learning and neurosurgery. Cluster 6 (in cyan) centered on urology applications, and Cluster 7 (in orange) addressed ethical considerations, privacy, and security concerns.

Secondary Disciplines

Out of the 247 publications, 148 mentioned specific disciplines. According to Figures 3 and 4, the top 5 disciplines in order were Radiology (n=12), Oncology (n=9), Surgical (n=9), Orthopedics (n=8), Dentistry (n=7), Dermatology (n=7), Nursing (n=7), Plastic Surgery (n=6), and Urology (n=6). The Figure 4 illustrates the initial exploration of ChatGPT by various disciplines, particularly the disciplinary studies involving image analysis.

Figure 3 Word cloud analysis of secondary discipline.

Figure 4 Specific number of secondary disciplines.

The Application Fields of These Studies

The literature included in this review were original research, review articles, case reports and letters to editors. According to Table 1 and Figure 2A and 2B, it can be seen that ChatGPT is mainly involved in three aspects of the medical field: clinical practice, medical education, and scientific research, such as assisting in diagnosis, providing expertise, and writing abstracts.

ChatGPT in the Clinical Practice

Clinical practice focuses on the following four areas, (1) Clinical decision making: (a) Generating disease differential diagnosis. (b) Optimizing the clinical decision-making process. (2) Disease Information and Medical Consultation: (a) Providing reliable disease information, including drug use analysis, personalized drug recommendations. (b) Identify disease problems and treatment options. (c) Provide comprehensive and reliable medical information. (d) Understand performance assessment in terms of clinical problems and drug-related advice. (e) Optimize radiation dose for age to enhance safety. (3) Medical Documentation and Reporting: (a) Generate and streamline radiology reports. (b) Assist in selecting imaging tests to enhance report quality. (c) Assist in writing case reports. (4) Clinical Discussion and Predictive Modelling: (a) Summarize core elements of clinical discussion. (b) Solve patient problems. (c) Provide predictive modeling of deaths.

ChatGPT in Medical Education

Current research has assessed the applicability of ChatGPT in medical education and has divided its use into two broad segments: assessing its performance in various medical examinations and its ability to provide information support to students and healthcare professionals. Increasingly, scholars are recognizing ChatGPT as a tool for improving the effectiveness, appeal and personalization of medical education systems. This includes its potential role in curriculum development, innovation in teaching methods, and the creation of new assessment and evaluation frameworks. In addition, ChatGPT provides opportunities to enhance students’ knowledge, competencies and capabilities. For example, it can simulate actual differential diagnoses, provide comprehensive and streamlined learning resources for designing lesson plans, act as a virtual patient or professor for interactive clinical case studies, help students review and generate practice questions, and create a coherent framework for integrating existing and new knowledge based on the curriculum. Thus, these findings suggest that the field of medical education is looking forward to the use of ChatGPT.

ChatGPT in Scientific Research

ChatGPT has been active in academia and in the field of manuscript writing due to its unique “learning” capabilities and has been explored as a tool for generating scholarly content, both in scientific research and in scholarly creation (eg, collating and proofreading materials and forming first drafts). Existing research has mainly focused on the use of ChatGPT for aggregating and synthesizing complex datasets, such as generating concise abstracts, creating systematic evaluations, and essay writing. Secondly, ChatGPT also helps researchers discover new connections and hypotheses in the data and drive scientific innovation. Although ChatGPT has the potential to increase efficiency and academic creativity in the research field, and a small amount of literature has been produced through ChatGPT-based abstracts, articles, titles, and references. However, academic institutions and researchers continue to express concerns about it.

The Pros and Cons of ChatGPT

The application of ChatGPT in the medical field presents several advantages: (1) Large Data Processing: ChatGPT’s capability to process vast amounts of data results in more detailed and realistic text generation. (2) Pre-Training Generation: It can create text from extensive datasets, tailored for specific tasks, thereby enhancing text quality. (3) Complex Task Handling: ChatGPT is adept at language translation, answering queries, and summarizing text. (4) Context-Based Text Creation: It produces responses grounded in the provided input knowledge. (5) Real-Time Processing: Offering 24/7 online service availability. (6) Efficiency Improvement: The pre-training aspect facilitates swift responses. These benefits are integral to its practical applications in healthcare. Conversely, an investigation into the ethical dimensions uncovers certain drawbacks. As Wang22 elucidated, several ethical issues were associated with ChatGPT’s usage in healthcare. To further this understanding, we synthesizethe perspectives and attitudes presented in the literature. Out of 247 documents, 149 expressed views on ethics, comprising a total of 373 items. The breakdown of these viewpoints is as follows (refer to Figure 5): Legal ethics (63 items), Humanistic ethics (13 items), Algorithmic ethics (113 points), Information ethics (81 items), Uncategorized biases (36 items), General ethical concerns (54 items), and Miscellaneous issues like misuse, energy consumption, and hallucinations (16 items). Legal ethics issues stem from patient data collection and potential privacy breaches. Humanistic ethics emphasize honesty and the physician-patient relationship. Algorithmic ethics cover concerns such as bias, transparency deficiencies, inappropriate data use, and excessive reliance on data without human intervention. Information ethics involve data bias, inaccuracy, and knowledge limitations. Figure 5 indicates that algorithmic and information ethics are predominant concerns, with the most significant being the risk of incorrect data leading to unreliable information or distrust in ChatGPT-generated content due to its inherent limitations.

Figure 5 Number of ethical categories.

Discussion

In this review, we reviewed published articles (from 30 November 2022 to 16 August 2023) that used ChatGPT within the medicine. Through an extensive literature search, we obtained a large number of research results revealing the exploration of ChatGPT by medical scholars worldwide. Analysis of characteristics, key topic areas using scope review nested bibliometric analysis revealed that current research on ChatGPT in medicine focuses on three main areas: clinical practice, medical education, and scientific research, in addition to outlining the pros and cons, and to the best of our knowledge, this is the first study to utilize such an approach.

It is perhaps not a coincidence that existing research has focused on three areas: clinical practice, education, and scientific research. The integration of advanced science and technology is not only in keeping with the times but can also go a long way towards solving existing problems in the relevant fields. In clinical practice, according to the World Health Statistics Report 2023 released by WHO,23 the global health workforce is still in a state of shortage and suffers from unequal distribution, inefficiency, lack of support and protection, and shortages at the national and subnational levels. The rational application of ChatGPT may be an effective approach. Clinical application studies currently exist in various processes of clinical work, such as diagnosis,24,25 decision-making instruments,26 and generating reports,27 and some of these studies have shown that ChatGPT is effective in improving the efficiency of nursing care,28,29 assisting in diagnosis,30 and so on, but it is important to note that these studies have not been practically deployed in the clinical setting and practice.

In the field of contemporary medical education, the use of advanced technologies such as ChatGPT is gradually changing traditional teaching and learning methods. It is important to note that the audience for education is not limited to students, but also to educators themselves. For students, ChatGPT can serve as a powerful learning tool, providing personalized learning assistance,31 and instant feedback to deepen understanding of complex medical concepts. At the same time, the technology simulates clinical situations and helps students practice diagnostic and therapeutic skills in a safe environment,32 which is critical to improving their clinical decision-making and hands-on skills. For educators, the use of technology products such as ChatGPT can greatly improve the efficiency and quality of instruction.33 Teachers can use these tools to access the latest medical research, update course content, and even use these technologies for classroom interactions and student assessments to better accommodate students’ learning needs and styles. In addition, ChatGPT can also be used as an educational research aid to help teachers analyze student learning data to optimize teaching methods and strategies.

In the field of scientific research, especially medical research, human original thinking, critical analysis and the pursuit of scientific rigor together form the cornerstone of authoritative scientific research. These elements not only drive progress in the field of medicine, but also form the unique “evidence base” of the medical community. Against this backdrop, it is important to weigh the advantages and limitations of introducing AI technologies such as ChatGPT into research activities to ensure that the quality and authority of research are not compromised. While ChatGPT has demonstrated significant advantages in processing big data, accelerating literature search, and assisting in experimental design,34 it still lacks human capabilities in innovative thinking and in-depth critical analyses. AI’s analyses and recommendations may rely to some extent on existing data and prior knowledge, which may limit the possibilities for innovation.

However, benefits are often accompanied by risks. ChatGPT, surpassing basic chatbots in performance metrics, has been tested across diverse knowledge domains. Its versatility stems from deep learning techniques, including the Transformer architecture and pre-training, enabling applications in Q&A, text summarization, language translation, dialogue generation, text completion, and content auditing. Concerns about data bias and academic integrity arise due to opaque training data sources and processes. Some literature suggests that ChatGPT-generated references may be fictitious.35–38 Currently, the medical community’s views on ChatGPT are mixed, with criticisms mainly focused on tasks requiring creativity or highly customized responses. For example, Molligoda Arachchige39 points out that ChatGPT cannot quite capture complex nuanced details like an expert, and that the limitations of high-quality data will make it difficult to provide the right kind of personalized service. The absence of professional instructions for ChatGPT results in varied user experiences and outcomes. Despite high expectations, the acquisition and application of knowledge by ChatGPT are distinct processes. The medical field’s specificity and privacy issues necessitate focusing on ethical considerations like patient privacy39 and doctor-patient relationships in current literature.

Users expect seamless interactions, yet ChatGPT’s limitations may not fulfill these expectations, highlighting a gap between expectations and reality. While these limitations cannot be fully resolved, they can be mitigated through program improvements, data updates, chatbot training, and user experience standardization. Addressing plagiarism and academic misconduct requires establishing robust ethical standards that balance AI’s value with individual interests. Authors using ChatGPT in academic writing should transparently disclose its usage, allowing for clear identification of authorship. With proper oversight, ChatGPT can assist healthcare professionals and students in task completion efficiently.1

Despite the current challenges and ethical issues associated with the use of ChatGPT in the medical field, there is still some potential for ChatGPT in the medical field. More empirical studies are needed to assess the exact impact of ChatGPT in the medical field. This is because the available literature so far has focused on the views or perspectives about ChatGPT. It is also worth noting that ChatGPT was not specifically developed for the medical field and further work is needed to enhance its depth of medical knowledge. In addition, it is important to develop appropriate ethical guidelines, restrictions in its use to improve accuracy, originality, bias, and misuse, and to overcome privacy, academic integrity, and ethics related issues.

Limitations

It is worth noting that although our review was designed to provide an indication of the use of ChatGPT in the medical field, the vast majority of the papers reviewed were editorials/letters to the editor, which may have affected the quality of the evidence used in this review, but an assessment of quality was not within the remit of the scoping study. As of the date of conducting the review, there may be little original research on the use of ChatGPT in medicine, which may limit the validity of ChatGPT over time and the ability to provide a comprehensive description of potential problems. Despite the different databases we took to search for research evidence, some literature was missed, and the inclusion of only English language literature may have exacerbated this limitation.

Conclusion

ChatGPT is an advanced language model with numerous advantages and applications in the healthcare and medical fields. It can assist healthcare professionals in various tasks, including research, diagnosis, patient monitoring, and medical education. However, the use of ChatGPT also presents ethical considerations and limitations, such as trustworthiness, plagiarism, copyright infringement, and bias. Therefore, it is crucial to thoroughly evaluate and address potential limitations and ethical considerations before implementing ChatGPT in the healthcare and medical sectors. Future research could focus on developing methods to mitigate these limitations while utilizing the benefits of ChatGPT.

Acknowledgments

We would like to thank all team members for their considerable support and assistance.

Funding

This study was funded by the Chongqing Elite Master Grant 2021(CQYC202003).

Disclosure

The authors report no conflicts of interest in this work.

References

1. Wu T, He S, Liu J, et al. A brief overview of ChatGPT: the history, status quo and potential future development. IEEECAA J Autom Sin. 2023;10(5):1122–1136. doi:10.1109/JAS.2023.123618

2. Currie GM. Academic integrity and artificial intelligence: is ChatGPT hype, hero or heresy? Semin Nucl Med. 2023;53(5):719–730. doi:10.1053/j.semnuclmed.2023.04.008

3. Bhayana R, Krishna S, Bleakney RR. Performance of ChatGPT on a radiology board-style examination: insights into current strengths and limitations. Radiology. 2023;307(5). doi:10.1148/radiol.230582

4. Weng TL, Wang YM, Chang S, Chen TJ, Wang SJ. ChatGPT failed Taiwan’s Family Medicine Board Exam. J Chin Med Assoc. 2023;86(8):762–766. doi:10.1097/JCMA.0000000000000946

5. Sevgi UT, Erol G, Dogruel Y, Sonmez OF, Tubbs RS, Gungor A. The role of an open artificial intelligence platform in modern neurosurgical education: a preliminary study. Neurosurg Rev. 2023;46(1):86. doi:10.1007/s10143-023-01998-2

6. Li SW, Kemp MW, Logan SJS, et al. ChatGPT outscored human candidates in a virtual objective structured clinical examination in obstetrics and gynecology. Am J Obstet Gynecol. 2023;229(2):172.e1–172.e12. doi:10.1016/j.ajog.2023.04.020

7. Seth I, Cox A, Xie Y, et al. Evaluating chatbot efficacy for answering frequently asked questions in plastic surgery: a ChatGPT case study focused on breast augmentation. Aesthet Surg J. 2023;43(10):1126–1135. doi:10.1093/asj/sjad140

8. Liu G, Ma X, Zhang Y, Su B, Liu P. GPT4: the indispensable helper for neurosurgeons in the new era. Ann Biomed Eng. 2023;51(10):2113–2115. doi:10.1007/s10439-023-03241-x

9. Shay D, Kumar B, Bellamy D, et al. Assessment of ChatGPT success with specialty medical knowledge using anaesthesiology board examination practice questions. Br J Anaesth. 2023;131(2):e31–e34. doi:10.1016/j.bja.2023.04.017

10. Valentín-Bravo FJ, Mateos-álvarez E, Usategui-Martín R, Andrés-Iglesias C, Pastor-Jimeno JC, Pastor-Idoate S. Artificial Intelligence and new language models in Ophthalmology: complications of the use of silicone oil in vitreoretinal surgery. Arch Soc Esp Oftalmol Engl Ed. 2023;98(5):298–303. doi:10.1016/j.oftale.2023.04.011

11. Xie Y, Seth I, Rozen WM, Hunter-Smith DJ. Evaluation of the artificial intelligence chatbot on breast reconstruction and its efficacy in surgical research: a case study. Aesthetic Plast Surg. 2023;47(6):2360–2369. doi:10.1007/s00266-023-03443-7

12. Aydın Ö, Karaarslan E. OpenAI ChatGPT generated literature review: digital twin in healthcare. SSRN J. 2022. doi:10.2139/ssrn.4308687

13. Sallam M. ChatGPT utility in healthcare education, research, and practice: systematic review on the promising perspectives and valid concerns. Health Care. 2023;11(6):887. doi:10.3390/healthcare11060887

14. Gödde D, Breuckmann F, Sellmann T. A SWOT (Strengths, Weaknesses, Opportunities, and Threats) analysis of ChatGPT in the medical literature: concise review. J Med Internet Res. 2023;25:e49368. doi:10.2196/49368

15. Peterson J, Pearce PF, Ferguson LA, Langford CA. Understanding scoping reviews: definition, purpose, and process. J Am Assoc Nurse Pract. 2017;29(1):12–16. doi:10.1002/2327-6924.12380

16. Levac D, Colquhoun H, O’Brien KK. Scoping studies: advancing the methodology. Implement Sci. 2010;20(5):69. doi:10.1186/1748-5908-5-69

17. Munn Z, Peters MDJ, Stern C, Tufanaru C, McArthur A, Aromataris E. Systematic review or scoping review? Guidance for authors when choosing between a systematic or scoping review approach. BMC Med Res Methodol. 2018;19(1):143. doi:10.1186/s12874-018-0611-x

18. Ninkov A, Frank JR, Maggio LA. Bibliometrics: methods for studying academic publishing. Perspectives on Medical Education. 2022;11(3):173–176. doi:10.1007/S40037-021-00695-4

19. Arksey H, O’Malley L. Scoping studies: towards a methodological framework. Int J Soc Res Methodol. 2005;8(1):19–32. doi:10.1080/1364557032000119616

20. Navas-Martín MÁ, Ovalle-Perandones MA, López-Bueno JA, Díaz J, Linares C, Sánchez-Martínez G. Population adaptation to heat as seen through the temperature-mortality relationship, in the context of the impact of global warming on health: a scoping review. Sci Total Environ. 2024;908:168441. doi:10.1016/j.scitotenv.2023.168441

21. Fraga-Graells E, Povedano-Montero FJ, Alvarez-Peregrina C, Villa-Collar C, Arance-Gil A, Sánchez-Tena MA. Bibliometric study of refractive surgery and dry eye scientific literature. Arch Soc Esp Oftalmol Engl Ed. 2022;97(6):323–330. doi:10.1016/j.oftale.2022.02.011

22. Wang C, Liu S, Yang H, Guo J, Wu Y, Liu J. Ethical Considerations of Using ChatGPT in Health Care. J Med Internet Res. 2023;25:e48009. doi:10.2196/48009

23. World Health Organization. World health statistics 2023; 2023. Available from: https://www.who.int/publications/i/item/9789240074323. Accessed April 2, 2024.

24. Castro H. ChatGPT-4: dermatology aid in the emergency room. Dermatol Heidelb. 2023;74(8):633–634. doi:10.1007/s00105-023-05186-7

25. Chen J, Liu L, Ruan S, Li M, Yin C. Are different versions of ChatGPT’s ability comparable to the clinical diagnosis presented in case reports? A descriptive study. J Multidiscip Healthc. 2023;16:3825–3831. doi:10.2147/JMDH.S441790

26. Liu S, Wright AP, Patterson BL, et al. Using AI-generated suggestions from ChatGPT to optimize clinical decision support. J Am Med Inform Assoc. 2023;30(7):1237–1245. doi:10.1093/jamia/ocad072

27. Bosbach WA, Senge JF, Nemeth B. Ability of ChatGPT to generate competent radiology reports for distal radius fracture by use of RSNA template items and integrated AO classifier. Curr Probl Diagn Radiol. 2024;53(1):102–110. doi:10.1067/j.cpradiol.2023.04.001

28. Chiesa-Estomba CM, Lechien JR, Vaira LA. Exploring the potential of Chat-GPT as a supportive tool for sialendoscopy clinical decision making and patient information support. Eur Arch Otorhinolaryngol. 2023. doi:10.1007/s00405-023-08104-8

29. Alanzi TM. Impact of ChatGPT on teleconsultants in healthcare: perceptions of healthcare experts in Saudi Arabia. J Multidiscip Healthc. 2023;16:2309–2321. doi:10.2147/JMDH.S419847

30. Koga S, Martin NB, Dickson DW. Evaluating the performance of large language models: chatGPT and Google Bard in generating differential diagnoses in clinicopathological conferences of neurodegenerative disorders. Brain Pathol. 2023;e13207. doi:10.1111/bpa.13207

31. Bhattacharyya M, Miller VM, Bhattacharyya D, Miller LE. High rates of fabricated and inaccurate references in ChatGPT-generated medical content. Cureus. 2023. doi:10.7759/cureus.39238

32. Zaabi M, Hariri W, Smaoui N. A review study of ChatGPT applications in education; 2023. doi:10.1109/INISTA59065.2023.10310439

33. Wong RSY, Ming LC, Hons B, Affendi R, Ali R. The intersection of ChatGPT, clinical medicine, and medical education. JMIR Med Educ. 2023;9:e47274. doi:10.2196/47274

34. Ruksakulpiwat S, Kumar A, Ajibade A. Using ChatGPT in medical research: current status and future directions. J Multidiscip Healthc. 2023;16:1513–1520. doi:10.2147/JMDH.S413470

35. Biswas S. ChatGPT and the future of medical writing. Radiology. 2023;307(2):e223312. doi:10.1148/radiol.223312

36. Thorp HH. ChatGPT is fun, but not an author. Am Assoc Adv Sci. 2023;379(6630):313. doi:10.1126/science.adg7879

37. Haman M, Školník M. Using ChatGPT to conduct a literature review. Acc Res. 2023;6:1–3. doi:10.1080/08989621.2023.2185514

38. Eysenbach G. The role of ChatGPT, generative language models, and artificial intelligence in medical education: a conversation with ChatGPT and a call for papers. JMIR Med Educ. 2023;9:e46885. doi:10.2196/46885

39. Perera Molligoda Arachchige AS, Stomeo N. Controversies surrounding AI-based reporting systems in echocardiography. J Echocardiogr. 2023;21(4):184–185. doi:10.1007/s12574-023-00620-0

Creative Commons License © 2024 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.