Assessing the feasibility of ChatGPT-4o and Claude 3-Opus in thyroid nodule classification based on ultrasound images

C.M. Kitahara, A.B. Schneider, Epidemiology of thyroid cancer. Cancer Epidemiol. Biomark. Prev. 31(7), 1284–1297 (2022)

Article  CAS  Google Scholar 

D.W. Chen, B.H.H. Lang, D.S.A. McLeod, K. Newbold, M.R. Haymart, Thyroid cancer. LANCET 401(10387), 1531–1544 (2023)

Article  PubMed  Google Scholar 

J.Y. Park, W. Choi, A.R. Hong, J.H. Yoon, H.K. Kim, H.C. Kang, A comprehensive assessment of the harms of fine-needle aspiration biopsy for thyroid nodules: a systematic review. Endocrinol. Metab. 38(1), 104–116 (2023)

Article  CAS  Google Scholar 

J. de Carlos, J. Garcia, F.J. Basterra, J.J. Pineda, M. Dolores Ollero, M. Toni, P. Munarriz, E. Anda, Interobserver variability in thyroid ultrasound. ENDOCRINE 85(2), 730–736 (2024)

Article  PubMed  Google Scholar 

C. Zhang, J. Chen, J. Li, Y. Peng, Z. Mao, Large language models for human–robot interaction: a review. Biomim. Intell. Robot. 3(4), 100131 (2023)

Google Scholar 

K.S. Kalyan, A survey of GPT-3 family large language models including ChatGPT and GPT-4. Nat. Lang. Process. J. 6, 100048 (2024)

Article  Google Scholar 

D.-M. Petroșanu, A. Pîrjan, A. Tăbușcă, Tracing the influence of large language models across the most impactful scientific works. Electronics 12(24), 4957 (2023)

Article  Google Scholar 

H. Zong, J. Li, E. Wu, R. Wu, J. Lu, B. Shen, Performance of ChatGPT on Chinese national medical licensing examinations: a five-year examination evaluation study for physicians, pharmacists and nurses. BMC Med. Educ. 24(1), 143 (2024)

Article  PubMed  PubMed Central  Google Scholar 

D. Horiuchi, H. Tatekawa, T. Shimono, S.L. Walston, H. Takita, S. Matsushita, T. Oura, Y. Mitsuyama, Y. Miki, D. Ueda, Accuracy of ChatGPT generated diagnosis from patient’s medical history and imaging findings in neuroradiology cases. NEURORADIOLOGY 66(1), 73–79 (2024)

Article  PubMed  Google Scholar 

J.R. Lechien, T.L. Carroll, M.N. Huston, M.R. Naunheim, ChatGPT-4 accuracy for patient education in laryngopharyngeal reflux. Eur. Arch. Otorhinolaryngol. 281(5), 2547–2552 (2024)

Article  PubMed  Google Scholar 

H. Jiang, S. Xia, Y. Yang, J. Xu, Q. Hua, Z. Mei, Y. Hou, M. Wei, L. Lai, N. Li, Y. Dong, J. Zhou, Transforming free-text radiology reports into structured reports using ChatGPT: a study on thyroid ultrasonography. Eur. J. Radio. 175, 111458 (2024)

Article  Google Scholar 

B. Cavnar Helvaci, S. Hepsen, B. Candemir, O. Boz, H. Durantas, M. Houssein, E. Cakal, Assessing the accuracy and reliability of ChatGPT’s medical responses about thyroid cancer. Int J. Med Inf. 191, 105593 (2024)

Article  Google Scholar 

M. Sievert, M. Aubreville, S.K. Mueller, M. Eckstein, K. Breininger, H. Iro, M. Goncalves, Diagnosis of malignancy in oropharyngeal confocal laser endomicroscopy using GPT 4.0 with vision. Eur. Arch. Otorhinolaryngol. 281(4), 2115–2122 (2024)

Article  PubMed  Google Scholar 

F.N. Tessler, W.D. Middleton, E.G. Grant, J.K. Hoang, L.L. Berland, S.A. Teefey, J.J. Cronan, M.D. Beland, T.S. Desser, M.C. Frates, L.W. Hammers, U.M. Hamper, J.E. Langer, C.C. Reading, L.M. Scoutt, A.T. Stavros, ACR thyroid imaging, reporting and data system (TI-RADS): white paper of the ACR TI-RADS committee. J. Am. Coll. Radio. 14(5), 587–595 (2017)

Article  Google Scholar 

G. Russ, S.J. Bonnema, M.F. Erdogan, C. Durante, R. Ngu, L. Leenhardt, European Thyroid association guidelines for ultrasound malignancy risk stratification of thyroid nodules in adults: The EU-TIRADS. Eur. Thyroid J. 6(5), 225–237 (2017)

Article  PubMed  PubMed Central  Google Scholar 

B.R. Haugen, E.K. Alexander, K.C. Bible, G.M. Doherty, S.J. Mandel, Y.E. Nikiforov, F. Pacini, G.W. Randolph, A.M. Sawka, M. Schlumberger, K.G. Schuff, S.I. Sherman, J.A. Sosa, D.L. Steward, R.M. Tuttle, L. Wartofsky, 2015 American Thyroid Association Management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: The American Thyroid Association Guidelines Task Force on Thyroid Nodules and Differentiated Thyroid Cancer. THYROID 26(1), 1–133 (2016)

Article  PubMed  PubMed Central  Google Scholar 

E.J. Ha, S.R. Chung, D.G. Na, H.S. Ahn, J. Chung, J.Y. Lee, J.S. Park, R.E. Yoo, J.H. Baek, S.M. Baek, S.W. Cho, Y.J. Choi, S.Y. Hahn, S.L. Jung, J.H. Kim, S.K. Kim, S.J. Kim, C.Y. Lee, H.K. Lee, J.H. Lee, Y.H. Lee, H.K. Lim, J.H. Shin, J.S. Sim, J.Y. Sung, J.H. Yoon, M. Choi, 2021 Korean thyroid imaging reporting and data system and imaging-based management of thyroid nodules: Korean Society of Thyroid Radiology Consensus Statement and Recommendations. Korean J. Radio. 22(12), 2094–2123 (2021)

Article  Google Scholar 

T. Piticchio, G. Russ, M. Radzina, F. Frasca, C. Durante, P. Trimboli, Head-to-head comparison of American, European, and Asian TIRADSs in thyroid nodule assessment: systematic review and meta-analysis. Eur. Thyroid J. 13(2), e230242 (2024)

Article  PubMed  PubMed Central  Google Scholar 

E.J. Ha, D.G. Na, W.J. Moon, Y.H. Lee, N. Choi, Diagnostic performance of ultrasound-based risk-stratification systems for thyroid nodules: comparison of the 2015 American Thyroid Association Guidelines with the 2016 Korean Thyroid Association/Korean Society of Thyroid Radiology and 2017. Am. Coll. Radiol. Guidel., THYROID 28(11), 1532–1537 (2018)

Google Scholar 

W. Mai, M. Zhou, J. Li, W. Yi, S. Li, Y. Hu, J. Ji, W. Zeng, B. Gao, H. Liu, The value of the Demetics ultrasound-assisted diagnosis system in the differential diagnosis of benign from malignant thyroid nodules and analysis of the influencing factors. Eur. Radio. 31(10), 7936–7944 (2021)

Article  Google Scholar 

B. Wang, Z. Wan, C. Li, M. Zhang, Y. Shi, X. Miao, Y. Jian, Y. Luo, J. Yao, W. Tian, Identification of benign and malignant thyroid nodules based on dynamic AI ultrasound intelligent auxiliary diagnosis system. Front Endocrinol. 13, 1018321 (2022)

Article  Google Scholar 

L. Zhou, L.L. Zheng, C.J. Zhang, H.F. Wei, L.L. Xu, M.R. Zhang, Q. Li, G.F. He, E.P. Ghamor-Amegavi, S.Y. Li, Comparison of S-Detect and thyroid imaging reporting and data system classifications in the diagnosis of cytologically indeterminate thyroid nodules. Front Endocrinol. 14, 1098031 (2023)

Article  Google Scholar 

S.H. Wu, W.J. Tong, M.D. Li, H.T. Hu, X.Z. Lu, Z.R. Huang, X.X. Lin, R.F. Lu, M.D. Lu, L.D. Chen, W. Wang, Collaborative enhancement of consistency and accuracy in US diagnosis of thyroid nodules using large language models. RADIOLOGY 310(3), e232255 (2024)

Article  PubMed  Google Scholar 

Z. Wang, Z. Zhang, A. Traverso, A. Dekker, L. Qian, P. Sun, Assessing the role of GPT-4 in thyroid ultrasound diagnosis and treatment recommendations: enhancing interpretability with a chain of thought approach. Quant. Imaging Med Surg. 14(2), 1602–1615 (2024)

Article  PubMed  PubMed Central  Google Scholar 

J. Cheng, Applications of large language models in pathology. Bioengineering 11(4), 342 (2024)

Article  PubMed  PubMed Central  Google Scholar 

C. Preiksaitis, N. Ashenburg, G. Bunney, A. Chu, R. Kabeer, F. Riley, R. Ribeira, C. Rose, The role of large language models in transforming emergency medicine: scoping review. JMIR Med Inf. 12, e53787 (2024)

Article  Google Scholar 

J. Clusmann, F.R. Kolbinger, H.S. Muti, Z.I. Carrero, J.-N. Eckardt, N.G. Laleh, C.M.L. Löffler, S.-C. Schwarzkopf, M. Unger, G.P. Veldhuizen, S.J. Wagner, J.N. Kather, The future landscape of large language models in medicine. Commun. Med. 3(1), 141 (2023)

Article  PubMed  PubMed Central  Google Scholar 

R. Loor-Torres, M. Duran, D. Toro-Tobon, M.M. Chavez, O. Ponce, C.S. Jacome, D.S. Torres, S.A. Perneth, V. Montori, E. Golembiewski, M.B. Osorio, J.W. Fan, N.S. Ospina, Y. Wu, J.P. Brito, A systematic review of natural language processing methods and applications in thyroidology. Mayo Clin. Proc. Digit Health 2(2), 270–279 (2024)

Article  PubMed  PubMed Central  Google Scholar 

K.I. Roumeliotis, N.D. Tselikas, ChatGPT and Open-AI models: a preliminary review. Future Internet 15(6), 192 (2023)

Article  Google Scholar 

T.P. Reith, D.M. D’Alessandro, M.P. D’Alessandro, Capability of multimodal large language models to interpret pediatric radiological images. Pediatr. Radio. 54(10), 1729–1737 (2024)

Article  Google Scholar 

D. Tian, S. Jiang, L. Zhang, X. Lu, Y. Xu, The role of large language models in medical image processing: a narrative review. Quant. Imaging Med. Surg. 14(1), 1108–1121 (2023)

Article  PubMed  PubMed Central  Google Scholar 

留言 (0)

沒有登入
gif