Introduction: This study assessed the potential of large language models (LLMs) as educational tools by evaluating their accuracy in answering questions across urological subtopics. Methods: Three LLMs (ChatGPT-3.5, ChatGPT-4, and Bing AI) were examined in two testing rounds, separated by 48 h, using 100 multiple-choice questions (MCQs) from the 2022 European Board of Urology (EBU) In-Service Assessment (ISA), covering five subtopics. Selecting the designated single best answer (SBA) among four options was defined as "formal accuracy" (FA). Alternative answers chosen by the LLMs that were not the SBA but were still deemed academically correct were labeled "extended accuracy" (EA), and we examined whether EA, combined with FA, enhanced the overall accuracy rate. Results: Across the two testing rounds, FA scores were 58% and 62% for ChatGPT-3.5, 63% and 77% for ChatGPT-4, and 81% and 73% for Bing AI. Incorporating EA did not yield a significant enhancement in overall performance: the resulting gains were 7% and 5% for ChatGPT-3.5, 5% and 2% for ChatGPT-4, and 3% and 1% for Bing AI (all p > 0.3). Among urological subtopics, the LLMs performed best in Pediatrics/Congenital and comparatively worst in Functional/BPS/Incontinence. Conclusion: LLMs exhibit suboptimal urology knowledge and unsatisfactory proficiency for educational purposes. Overall accuracy did not improve significantly when EA was combined with FA, error rates remained high (ranging from 16% to 35%), and proficiency varied substantially across subtopics. Further development of medicine-specific LLMs is required before integration into urological training programs.

Artificial Intelligence (AI) refers to the use of algorithms to simulate human cognition and reasoning [1]. Large language models (LLMs) are a subset of AI that have been trained to understand and generate human language and thus have emerged as valuable tools to provide tailored content quickly on a wide range of topics [1, 2].

Within the ever-evolving landscape of educational innovation, a shift from traditional textbooks to LLMs for knowledge acquisition is on the horizon. This transition underscores the potential of LLMs as valuable tools for medical information and education, given their interactive capabilities.

Nevertheless, an in-depth exploration is warranted to determine the feasibility of effectively integrating AI Chatbots into medical education. Several studies have indicated that LLMs can achieve the passing threshold in various medical exams [3‒8]. However, current evidence revealed significant variability in accuracy among different LLMs after repeated testing, highlighting a concerning lack of reliability [4].

Scarce data are available regarding the knowledge of various LLMs in the field of urology [3, 4, 9‒11]. Currently, only one study has distinguished between general urological and uro-oncological questions in urology exams, indicating a paucity of data on the performance of LLMs in specific urological subfields [3]. In this analysis, we evaluated the performance of ChatGPT-3.5, ChatGPT-4, and Bing AI across five urological subtopics using 100 multiple-choice questions (MCQs) from the 2022 In-Service Assessment of the European Board of Urology.

The European Board of Urology (EBU) In-Service Assessment (ISA) traditionally accepts only one single best answer out of four options as correct, referred to as "formal accuracy" (FA). In addition to investigating the FA of LLMs, our study aimed to explore whether responses initially marked as incorrect could nonetheless be considered academically appropriate, a concept referred to as "extended accuracy" (EA). Prior research has predominantly focused on distinguishing between correct and incorrect answers, often overlooking nuanced alternatives. Our study goes beyond FA by evaluating the informational content and accuracy of formally incorrect responses (EA) and assessing whether the resulting gains contribute to an enhanced overall rate of correct answers.

Utilized LLMs

ChatGPT is a state-of-the-art AI language model created by OpenAI, released in November 2022. It has been trained on extensive text data, enabling it to comprehend and generate human-like text, answer questions, and engage in text-based conversations across various topics. ChatGPT’s knowledge is limited to information available up to its training cutoff date in September 2021.

ChatGPT-4, launched in March 2023, is an evolutionary upgrade from its predecessor, with improved abilities in context understanding and generating coherent responses. It retrieves up-to-date information from Internet search engines, making it promising for domain-specific inquiries.

Bing AI is an AI-powered search tool developed by Microsoft and OpenAI, introduced in February 2023. It leverages the Microsoft Prometheus model, built upon OpenAI’s GPT-4 LLM. The Bing core search ranking engine is constantly improved by this AI model.

It utilizes a transformer-based semantic ranking engine to grasp the underlying meaning of text. Notably, Bing AI has the capability to cite its information sources, and the information it provides is current as of the time of the search.

Question Administration and Study Endpoints

The study was conducted with the approval and cooperation of the EBU. The ISA-2022 questions were exclusively and confidentially provided by the EBU for this study, thereby eliminating any possibility of prior training of LLMs.

In August 2023, we sequentially entered all 100 MCQs from the ISA-2022 into the LLMs in the following order: ChatGPT-3.5, ChatGPT-4, and Bing AI. Two rounds of questioning were conducted for each LLM, separated by at least 48 h. Each question was presented with the prompt "Hey LLM, please answer the following single correct answer multiple choice question (correct answers are A, B, C, or D):", followed by the question itself and the four answer options labeled A to D.
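The questions were entered manually into the chat interfaces of the three LLMs. For readers who wish to script a comparable protocol, the following minimal sketch shows how the same prompt could be submitted programmatically. It assumes the OpenAI Python SDK and a hypothetical CSV file of questions; it is an illustration rather than the procedure used in this study (Bing AI, in particular, was accessed only through its chat interface).

```python
# Minimal sketch of programmatic MCQ administration (not the study's actual
# manual chat-interface protocol). Assumes the OpenAI Python SDK and a
# hypothetical CSV file "isa_2022_mcqs.csv" with columns:
# question, option_a, option_b, option_c, option_d.
import csv
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

PROMPT = ("Hey LLM, please answer the following single correct answer "
          "multiple choice question (correct answers are A, B, C, or D):")

def ask_mcq(model: str, row: dict) -> str:
    """Submit one MCQ and return the model's raw answer text."""
    question_block = (
        f"{PROMPT}\n{row['question']}\n"
        f"A) {row['option_a']}\nB) {row['option_b']}\n"
        f"C) {row['option_c']}\nD) {row['option_d']}"
    )
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": question_block}],
    )
    return response.choices[0].message.content

if __name__ == "__main__":
    with open("isa_2022_mcqs.csv", newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            print(ask_mcq("gpt-4", row))
```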

All responses were recorded in binary form (correct vs. incorrect) according to the answers classified as correct by the EBU (FA). Two experienced board-certified urologists (K.E. and K.K.-R.) administered the queries. Besides recording the FA outcome, the urological specialists reached a consensus decision in each round for each LLM on whether answers initially labeled as incorrect nonetheless possessed conceptual and academic value (EA). The study’s primary objective was to quantify the increase in correct responses per round and per LLM and to determine its statistical significance. The secondary objective was to evaluate the performance of the three LLMs across various urological subtopics, encompassing:

Oncology (n = 35), Functional Urology/Benign Prostate Syndrome (BPS)/Incontinence (n = 14), Urolithiasis/Infection (n = 15), Pediatrics/Congenital (n = 11), and mixed themes (n = 25). In every round, FA and EA were compared for each LLM and subtopic. The answer categories correspond directly to the subtopics used in the EBU-ISA exams. Since this study was conducted in collaboration with the EBU, it was consistent to evaluate the LLMs according to the clinically relevant categories defined by the board, allowing direct comparison with the areas of expertise assessed in urological training and certification.
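To illustrate how the FA/EA scoring described above can be tallied per subtopic, the following sketch uses hypothetical data structures (field names such as `ea_upgrade` are ours for illustration, not part of the study records) and computes FA and combined FA+EA rates.

```python
# Illustrative scoring sketch (hypothetical field names, not the study's
# actual records). Each answer is stored with its subtopic, whether it
# matched the EBU key (FA), and whether the reviewers' consensus judged a
# formally wrong answer academically acceptable (EA upgrade).
from collections import defaultdict
from dataclasses import dataclass

@dataclass
class ScoredAnswer:
    subtopic: str           # e.g. "Oncology", "Pediatrics/Congenital"
    formally_correct: bool  # matches the EBU single best answer (FA)
    ea_upgrade: bool        # wrong by the key, but judged academically valid

def accuracy_by_subtopic(answers: list[ScoredAnswer]) -> dict[str, tuple[float, float]]:
    """Return {subtopic: (FA rate, FA+EA rate)} in percent."""
    totals, fa, fa_ea = defaultdict(int), defaultdict(int), defaultdict(int)
    for a in answers:
        totals[a.subtopic] += 1
        fa[a.subtopic] += a.formally_correct
        # An EA upgrade only applies to formally incorrect answers.
        fa_ea[a.subtopic] += a.formally_correct or a.ea_upgrade
    return {s: (100 * fa[s] / totals[s], 100 * fa_ea[s] / totals[s]) for s in totals}

# Example: one correct oncology answer and one EA-upgraded oncology answer.
demo = [ScoredAnswer("Oncology", True, False), ScoredAnswer("Oncology", False, True)]
print(accuracy_by_subtopic(demo))  # {'Oncology': (50.0, 100.0)}
```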

Statistical Analysis

Results were summarized using frequencies and proportions for categorical variables and compared using the χ² test or Fisher’s exact test. Significance was set at p ≤ 0.05 for all two-tailed tests. For post hoc analyses related to the second study objective, the alpha level was adjusted with a Bonferroni correction: five tests were conducted for each comparison, yielding a revised threshold for statistical significance of p ≤ 0.01. Statistical analysis was performed using SPSS 29.0 (IBM Corp., Armonk, NY, USA).
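For illustration only, the comparison of FA versus combined FA+EA rates and the Bonferroni-adjusted threshold can be reproduced with open-source tools. The sketch below uses scipy rather than SPSS (the software actually used) and takes its example counts from Table 1.

```python
# Illustrative re-analysis sketch with scipy rather than SPSS (which the study
# used). Counts are taken from Table 1: ChatGPT-3.5 in round 1 answered 58/100
# questions formally correctly and 65/100 correctly once EA was included.
from scipy.stats import chi2_contingency, fisher_exact

def compare_rates(correct_a: int, correct_b: int, n: int = 100):
    """Compare two correct/incorrect splits out of n questions (2x2 table)."""
    table = [[correct_a, n - correct_a],
             [correct_b, n - correct_b]]
    chi2, p, _, expected = chi2_contingency(table, correction=False)
    # Switch to Fisher's exact test when any expected cell count is small.
    if (expected < 5).any():
        _, p = fisher_exact(table)
        return "Fisher's exact", p
    return "chi-square", p

test, p = compare_rates(58, 65)   # FA vs. FA+EA, ChatGPT-3.5, round 1
print(test, round(p, 3))          # approx. 0.31, matching Table 1

# Post hoc subtopic analyses used a Bonferroni-adjusted threshold:
alpha_bonferroni = 0.05 / 5       # five tests per comparison -> p <= 0.01
```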

Table 1 presents the FA and EA separately for each LLM in each round. In the two rounds, the FA was 58% and 62% for ChatGPT-3.5, 63% and 77% for ChatGPT-4, and 81% and 73% for Bing AI. When incorporating EA alongside FA, ChatGPT-3.5 improved by 7% and 5% in the two rounds (Fig. 1a), ChatGPT-4 gained 5% and 2% (Fig. 1b), and Bing AI increased its correct response rate by 3% and 1% (Fig. 1c). None of these gains was statistically significant (always p > 0.3, Table 1).

Table 1.

Improvements in FA and EA by three LLMs in two rounds

LLM           Rd. 1 - FA, n (%)   Rd. 1 - EA, n (%)   p value   Rd. 2 - FA, n (%)   Rd. 2 - EA, n (%)   p value
ChatGPT-3.5   58 (58)             65 (65)             0.309     62 (62)             67 (67)             0.460
ChatGPT-4     63 (63)             68 (68)             0.457     77 (77)             79 (79)             0.733
Bing AI       81 (81)             84 (84)             0.577     73 (73)             74 (74)             0.873

EA, extended accuracy; FA, formal accuracy; Rd., round.

Fig. 1.

Increase in correct responses for each LLM across the two question rounds, evaluated by FA and by the EA provided by the respective LLM. Panels a–c show the two question rounds for ChatGPT-3.5, ChatGPT-4, and Bing AI, respectively. EA, extended accuracy; FA, formal accuracy; Rd., round.


Table 2 displays the FA and EA for each round and each LLM, subdivided by urological subtopic. Significant differences were found for the FA of ChatGPT-3.5 in the second round (p = 0.030) and the FA of Bing AI in the second round (p = 0.049), as well as for the EA of ChatGPT-4 in the first (p = 0.026) and second rounds (p = 0.048). In all other rounds, differences in FA and EA among subtopics were not statistically significant for any LLM (p > 0.05). Post hoc analyses revealed no significant differences in EA across subtopics for ChatGPT-3.5. However, a higher FA was observed for the Pediatrics/Congenital subtopic in the second round, although it did not reach the corrected significance level (p = 0.035).

Table 2.

Percentage of correct responses by each LLM, segmented by urological subtopics

LLM (round, accuracy)       Oncology (n = 35), %   Functional/BPS/Incontinence (n = 14), %   Urolithiasis/Infection (n = 15), %   Pediatrics/Congenital (n = 11), %   Mixed themes (n = 25), %   p value
ChatGPT-3.5 (Rd. 1 - FA)    57.1                   42.9                                      73.3                                 81.8                                48                         0.174
ChatGPT-3.5 (Rd. 1 - EA)    62.9                   64.3                                      80                                   81.8                                52                         0.310
ChatGPT-3.5 (Rd. 2 - FA)    51.4                   50                                        86.7                                 90.9                                56                         0.030
ChatGPT-3.5 (Rd. 2 - EA)    57.1                   64.3                                      86.7                                 90.9                                60                         0.107
ChatGPT-4 (Rd. 1 - FA)      54.3                   42.9                                      80                                   90.9                                64                         0.058
ChatGPT-4 (Rd. 1 - EA)      57.1                   50                                        93.3                                 90.9                                68                         0.026
ChatGPT-4 (Rd. 2 - FA)      80                     57.1                                      86.7                                 100                                 68                         0.074
ChatGPT-4 (Rd. 2 - EA)      80                     57.1                                      93.3                                 100                                 72                         0.048
Bing AI (Rd. 1 - FA)        71.4                   78.6                                      73.3                                 100                                 92                         0.123
Bing AI (Rd. 1 - EA)        77.1                   85.7                                      73.3                                 100                                 92                         0.214
Bing AI (Rd. 2 - FA)        80                     42.9                                      66.7                                 90.9                                76                         0.049
Bing AI (Rd. 2 - EA)        80                     50                                        66.7                                 90.9                                76                         0.138

BPS, benign prostate syndrome; EA, extended accuracy; FA, formal accuracy; Rd., round.

The LLMs were assessed in two rounds; in each round, responses were classified as either formally accurate or extendedly accurate.

In ChatGPT-4’s first round, the Pediatrics/Congenital questions showed a higher FA, which did not, however, reach the corrected significance level (p = 0.045). In the same round, a trend toward higher EA was found in Urolithiasis/Infection (p = 0.021), while the second round indicated a lower EA for Functional/BPS/Incontinence (p = 0.027).

Bing AI displayed a significant decrease in FA during the second round in the subtopic Functional/BPS/Incontinence (p = 0.006). Furthermore, there was an observable trend toward reduced EA in the second round for Functional/BPS/Incontinence (p = 0.027), although it did not reach the corrected significance level.

This study goes beyond mere "FA" and assesses the informational content and accuracy of apparently incorrect responses. The error rate does not invariably signify a proportion of incorrect responses; rather, it indicates that the "optimal" answer was not consistently identified, and it does not necessarily mean that alternative answers are categorically incorrect. This perspective, unexplored in prior studies, is first defined in the present research. Consequently, it offers crucial insights into the current limitations of LLMs in acquiring urological knowledge. The absence of significant improvements between rounds contradicts the expectation that repeated use would enhance their applicability to medical education. With ISA error rates ranging from 16 to 35%, LLMs currently fall short of the urological understanding necessary for effective training. Notably, evaluating EA did not result in significantly enhanced accuracy. Nevertheless, we recommend that future research on LLM performance consider overall response accuracy (combining FA and EA) rather than focusing solely on FA. This comprehensive approach provides a more precise representation of error rates and yields valuable insights in any medical field. It is imperative to ascertain how frequently these AI chatbots provide inaccurate information, as this serves as the genuine benchmark for their utilization in educational settings. Our findings align with other studies that demonstrated knowledge gaps in LLMs within the field of medicine [3‒7, 9] and variability across subtopics within a specialty [3, 7, 12‒15].

Comparable studies have indicated that ChatGPT-3 performs poorly in responding to queries from the American Urological Association (AUA) Self-Assessment Study Program (SASP) [10, 11]. Deebel et al. [10] administered a total of 268 questions to ChatGPT-3; it performed better on the 2021 AUA-SASP questions (42.3% correct) than on the 2022 set (30% correct). Huynh et al. [11] found that ChatGPT-3 answered only 38 of 135 (28.2%) MCQs and 36 of 135 (26.7%) open-ended questions accurately. Response regeneration reduced the number of indeterminate answers, but successive regenerations did not significantly improve accuracy, as the majority of responses remained incorrect for both open-ended questions and MCQs [11]. The LLM "Uro_chat," based on ChatGPT-3.5-Turbo and specifically trained on the European Urological Guidelines, delivered accurate responses to nearly two-thirds of the 2022 EBU-ISA questions [3]. However, in a comparative analysis of ChatGPT-3.5 and ChatGPT-4 performance on the 2022 EBU-ISA, both models demonstrated lower accuracy rates than human examinees [8].

Correct response rates were inversely related to question difficulty, with ChatGPT-4 performing better on more challenging questions. The reliability of LLMs in assessing question difficulty, as well as their performance, varied significantly across testing rounds [4, 8]. To investigate whether LLM knowledge bases improve over time, a follow-up testing round was conducted 10 weeks after an initial comparative study of ChatGPT-3.5, ChatGPT-4, and Bing AI, reassessing their performance on the 2022 EBU-ISA questions. The results did not reveal any significant performance enhancement for any of the tested LLMs [4]. This underscores the current limitations in the adaptive learning abilities of LLMs, as they are confined to responding based on the data available at the time of their last training. It highlights the importance of regularly updating, continuously training, and meticulously maintaining LLMs, particularly for their application in the acquisition of medical knowledge.

In the current analysis, the tested LLMs achieved acceptable overall accuracy rates, but post hoc subgroup analysis revealed notable variations and gaps in their knowledge levels. The LLMs excelled in pediatric urology questions but showed lower proficiency in functional urology and moderate success in oncology, suggesting limited applicability across urological subtopics. Their superior performance in Pediatrics/Congenital over Functional/BPS/Incontinence could be attributed to training data bias, as the models may have been trained more extensively on pediatric topics. Furthermore, greater accessibility of clinical data in pediatric and congenital urology likely facilitated the LLMs’ learning, compared with the limited data available for functional urology/BPS/incontinence. The intricate nature of concepts within functional urology/BPS/incontinence may also present challenges for LLMs, particularly when training does not sufficiently cover these nuanced aspects.

In the future, it will be essential to develop specially trained LLMs tailored to specific subject areas for effective knowledge acquisition. Further studies should focus on subgroup analyses within medical specializations and address the challenge of inconsistent responses caused by the limited reliability of LLMs [4]. Repeated assessments are crucial to ensure the accuracy of LLM-generated responses. In contrast, other studies have evaluated LLMs using frequently asked patient questions and recommendations from the European Association of Urology (EAU) Guidelines. Caglar et al. [9] found that ChatGPT provided satisfactory responses to pediatric urology inquiries, particularly regarding phimosis, hypospadias, acute scrotum, and vesicoureteral reflux. Another study demonstrated that ChatGPT-3 correctly answered 94.6% of questions related to urolithiasis [15]. In both studies, open-ended questions were used as input [9, 15]. Because these questions were posed by laypeople, they were likely formulated in general terms and may not reach the complexity and clinical reasoning required by EBU-ISA questions.

In a cross-sectional analysis by Musheyev et al. [16], the accuracy, understandability, and actionability of responses generated by four different LLMs were evaluated for the top five search queries related to prostate, bladder, kidney, and testicular cancers according to Google Trends. Their results indicate that while AI chatbots may provide accurate and high-quality responses, the use of complex medical terminology could be challenging for users without a medical background. Notably, informational quality was rated at 80%, with no reports of misinformation [16].

Compared with social media platforms such as TikTok, YouTube, and Instagram, LLMs appear to be a more reliable source of medical information [17]. These findings support the potential future integration of AI chatbots into patient education. However, ongoing monitoring and evaluation of these tools are essential to prevent misinformation, which is evidently present in the databases LLMs draw upon.

Recently, two studies have shifted focus to investigating ChatGPT’s ability to generate high-quality MCQs for medical examinations [18, 19]. Klang et al. [18] employed ChatGPT-4 to write a 210-MCQ examination based on an existing exam template. After review by specialist physicians who were blinded to the source of the questions, only one MCQ generated by ChatGPT-4 was judged incorrect, and 15% of questions required corrections. The authors concluded that GPT-4 can serve as a supplementary tool for generating MCQs for medical examinations; however, thorough scrutiny by specialist physicians remains essential.

Cheung et al. [19] compared 50 MCQs generated by ChatGPT with 50 MCQs drafted by two university professors with reference to two standard undergraduate medical textbooks. ChatGPT needed only 20 min, whereas the two professors required 211 min to create the 50 questions for a medical graduate exam. Blinded assessors found satisfactory performance of the AI-generated questions across different domains, although the AI questions scored lower in relevance than the human-created ones. No significant differences were observed in question quality in other areas or in total scores.

In the future, AI has the potential to save considerable time by offering personalized learning experiences adapted to individual styles and paces. Moreover, it can assist in navigating extensive medical literature, facilitate knowledge acquisition, and expedite the creation of study materials or educational presentations. This increased efficiency enables medical professionals to concentrate more on comprehending and applying knowledge rather than dedicating extensive time to information retrieval.

An investigation into the current landscape of LLM utilization among urologists examined the usage, opinions, and experiences of practitioners worldwide regarding ChatGPT [20]. In a web-based survey, 47.7% of the 456 participating urologists reported using ChatGPT or other LLMs in their academic practice, and 19.8% reported incorporating AI in their clinical practice. Notably, 62.2% expressed concerns about potential ethical issues when utilizing ChatGPT for scientific or academic writing, and 53% had encountered limitations in the application of ChatGPT within their academic practice. Commonly reported limitations included inaccuracies (44.7%), lack of specificity (42.4%), and response variability (26.5%) [20].

The World Health Organization (WHO) has outlined an ethical framework for using LLMs in healthcare and medicine, stating that maintaining human control over healthcare decisions and providers' authority, based on valid patient information, is crucial. Protecting data privacy and requiring informed consent are also essential, and public consultation and debate providing sufficient information should precede any AI system design or deployment [21]. If these ethical standards can be upheld, and if LLMs can be technically enhanced or trained to consistently demonstrate up-to-date and guideline-compliant medical knowledge, AI will play a crucial role in medicine and urology. Modern technology has the potential not only to support education but also to save time in clinical settings by managing bureaucratic tasks. LLMs excel at rapidly processing large amounts of data, making them valuable for summarizing and organizing patient information based on specific queries.

AI holds tremendous potential in fields like education and healthcare, but we are still in the early stages of this technological revolution. Developing AI, especially in sensitive areas like medicine, demands careful planning, rigorous testing, and continuous refinement. Addressing ethical considerations, data privacy, and system reliability is crucial. The journey ahead is exciting, with each step bringing us closer to realizing the full potential of AI.

This study has several limitations. First, the results should be considered within a temporal context, since LLMs are continuously evolving and expected to improve with better databases. Second, only three LLMs were tested, and newer or medically trained LLMs were not included. ChatGPT-3.5, ChatGPT-4, and Bing AI were selected because they were the most widely used LLMs at the conception of this study: ChatGPT was the pioneering and most widely adopted LLM in this domain, and Bing AI was chosen because of Microsoft's extensive reach. Other models, such as Google’s Med-PaLM 2, require substantial resources and were deemed unfeasible within the scope of this study. Additionally, the only urology-specific LLM, Uro_Chat, is temporarily unavailable. These three commonly accessible LLMs were therefore a logical choice for an initial exploration of medical question answering in this research.

Third, the questions were limited to the MCQ format with a single-best-answer design, and no other question formats were evaluated. Additionally, the 100 MCQs may not fully represent the knowledge of the three LLMs across urological subfields. Lastly, the assessed EA might be subject to bias because only two physicians were involved in the analysis, potentially introducing subjective individual variation.

Despite incorporating EA alongside FA, error rates on the EBU-ISA questions ranged from 16 to 35%, and the additional gains did not significantly enhance the LLMs’ overall accuracy. This demonstrates that ChatGPT-3.5, ChatGPT-4, and Bing AI exhibit suboptimal urology knowledge and unsatisfactory proficiency for educational purposes. Substantial variations in LLM performance exist across urological subtopics. Medicine-specific training of LLMs is necessary before these tools can be confidently recommended for acquiring urological knowledge and effectively integrated into medical education.

We extend our heartfelt gratitude to the Executive Committee of the European Board of Urology (EBU) for their support and for confidentially providing the written examination questions from the In-Service Assessment (ISA) in the year 2022.

Following an ethical consultation by the Ethics Committee of the University of Regensburg (Germany) and discussions with the EBU Board, it was determined that no ethical approval was required for this study. The research does not involve any examination of patients or animals.

The authors have nothing to disclose.

No funding was received for the preparation of this study.

M.M., L.K., K.E., and K.K.-R. developed the study conception and design. Material preparation, data collection, and analysis were performed by K.K.-R., L.K., K.E., and M.M. The first draft of the manuscript was written by L.K., K.E., K.K.-R., and M.M., and the final language and content revision of the manuscript was conducted by S.D.B.-M. and M.M. M.R. and M.B. commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Additional Information

Matthias May and Katharina Körner-Riffard shared first authorship.

Data are available upon request from Matthias May, with access subject to restrictions due to privacy laws in Germany.

References

1. Alzubaidi L, Zhang J, Humaidi AJ, Al-Dujaili A, Duan Y, Al-Shamma O, et al. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. J Big Data. 2021;8(1):53.
2. Ray PP. ChatGPT: a comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope. Internet Things Cyber-Physical Syst. 2023;3:121–54.
3. May M, Körner-Riffard K, Marszalek M, Eredics K. Would Uro_Chat, a newly developed generative artificial intelligence large language model, have successfully passed the In-Service Assessment Questions of the European Board of Urology in 2022? Eur Urol Oncol. 2024;7(1):155–6.
4. Kollitsch L, Eredics K, Marszalek M, Rauchenwald M, Brookman-May SD, Burger M, et al. How does Artificial Intelligence master urological board examinations? A comparative analysis of different Large Language Models' accuracy and reliability in the 2022 In-Service Assessment of the European Board of Urology. World J Urol. 2024;42(1):20.
5. Kung TH, Cheatham M, Medenilla A, Sillos C, De Leon L, Elepaño C, et al. Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models. PLOS Digit Health. 2023;2(2):e0000198.
6. Lewandowski M, Łukowicz P, Świetlik D, Barańska-Rybak W. ChatGPT-3.5 and ChatGPT-4 dermatological knowledge level based on the specialty certificate examination in dermatology. Clin Exp Dermatol. 2023:llad255.
7. Oh N, Choi G-S, Lee WY. ChatGPT goes to the operating room: evaluating GPT-4 performance and its potential in surgical education and training in the era of large language models. Ann Surg Treat Res. 2023;104(5):269–73.
8. Deebel NA, Terlecki R. ChatGPT performance on the American Urological Association Self-Assessment Study Program and the potential influence of artificial intelligence in urologic training. Urology. 2023;177:29–33.
9. Caglar U, Yildiz O, Meric A, Ayranci A, Gelmis M, Sarilar O, et al. Evaluating the performance of ChatGPT in answering questions related to pediatric urology. J Pediatr Urol. 2024;20(1):26.e1–5.
10. Deebel NA, Terlecki R. ChatGPT performance on the American Urological Association Self-Assessment Study Program and the potential influence of artificial intelligence in urologic training. Urology. 2023;177:29–33.
11. Huynh LM, Bonebrake BT, Schultis K, Quach A, Deibert CM. New artificial intelligence ChatGPT performs poorly on the 2022 self-assessment study program for urology. Urol Pract. 2023;10(4):409–15.
12. Antaki F, Touma S, Milad D, El-Khoury J, Duval R. Evaluating the performance of ChatGPT in ophthalmology: an analysis of its successes and shortcomings. Ophthalmol Sci. 2023;3(4):100324.
13. Friederichs H, Friederichs WJ, März M. ChatGPT in medical school: how successful is AI in progress testing? Med Educ Online. 2023;28(1):2220920.
14. Suchman K, Garg S, Trindade AJ. Chat generative pretrained transformer fails the multiple-choice American College of Gastroenterology self-assessment test. Am J Gastroenterol. 2023;118(12):2280–2.
15. Cakir H, Caglar U, Yildiz O, Meric A, Ayranci A, Ozgor F. Evaluating the performance of ChatGPT in answering questions related to urolithiasis. Int Urol Nephrol. 2024;56(1):17–21.
16. Musheyev D, Pan A, Loeb S, Kabarriti AE. How well do artificial intelligence chatbots respond to the top search queries about urological malignancies? Eur Urol. 2024;85(1):13–6.
17. Teoh JY-C, Cacciamani GE, Gomez Rivas J. Social media and misinformation in urology: what can be done? BJU Int. 2021;128(4):397.
18. Klang E, Portugez S, Gross R, et al. Advantages and pitfalls in utilizing artificial intelligence for crafting medical examinations: a medical education pilot study with GPT-4. BMC Med Educ. 2023;23(1):772.
19. Cheung BHH, Lau GKK, Wong GTC, Lee EYP, Kulkarni D, Seow CS, et al. ChatGPT versus human in generating medical graduate exam multiple choice questions: a multinational prospective study (Hong Kong S.A.R., Singapore, Ireland, and the United Kingdom). PLoS One. 2023;18(8):e0290691.
20. Eppler M, Ganjavi C, Ramacciotti LS, Piazza P, Rodler S, Checcucci E, et al. Awareness and use of ChatGPT and large language models: a prospective cross-sectional global survey in urology. Eur Urol. 2024;85(2):146–53.
21. Harrer S. Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine. EBioMedicine. 2023;90:104512.