
ChatGPT-4o Beats Clinicians on Critical Care Board Questions — but One-Third of Answers Carry Harm Risk

Radiology education & curriculum (PubMed), 1w ago

ChatGPT-4o scored 74.9% on 183 multimodal critical care board questions vs. 71.1% for pooled clinicians (p=0.03), but 33.3% of its responses were flagged for potential clinical harm, driven largely by poor image interpretation (61.7% correct) and flawed reasoning.


RadPigeon summaries are original and for information only. They are not clinical advice.