GeneralAI / InformaticsResearchTrainee

LLMs Ace Radiology MCQs: Gemini 2.5 Pro at 90%, Claude Most Reliable

Radiology AI literature (PubMed)3d ago

In a 100-question radiology MCQ test, Gemini 2.5 Pro achieved 90% accuracy, with Claude 4.5 Sonnet at 86%. All LLMs and 3rd-year residents outperformed juniors. No performance gap between Turkish and English (P=1.000). Claude showed best temporal reliability (κ=0.872).

Read the source

RadPigeon summaries are original and for information only. They are not clinical advice.