GeneralAI / InformaticsResearch

Generative Vision-Language Models Are Reshaping Medical Visual Question Answering, Review Finds

Radiology AI literature (PubMed)3w ago

A systematic review of 27 studies shows Med-VQA has shifted from text-only classifiers to generative vision-language models. Frameworks using chain-of-thought reasoning and retrieval-augmented generation improve interpretability, but lack standardized clinical benchmarks and rea…

Systematic review following PRISMA guidelines, covering 27 representative Med-VQA studies from 2023–2026.
Identifying the shift toward generative models using chain-of-thought, multi-agent frameworks, and retrieval-augmented generation (RAG) for free-form answering.
Major limitations include a lack of standardized evaluation, limited multi-view and multi-lingual capability, higher computational cost, and no real-world clinical validation.

Read the source

RadPigeon summaries are original and for information only. They are not clinical advice.