Vision-language model for visual question answering in medical imagery

HIGHLIGHTS

  • who: Yakoub Bazi et al. from the Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia have published the research: Vision-Language Model for Visual Question Answering in Medical Imagery, in the Journal: Bioengineering 2023, 10, x FOR PEER REVIEW of /2023/
  • what: This paper introduces an approach based on a transformer encoder-decoder architecture. In the experiments the authors validate the proposed model on two VQA datasets for radiology images termed VQA-RAD and PathVQA. The model shows promising results compared to existing solutions. The contrastive learning used . . .

     

    Logo ScioWire Beta black

    If you want to have access to all the content you need to log in!

    Thanks :)

    If you don't have an account, you can create one here.

     

Scroll to Top

Add A Knowledge Base Question !

+ = Verify Human or Spambot ?