HIGHLIGHTS
- who: Yakoub Bazi et al., from the Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia, published the paper "Vision-Language Model for Visual Question Answering in Medical Imagery" in the journal Bioengineering 2023, 10.
- what: The paper introduces an approach based on a transformer encoder-decoder architecture. The authors validate the proposed model on two VQA datasets for medical images, VQA-RAD and PathVQA, where it shows promising results compared to existing solutions. The contrastive learning used . . .