Vision-language model for visual question answering in medical imagery

HIGHLIGHTS

who: Yakoub Bazi et al., from the Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia, published the paper "Vision-Language Model for Visual Question Answering in Medical Imagery" in the journal Bioengineering, 2023, 10.
what: This paper introduces an approach to medical visual question answering (VQA) based on a transformer encoder-decoder architecture. The authors validate the proposed model on two medical VQA datasets, VQA-RAD (radiology images) and PathVQA (pathology images), where it shows promising results compared with existing solutions. The contrastive learning used . . .
