HIGHLIGHTS
- who: Siyu Lu from the School of Automation, University of Electronic Science and Technology of China, Chengdu, China have published the research: Multiscale Feature Extraction and Fusion of Image and Text in VQA, in the Journal: (JOURNAL)
- what: The research shows that the shallow image feature has a small receptive field, high resolution, and rich location information, which is beneficial for detecting small objects. This paper introduces the multiscale feature technology into the VQA system, and the improved image multiscale feature extraction method is introduced. The research shows that the advantage of this method is . . .
If you want to have access to all the content you need to log in!
Thanks :)
If you don't have an account, you can create one here.