
Visual Question Answering
Visual Question Answering (VQA) takes a visual input, such as an image or a video, together with a natural language question about that input, and generates a natural language answer as the output.
- Subtitle: From Theory to Application
- Authors: Qi Wu, Peng Wang, Xin Wang, Xiaodong He, Wenwu Zhu
- Edition: 2022 ed.
- ISBN: 9789811909634
- Language: English
- Weight: 446 grams
- Publication date: 14.5.2022
- Number of pages: 238
