
Visual Question Answering
Visual Question Answering (VQA) usually combines visual inputs like image and video with a natural language question concerning the input and generates a natural language answer as the output.
- Undertitel
- From Theory to Application
- Författare
- Qi Wu, Peng Wang, Xin Wang, Xiaodong He, Wenwu Zhu
- Upplaga
- 2022 ed.
- ISBN
- 9789811909665
- Språk
- Engelska
- Vikt
- 310 gram
- Utgivningsdatum
- 15.5.2023
- Sidor
- 238