Abstract: Solving visual question answering (VQA) task requires recognizing many diverse visual concepts as the answer. These visual concepts contain rich structural semantic meanings, e.g., some ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results