Visual programming language Visual Programming Tutorials

Zero-Shot Knowledge-Based Visual Question Answering with Frozen Language Models

Abstract: Knowledge-based Visual Question Answering (VQA) is a challenging task that requires models to access external knowledge for reasoning. Large Language Models (LLMs) have recently been ...

IEEE

SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling

Abstract: Open-world interpretation aims to accurately localize and recognize all objects within images by vision-language models (VLMs). While substantial progress has been made in this task for ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Zero-Shot Knowledge-Based Visual Question Answering with Frozen Language Models

SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling

Trending now