Alright in Sign Language Instruction

LANA: A Language-Capable Navigator for Instruction Following and Generation

Abstract: Recently, visual-language navigation (VLN) - entailing robot agents to follow navigation instructions - has shown great advance. However, existing literature put most emphasis on ...

IEEE

Is ‘Right’ Right? Enhancing Object Orientation Understanding in Multimodal Large Language Models through Egocentric Instruction Tuning

Abstract: Multimodal large language models (MLLMs) act as essential interfaces, connecting humans with AI technologies in multimodal applications. However, current MLLMs face challenges in accurately ...
