Visual Object Recognition

Image Coding for Object Recognition Tasks Based on Contour Feature Learning With Flexible Object Selection

Abstract: The consumption of image data by machines is rapidly increasing due to the growing adoption of image recognition technologies. This trend has accelerated research in image compression ...

GitHub

InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition

InstructSAM is a training-free framework for Instruction-Oriented Object Counting, Detection, and Segmentation (InstructCDS). We construct EarthInstruct, an InstructCDS benchmark for remote sensing.

IEEE

An Energy-Efficient Block-Based Nonmaximum Suppression Engine for High-Parallel Postprocessing of Visual Object Detection

Abstract: Nowadays, visual object detection (VOD) is widely used in many AI applications, such as autonomous driving, intelligent robotics, and smart surveillance. As an essential postprocessing step ...

eLife

Human EEG and artificial neural networks reveal disentangled representations and processing timelines of object real-world size and depth in natural images

Neural and computational evidence reveals that real-world size is a temporally late, semantically grounded, and hierarchically stable dimension of object representation in both human brains and ...

Psychology Today

"Stop": How Visual Cues Trigger Automatic Reactions

Why does stopping at a red light become automatic? New neuroscience shows how the cerebellum turns visual cues into fast, ...

GitHub

R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning

Visual (Single) Object Tracking aims to continuously localize and estimate the scale of a target in subsequent video frames, given only its initial state in the first frame. This task can be ...
