Bootstrap Modal with Image and Text

Reading When Translating: Multi-Modal Document Image Machine Translation With Reading Flow Prediction

Abstract: Document Image Translation (DIT) aims to translate documents in images from one language to another. It is a multi-modal task that involves the cooperation of text, visual layout, and ...

IEEE

Incorporating Contextual Cues for Image Recognition: A Multi-Modal Semantic Fusion Model Sensitive to Key Information

Abstract: Multi-modal data feature fusion can effectively improve the accuracy of primary modal pattern recognition and address the issue of missing data through multi-modal collaboration. To some ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Reading When Translating: Multi-Modal Document Image Machine Translation With Reading Flow Prediction

Incorporating Contextual Cues for Image Recognition: A Multi-Modal Semantic Fusion Model Sensitive to Key Information

Trending now