1 results found

Haifang Mo, Zili Su, Huili Zhang, Meng Xia, Linhui Cheng, Qian Luo

Cross-modal image-text retrieval aims to precisely match visual content with natural language descriptions, a task pivotal in multimodal understanding. Despite advancements in feature extraction and a...

The Visual Computer 2026-04-21 rs-8805622
Cross-Modal Image-Text Retrieval Multi-Scale Feature Extraction Adaptive Similarity Fusion Semantic Complexity Dynamic Sparse Aggregation
Back to Top
Home
Browse
Submit
About
0.036057s