Youliang Zhang
2025
BrainLoc: Brain Signal-Based Object Detection with Multi-modal Alignment
Jiaqi Duan
|
Xiaoda Yang
|
Kaixuan Luan
|
Hongshun Qiu
|
Weicai Yan
|
Xueyi Zhang
|
Youliang Zhang
|
Zhaoyang Li
|
Donglin Huang
|
JunYu Lu
|
Ziyue Jiang
|
Xifeng Yang
Findings of the Association for Computational Linguistics: EMNLP 2025
Object detection is a core challenge in computer vision. Traditional methods primarily rely on intermediate modalities such as text, speech, or visual cues to interpret user intent, leading to inefficient and potentially distorted expressions of intent. Brain signals, particularly fMRI signals, emerge as a novel modality that can directly reflect user intent, eliminating ambiguities introduced during modality conversion. However, brain signal-based object detection still faces challenges in accuracy and robustness. To address these challenges, we present BrainLoc, a lightweight object detection model guided by fMRI signals. First, we employ a multi-modal alignment strategy that enhances fMRI signal feature extraction by incorporating various modalities including images and text. Second, we propose a cross-domain fusion module that promotes interaction between fMRI features and category features, improving the representation of category information in fMRI signals. Extensive experiments demonstrate that BrainLoc achieves state-of-the-art performance in brain signal-based object detection tasks, showing significant advantages in both accuracy and convenience.
Search
Fix author
Co-authors
- Jiaqi Duan 1
- Donglin Huang 1
- Ziyue Jiang 1
- Zhaoyang Li 1
- Junyu Lu 1
- show all...