Multi-modal Intelligence Group

Publications

The Multi-modal Intelligence Group (MIG) focuses on Intelligent Visual Capturing Systems, Intelligent Visual Perception Systems, and Intelligent Embodied Aerial AI Systems, and publishes regularly in these areas.

Highlighted

DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing
Minghang Zhou, Tianyu Li, Chaofan Qiao, Dongyu Xie, Guoqing Wang, Ningjuan Ruan, Lin Mei, Yang Yang, Heng Tao Shen
IEEE Transactions on Geoscience and Remote Sensing  ·  01 Jan 2025  ·  doi:10.1109/TGRS.2025.3578309
Heterogeneous Experts and Hierarchical Perception for Underwater Salient Object Detection
Mingfeng Zha, Guoqing Wang, Yunqiang Pei, Tianyu Li, Xiongxin Tang, Chongyi Li, Yang Yang, Heng Tao Shen
IEEE Transactions on Image Processing  ·  01 Jan 2025  ·  doi:10.1109/TIP.2025.3572760

Selected

2025

Toward Generalized and Realistic Unpaired Image Dehazing via Region-Aware Physical Constraints
Kaihao Lin, Guoqing Wang, Tianyu Li, Yuhui Wu, Chongyi Li, Yang Yang, Heng Tao Shen
IEEE Transactions on Circuits and Systems for Video Technology  ·  01 Mar 2025  ·  doi:10.1109/TCSVT.2024.3497594
Knowledge-Guided Multi-Task Network for Remote Sensing Imagery
Meixuan Li, Guoqing Wang, Tianyu Li, Yang Yang, Wei Li, Xun Liu, Ying Liu
Remote Sensing  ·  31 Jan 2025  ·  doi:10.3390/rs17030496
Heterogeneous Experts and Hierarchical Perception for Underwater Salient Object Detection
Mingfeng Zha, Guoqing Wang, Yunqiang Pei, Tianyu Li, Xiongxin Tang, Chongyi Li, Yang Yang, Heng Tao Shen
IEEE Transactions on Image Processing  ·  01 Jan 2025  ·  doi:10.1109/TIP.2025.3572760
DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing
Minghang Zhou, Tianyu Li, Chaofan Qiao, Dongyu Xie, Guoqing Wang, Ningjuan Ruan, Lin Mei, Yang Yang, Heng Tao Shen
IEEE Transactions on Geoscience and Remote Sensing  ·  01 Jan 2025  ·  doi:10.1109/TGRS.2025.3578309

2024

Towards a Flexible Semantic Guided Model for Single Image Enhancement and Restoration
Yuhui Wu, Guoqing Wang, Shaochong Liu, Yang Yang, Wei Li, Xiongxin Tang, Shuhang Gu, Chongyi Li, Heng Tao Shen
IEEE Transactions on Pattern Analysis and Machine Intelligence  ·  01 Dec 2024  ·  doi:10.1109/TPAMI.2024.3432308
Dual Domain Perception and Progressive Refinement for Mirror Detection
Mingfeng Zha, Feiyang Fu, Yunqiang Pei, Guoqing Wang, Tianyu Li, Xiongxin Tang, Yang Yang, Heng Tao Shen
IEEE Transactions on Circuits and Systems for Video Technology  ·  01 Nov 2024  ·  doi:10.1109/tcsvt.2024.3426673
Diffusion Models as Optimizers for Efficient Planning in Offline RL
Renming Huang, Yunqiang Pei, Guoqing Wang, Yangming Zhang, Yang Yang, Peng Wang, Hengtao Shen
European Conference on Computer Vision  ·  29 Oct 2024  ·  doi:10.1007/978-3-031-72983-6_1
Region-Aware Distribution Contrast: A Novel Approach to Multi-task Partially Supervised Learning
Meixuan Li, Tianyu Li, Guoqing Wang, Peng Wang, Yang Yang, Jie Zou
European Conference on Computer Vision  ·  29 Oct 2024  ·  doi:10.1007/978-3-031-72983-6_14
JoReS-Diff: Joint Retinex and Semantic Priors in Diffusion Model for Low-light Image Enhancement
Yuhui Wu, Guoqing Wang, Zhiwen Wang, Yang Yang, Tianyu Li, Malu Zhang, Chongyi Li, Heng Tao Shen
Proceedings of the 32nd ACM International Conference on Multimedia  ·  28 Oct 2024  ·  doi:10.1145/3664647.3680876
Cascaded Adversarial Attack: Simultaneously Fooling Rain Removal and Semantic Segmentation Networks
Zhiwen Wang, Yuhui Wu, Zheng Wang, Jiwei Wei, Tianyu Li, Guoqing Wang, Yang Yang, Hengtao Shen
Proceedings of the 32nd ACM International Conference on Multimedia  ·  28 Oct 2024  ·  doi:10.1145/3664647.3681475
Generalizing ISP Model by Unsupervised Raw-to-raw Mapping
Dongyu Xie, Chaofan Qiao, Lanyue Liang, Zhiwen Wang, Tianyu Li, Qiao Liu, Chongyi Li, Guoqing Wang, Yang Yang
Proceedings of the 32nd ACM International Conference on Multimedia  ·  28 Oct 2024  ·  doi:10.1145/3664647.3681666
Emotion Recognition in HMDs: A Multi-task Approach Using Physiological Signals and Occluded Faces
Yunqiang Pei, Jialei Tang, Qihang Tang, Mingfeng Zha, Dongyu Xie, …, Zhitao Liu, Ning Xie, Peng Wang, Yang Yang, Hengtao Shen
Proceedings of the 32nd ACM International Conference on Multimedia  ·  28 Oct 2024  ·  doi:10.1145/3664647.3681365
Improving Interaction Comfort in Authoring Task in AR-HRI through Dynamic Dual-Layer Interaction Adjustment
Yunqiang Pei, Kaiyue Zhang, Hongrong Yang, Yong Tao, Qihang Tang, …, Zhitao Liu, Ning Xie, Peng Wang, Yang Yang, Hengtao Shen
Proceedings of the 32nd ACM International Conference on Multimedia  ·  28 Oct 2024  ·  doi:10.1145/3664647.3681364
ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference Understanding
Ziyang Lu, Yunqiang Pei, Guoqing Wang, Peiwei Li, Yang Yang, Yinjie Lei, Heng Tao Shen
Proceedings of the AAAI Conference on Artificial Intelligence  ·  24 Mar 2024  ·  doi:10.1609/aaai.v38i4.28186
Weakly-Supervised Mirror Detection via Scribble Annotations
Mingfeng Zha, Yunqiang Pei, Guoqing Wang, Tianyu Li, Yang Yang, Wenbin Qian, Heng Tao Shen
Proceedings of the AAAI Conference on Artificial Intelligence  ·  24 Mar 2024  ·  doi:10.1609/aaai.v38i7.28521
Toward Optimized AR-Based Human-Robot Interaction Ergonomics: Modeling and Predicting Interaction Comfort
Yunqiang Pei, Bowen Jiang, Kaiyue Zhang, Ziyang Lu, Mingfeng Zha, Guoqing Wang, Zhitao Liu, Ning Xie, Yang Yang, Hengtao Shen
2024 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW)  ·  16 Mar 2024  ·  doi:10.1109/vrw62533.2024.00195
Density-Aware Cloud Removal of Remote Sensing Imagery Using a Global–Local Fusion Transformer
Quan Rui, Shiyuan He, Tianyu Li, Guoqing Wang, Ningjuan Ruan, Lin Mei, Yang Yang, Heng Tao Shen
IEEE Transactions on Geoscience and Remote Sensing  ·  01 Jan 2024  ·  doi:10.1109/tgrs.2024.3477739

2023

Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement
Yuhui Wu, Chen Pan, Guoqing Wang, Yang Yang, Jiwei Wei, Chongyi Li, Heng Tao Shen
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)  ·  01 Jun 2023  ·  doi:10.1109/cvpr52729.2023.00166
Multimodal Apology: Using WebXR to Repair Trust with Virtual Companion
Yunqiang Pei, Renming Huang, Guoqing Wang, Yang Yang, Ning Xie, Heng Tao Shen
2023 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW)  ·  01 Mar 2023  ·  doi:10.1109/vrw58643.2023.00206
NAS-StegNet: Lightweight Image Steganography Networks via Neural Architecture Search
Zhixian Wang, Guoqing Wang, Yang Yang
European Conference on Computer Vision  ·  01 Jan 2023  ·  doi:10.1007/978-3-031-30111-7_20
Physics Guided Remote Sensing Image Synthesis Network for Ship Detection
Weichang Zhang, Rui Zhang, Guoqing Wang, Wei Li, Xun Liu, Yang Yang, Die Hu
IEEE Transactions on Geoscience and Remote Sensing  ·  01 Jan 2023  ·  doi:10.1109/tgrs.2023.3248106