Publications
Multi-modal Intelligence Group (MIG) focuses on Intelligent Visual Capturing Systems, Intelligent Visual Perception Systems, and Intelligent Embodied Aerial AI Systems and it has continuously published papers in these areas.
Highlighted

DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing
·
01 Jan 2025
·
doi:10.1109/TGRS.2025.3578309

Heterogeneous Experts and Hierarchical Perception for Underwater Salient Object Detection
IEEE Transactions on Image Processing
·
01 Jan 2025
·
doi:10.1109/TIP.2025.3572760
Selected
2025
Toward Generalized and Realistic Unpaired Image Dehazing via Region-Aware Physical Constraints
IEEE Transactions on Circuits and Systems for Video Technology
·
01 Mar 2025
·
doi:10.1109/TCSVT.2024.3497594
Knowledge-Guided Multi-Task Network for Remote Sensing Imagery
Remote Sensing
·
31 Jan 2025
·
10.3390/rs17030496
Heterogeneous Experts and Hierarchical Perception for Underwater Salient Object Detection
IEEE Transactions on Image Processing
·
01 Jan 2025
·
doi:10.1109/TIP.2025.3572760
DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing
·
01 Jan 2025
·
doi:10.1109/TGRS.2025.3578309
2024
Towards a Flexible Semantic Guided Model for Single Image Enhancement and Restoration
IEEE Transactions on Pattern Analysis and Machine Intelligence
·
01 Dec 2024
·
doi:10.1109/TPAMI.2024.3432308
Dual Domain Perception and Progressive Refinement for Mirror Detection
IEEE Transactions on Circuits and Systems for Video Technology
·
01 Nov 2024
·
doi:10.1109/tcsvt.2024.3426673
Diffusion Models as Optimizers for Efficient Planning in Offline RL
European Conference on Computer Vision
·
29 Oct 2024
·
doi:10.1007/978-3-031-72983-6_1
Region-Aware Distribution Contrast: A Novel Approach to Multi-task Partially Supervised Learning
European Conference on Computer Vision
·
29 Oct 2024
·
doi:10.1007/978-3-031-72983-6_14
JoReS-Diff: Joint Retinex and Semantic Priors in Diffusion Model for Low-light Image Enhancement
Proceedings of the 32nd ACM International Conference on Multimedia
·
28 Oct 2024
·
doi:10.1145/3664647.3680876
Cascaded Adversarial Attack: Simultaneously Fooling Rain Removal and Semantic Segmentation Networks
Proceedings of the 32nd ACM International Conference on Multimedia
·
28 Oct 2024
·
doi:10.1145/3664647.3681475
Generalizing ISP Model by Unsupervised Raw-to-raw Mapping
Proceedings of the 32nd ACM International Conference on Multimedia
·
28 Oct 2024
·
doi:10.1145/3664647.3681666
Emotion Recognition in HMDs: A Multi-task Approach Using Physiological Signals and Occluded Faces
Proceedings of the 32nd ACM International Conference on Multimedia
·
28 Oct 2024
·
doi:10.1145/3664647.3681365
Improving Interaction Comfort in Authoring Task in AR-HRI through Dynamic Dual-Layer Interaction Adjustment
Proceedings of the 32nd ACM International Conference on Multimedia
·
28 Oct 2024
·
doi:10.1145/3664647.3681364
ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference Understanding
Proceedings of the AAAI Conference on Artificial Intelligence
·
24 Mar 2024
·
doi:10.1609/aaai.v38i4.28186
Weakly-Supervised Mirror Detection via Scribble Annotations
Proceedings of the AAAI Conference on Artificial Intelligence
·
24 Mar 2024
·
doi:10.1609/aaai.v38i7.28521
Toward Optimized AR-Based Human-Robot Interaction Ergonomics: Modeling and Predicting Interaction Comfort
2024 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW)
·
16 Mar 2024
·
doi:10.1109/vrw62533.2024.00195
Density-Aware Cloud Removal of Remote Sensing Imagery Using a Global–Local Fusion Transformer
IEEE Transactions on Geoscience and Remote Sensing
·
01 Jan 2024
·
doi:10.1109/tgrs.2024.3477739
2023
Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
·
01 Jun 2023
·
doi:10.1109/cvpr52729.2023.00166
Multimodal Apology: Using WebXR to Repair Trust with Virtual Companion
2023 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW)
·
01 Mar 2023
·
doi:10.1109/vrw58643.2023.00206
NAS-StegNet: Lightweight Image Steganography Networks via Neural Architecture Search
European Conference on Computer Vision
·
01 Jan 2023
·
doi:10.1007/978-3-031-30111-7_20
Physics Guided Remote Sensing Image Synthesis Network for Ship Detection
IEEE Transactions on Geoscience and Remote Sensing
·
01 Jan 2023
·
doi:10.1109/tgrs.2023.3248106