Multi-modal Intelligence Group

Research

The Multi-modal Intelligence Group (MIG) focuses on three major research directions at the intersection of cutting-edge artificial intelligence, computer vision, and aerial systems. Each direction is organized as a dedicated sub-team with monthly rotating leadership.


👁‍🗨 Intelligent Visual Capturing Systems (FLY-IN)

Designing low-level vision algorithms that enable aerial vehicles to “see” in complex conditions such as night, glare, haze, cloud, and rain. The focus is on deployment on resource-constrained edge devices aboard drones.

Team Leads (rotating monthly): Yangming Zhang, Junyu Liu, Lanyue Liang, Chaofan Qiao
Team Members: Quan Rui, Dongyu Xie, Zhibin Wang, Jiening Zhang, Zhuoyao Fan, Jiayi Zhou, Zixiao Hu, Wen Bo, Huilin Zhang, Yangyang Feng

Representative Research Topics:


🧠 Intelligent Visual Perception Systems (FLY-FOR)

Developing vision models and multimodal large language models (MLLMs) to understand the world at the pixel, object, and scene levels. These models are deployed on UAVs for object recognition, scene reconstruction, and trend prediction.

Team Leads (rotating monthly): Yupeng Gao, Xi Wu, Pengwei Yang, Jun Zhang
Team Members: Minghang Zhou, Mingfeng Zha, Chenxi Lan, Keli Wang, Yuchen Wu, Yirui Xu, Haixia Li

Representative Research Topics:


🤖 Intelligent Embodied Aerial AI Systems (FLY-WITH)

Designing embodied vision-and-language navigation (VLN) and vision-language-action (VLA) algorithms that enable UAVs and multi-agent systems to interact, self-control, and self-organize autonomously.

Team Leads (rotating monthly): Yunqiang Pei, Kaiyue Zhang, Rongyu Du
Team Members: Hongkun Chen, Ruyu Ye, Mian Zhang

Representative Research Topics: