发布时间:2025-01-06
作者:光明实验室
浏览:6285次
李明 研究员
智绘空间团队负责人
学习经历:
2021-08 至 2024-05,新加坡国立大学,数据科学,博士
2015-09 至 2018-07,北京大学,电子工程,硕士
2011-08 至 2015-07,西安电子科技大学,电子工程,学士
工作经历:
2024-07 至今,光明实验室,研究员(天才新星)
2023-02 至 2023-10,Sea AI Lab,新加坡,研究实习生
2023-12 至2024-05,上海人工智能实验室,中国上海,研究实习生
2019-10 至 2021-04,伍斯特理工学院,研究助理(博士在读)
2018-09 至 2019-10,北卡罗来纳大学教堂山分校,研究助理
邮箱:liming@gml.ac.cn
研究领域:人工智能内容生成(文本到图像/3D/视频)、多模态大语言模型、具身智能
学术任职:
CCF会员
人工智能顶级期刊和会议如IEEE TPAMI, CVPR, NeurIPS, IEEE TMM, IEEE TNNLS, and Neurocomputing 审稿人
人工智能顶级会议AAAI Program Committee
荣誉头衔:
1. WACV 2026领域主席
2. 2025全球人工智能技术大会特邀嘉宾
3. 2024深圳智能机器人灵巧手大赛优胜奖
4. 新加坡国立大学博士生校长奖学金
5. 中国大学生高等数学竞赛一等奖
代表性成果:
1. EventGPT(CVPR 2025)首次用多模态大语言模型理解事件模态, 引起了学术界的广泛关注,特别是一些知名学者主动发来合作邀请。该工作还被美国知名 AI 平台 PromptLayer 专文报道,文章标题为《EventGPT:赋予 AI 超越人类的视觉能力》。
2. Instant3D(IJCV)提出了快速文生3D新方法,一经发布就在知名开源平台Hugging Face 上引起广泛关注,并收到华为、米哈游、高榕资本等投资机构的合作邀请。
3. 论文《Exploiting Multi-view Part-wise Correlation via an Efficient Transformer for Vehicle Re-Identification》( IEEE Transactions on Multimedia )多次入选 ESI 高被引论文。
4. 论文《Self-supervised Geometric Features Discovery with Interpretable Attention for Vehicle Re-Identification and Beyond》(ICCV 2021)提出的汽车重识别算法在英伟达主办的AI City Challenge 2020比赛的城市级汽车跟踪赛道上排名第一。
发表论文论著:
部分论文如下,完整列表请查看:https://ming1993li.github.io/
Yufei Shi†, Weilong Yan†, Gang Xu, Yumeng Li, Yucheng Chen, Zhenxi Li, Fei Richard Yu, Ming Li* and Si Yong Yeo. PVChat: Personalized Video Chat with One-Shot Learning. ICCV. 2025. (*Corresponding Authors)
Cui Miao, Tao Chang, Meihan Wu, Hongbin Xu, Chun Li, Ming Li*, Xiaodong Wang. FedVLA: Federated Vision-Language-Action Learning with Dual Gating Mixture-of-Experts for Robotic Manipulation. ICCV. 2025. (*Corresponding Authors)
Meihan Wu, Tao Chang, Miaocui, Jie Zhou, Chun Li, Xiangyu Xu, Ming Li*, and Xiaodong Wang. EFTViT: Efficient Federated Training of Vision Transformers with Masked Images on Resource-Constrained Edge Devices. ICCV. 2025. (*Corresponding Authors)
Songbai Tan, Yao Shu, Xuerui Qiu, Gang Xu, Linrui Xu, Xiangyu Xu, Huiping Zhuang, Ming Li* and Fei Richard Yu. WMarkGPT: Watermarked Image Understanding via Multimodal Large Language Models. ICML 2025. (*Corresponding Authors)
Gan Chen, Ying He, Mulin Yu, F.Richard Yu, Gang Xu, Fei Ma, Ming Li* and Guang Zhou. Inter3D: A Benchmark and Strong Baseline for Human-Interactive 3D Object Reconstruction. IJCAI 2025. (*Corresponding Authors)
Shaoyu liu, Jianing Li, Guanghui Zhao, Yunjian Zhang, Xin Meng, Fei Richard Yu, Xiangyang Ji, and Ming Li*. EventGPT: Event Stream Understanding with Multimodal Large Language Models. CVPR 2025. (*Corresponding Authors)
Zhicong Wu, Hongbin Xu, Gang Xu, Ping Nie, Zhixin Yan, Jinkai Zheng, Liangqiong Qu, Ming Li* and Liqiang Nie. TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting. ACM MM. 2025. (*Corresponding Authors)
Yan Zhang†, Ming Li†, Chun Li, Zhaoxia Liu, Ye Zhang and Fei Richard Yu. Uncertainty Quantification via Holder Divergence for Multi-View Representation Learning. IEEE TMM, 2025. (†Equal Contributors)
Ming Li, Pan Zhou, Jia-Wei Liu, Jussi Keppo, Min Lin, Shuicheng Yan and Xiangyu Xu. Instant3D: Instant Text-to-3D Generation. IJCV, 2024.
Ming Li, Huazhu Fu, Shengfeng He, Hehe Fan, Jun Liu, Jussi Keppo and Mike Zheng Shou. DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition. IEEE TMM, 2023.
Ming Li, Xiangyu Xu, Hehe Fan, Pan Zhou, Jun Liu, Jia-Wei Liu, Jiahe Li, Jussi Keppo, Mike Zheng Shou, and Shuicheng Yan. STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition. ICCV 2023.
Ming Li, Xinming Huang, Ziming Zhang. Self-supervised Geometric Features Discovery via Interpretable Attention for Vehicle Re-Identification and Beyond. ICCV 2021.
Ming Li, Jun Liu, Ce Zheng, Xinming Huang, Ziming Zhang. Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification. IEEE TMM, 2021.