Hi there! I am a third-year Ph.D. candidate at ShanghaiTech universityβs PLUS lab, under the mentorship of Prof. Xuming He. Previously, I got my B.Sc. degree in 2022 from ShanghaiTech University. I also had the wonderful opportunity to spend time as a research intern at Shanghai AI Lab, supervised by Dr. Peng Gao.
My research interest includes Multimodel Learning and Data-centric AI. In pursuit of this goal, my current research work involves multimodal large language model, image captioning and human object detection.
π₯ News
- 2025.01: Β ππ one papers accepted by ICLR 2025
- 2024.07: Β ππ one papers accepted by ECCV 2024
- 2024.05: Β ππ one papers accepted by ICML 2024
- 2024.02: I attended the AAAI24 conference onsite in Vancouver and gave a poster presentation.
- 2023.12: Β ππ one papers accepted by AAAI 2024
- 2023.06: I attended the CVPR23 conference onsite in Vancouver and gave a poster presentation.
- 2023.02: Β ππ one papers accepted by CVPR 2023
- 2022.11: Β ππ one papers accepted by AAAI 2023
π Publications
*Equal contribution
- Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation
Peng Gao*, Le Zhuo*, Dongyang Liu*, Ruoyi Du*, Xu Luo*, Longtian Qiu*, Yuhang Zhang, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xie, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, Tong He, Jingwen He, Junjun He, Yu Qiao, Hongsheng Li, ICLR 2025 - SPHINX: The joint mixing of weights, tasks, and visual embeddings for multi-modal large language models
Ziyi Lin*, Chris Liu*, Renrui Zhang*, Peng Gao*, Longtian Qiu*, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao, ECCV 2024 - SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Dongyang Liu*, Renrui Zhang*, Longtian Qiu*, Siyuan Huang*, Weifeng Lin*, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao, Peng Gao, ICML 2024 - Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Longtian Qiu*, Shan Ning*, Xuming He, AAAI 2024 - HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models
Shan Ning*, Longtian Qiu*, Xuming He, CVPR 2023 - Joint-mae: 2d-3d joint masked autoencoders for 3d point cloud pre-training
Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzhi Li, Pheng-Ann Heng, IJCAI 2023 - Calip: Zero-shot enhancement of clip with parameter-free attention
Ziyu Guo*, Renrui Zhang*,Longtian Qiu*, Xianzheng Ma, Xupeng Miao, Xuming He, Bin Cui, AAAI 2023
π Educations
- 2018.09 - 2022.06, B.E. in School of Information Science and Technology, ShanghaiTech University, Shanghai, China
- 2022.06 - now, Ph.D. in School of Information Science and Technology, ShanghaiTech University, Shanghai, China
π» Academic Service
- Reviewer of CVPR 2024~2025, ECCV 2024, NeurIPS 2024~2025, ICLR 2024~2025, ICCV 2025