I am currently a senior undergraduate student at the Zhi Class of Peking University (PKU). I am conducting research at BAIR, advised by Trevor Darrell and XuDong Wang. Previously, I worked as a research intern at the Wangxuan Institute of Computer Technology (WICT), advised by Prof. Yang Liu and collaborating closely with the wonderful seniors Sizhe Lee and Zhu Xu.
My research interests lie primarily in Vision-Language Models and Generative Models. At this stage of my academic journey, my guiding aspiration is to empower artificial intelligence to create works of epic significance.
My CV is here. If you have interesting ideas or questions, feel free to reach out! π§
π₯ News
- 2025.12: Β π’π’ Invited talk at BAAI.
- 2025.11: Β ππ Our CoVT is released.
- 2025.02: Β ππ Our HCoG is accepted by CVPR 2025!
- 2024.02: Β ππ Our Diff-BGM is accepted by CVPR 2024!
π Publications

Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens
Yiming Qin, Bomin Wei, Jiaxin Ge, Konstantinos Kallidromitis, Stephanie Fu, Trevor Darrell, XuDong Wang
[Project Page] Β [Arxiv] Β [PDF] Β [code]

Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation
Yiming Qin, Zhu Xu, Yang Liu
[Project Page] Β [Arxiv] Β [PDF] Β [code]
- IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025

Diff-BGM: A Diffusion Model for Video Background Music Generation
Sizhe Li, Yiming Qin, Minghang Zheng, Xin Jin, Yang Liu
[Project Page] Β [ArXiv] Β [PDF] Β [code]
- IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024
π Honors and Awards
- 2025 Yuanpei College Award for Research Excellence.
- 2024 Zhi Class Scholarship.
- 2023 Peking University Scholarship.
- 2022 Peking University Freshman Scholarship.
π Educations
2025.01 - Present
Research Intern
Research Advisor: XuDong Wang
Academic Advisor: Trevor Darrell
2022.09 - Present
Undergraduate Student
Research Advisor: Yang Liu
Academic Advisor: Baoquan Chen