I am a first-year master student at University of Science and Technology of China (USTC), advised by Prof. Feng Zhao. I got a B.E. degree at Communication University of China in 2025. My current research focus is on Vision Language Action models, and I also have a keen interest in music production. I welcome any opportunities for communication!
My research focuses on a tri-fold approach to human-like artificial intelligence: Precision Sensing (CV), Deep Reasoning (FC & LRM), and Decisive Action (VLA & UMM). My goal is to synthesize these components into a seamless cognitive loop, exploring the uncharted frontiers of machine intelligence. Beyond the technical challenges, I am profoundly curious about the true potential of this field and remain eager to discover where the ultimate limit of AI truly lies.
EMNLP
ACMMM
TMM
Outside of my research, I am a creator at heart. I immerse myself in π reading (with Wang Xiaobo as my favorite author), π΅ music production (Lo-fi Hip-hop & EDM, Click to listen to my portfolioπ§), βοΈ creative writing, and β½ table football. I thrive on the sensation of bringing something new into existence. My journey also involves exploring π¨ visual design, π« anime, π· photography, and π¬ movie production. For me, life is all about perception and experience!
Powered by Jekyll and Minimal Light theme.