About Me
I am a first-year master's student at the University of Science and Technology of China (USTC), advised by Prof. Feng Zhao. I received my B.E. degree from Communication University of China in 2025. My current research focuses on Vision Language Action models, and I also have a keen interest in music production. I welcome any opportunity to connect!
My research takes a three-part approach to human-like artificial intelligence: Precision Sensing (CV), Deep Reasoning (FC & LRM), and Decisive Action (VLA & UMM). My goal is to synthesize these components into a seamless cognitive loop. Beyond the technical challenges, I am deeply curious about the potential of this field and eager to discover where the ultimate limits of AI lie.
Research Interests
- Unified Multimodal Model 🔥🔥: Understanding vs. Generation
- Agent 🔥🔥: Function Calling & Deep Research
- Embodied AI: Vision Language Action
- Computer Vision: image generation, image editing
News
- [Aug. 2025] One paper on function calling was accepted to EMNLP 2025.
- [Jul. 2024] One paper on image editing was accepted to ACMMM 2024.
Awards
- [Jul. 2025] We won third prize in the RoboTwin Dual-Arm Collaboration Challenge at the 2nd MEIS Workshop (CVPR 2025 Workshop)!
Publications
-
arXiv
Wenxuan Huang, Yu Zeng, Qiuchen Wang, Zhen Fang, Shaosheng Cao, Zheng Chu, Qingyu Yin, Shuang Chen, Zhenfei Yin, Lin Chen, Zehui Chen, Yao Hu, Philip Torr, Feng Zhao, Wanli Ouyang
-
arXiv
Yu Zeng*, Wenxuan Huang*, Zhen Fang*, Shuang Chen, Yufan Shen, Yishuo Cai, Xiaoman Wang, Zhenfei Yin, Lin Chen, Zehui Chen, Shiting Huang, Yiming Zhao, Yao Hu, Philip Torr, Wanli Ouyang, Shaosheng Cao
-
arXiv
Zhen Fang* (Project Leader), Ruiyan Han*, XinYu Sun*, Yuchen Ma, Ziheng Wang, Yu Zeng, Zehui Chen, Lin Chen, Wenxuan Huang, Wei-Jie Xu, Yi Cao, Feng Zhao
-
arXiv
Zhen Fang*, Zhuoyang Liu*, Jiaming Liu, Hao Chen, Yu Zeng, Shiting Huang, Zehui Chen, Lin Chen, Shanghang Zhang, Feng Zhao
-
EMNLP
Shiting Huang*, Zhen Fang*, Zehui Chen, Siyu Yuan, Junjie Ye, Yu Zeng, Lin Chen, Qi Mao, Feng Zhao
The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP)
-
ACMMM
Qi Mao, Lan Chen, Yuchao Gu, Zhen Fang, and Mike Zheng Shou
ACM International Conference on Multimedia (ACMMM), 2024.
-
TMM
Yuanhang Li, Qi Mao, Lan Chen, Zhen Fang, Lei Tian, Xinyan Xiao, Libiao Jin, Hua Wu
IEEE Transactions on Multimedia (TMM)
Miscellaneous
Outside of my research, I am a creator at heart. I immerse myself in 📖 reading (with Wang Xiaobo as my favorite author), 🎵 music production (Lo-fi Hip-hop & EDM; click to listen to my portfolio 🎧), ✍️ creative writing, and ⚽ table football. I thrive on the sensation of bringing something new into existence. My journey also involves exploring 🎨 visual design, 💫 anime, 📷 photography, and 🎬 movie production. For me, life is all about perception and experience!