I am currently an undergraduate student at Wuhan University, advised by Prof. Xue Yang. I have published papers at top-tier international CV conferences and journals such as CVPR and IJCV.

My research interests include Deep Learning and Computer Vision, with a focus on Generic/Oriented Object Detection and Vision-Language Models.

I am always open to academic collaboration—feel free to reach out to me at peiyuanzhangwhu@whu.edu.cn.

🔥 News

  • 2025.02:  🎉🎉 One paper related to OBB (Point2RBox-v2) is accepted by CVPR
  • 2025.05:  🎉🎉 One paper related to OBB (PointOBB-v3) is accepted by IJCV
  • 2025.05:  🎉🎉 One paper related to VLM (RISEBench) is now available on arXiv
  • 2025.07:  🎉🎉 One paper related to OBB (PWOOD) is now available on arXiv

📝 Publications

CVPR 2025
sym

Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances

Yi Yu, Botao Ren, Peiyuan Zhang, Mingxin Liu, Junwei Luo, Shaofeng Zhang, Feipeng Da, Junchi Yan, Xue Yang

Project

  • This work rethinks point-supervised oriented object detection with the layout among instances. At the core are three principles: 1) Gaussian overlap loss. 2) Voronoi watershed loss. 3) Consistency loss.
IJCV 2025
sym

Pointobb-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection

Peiyuan Zhang, Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Yue Zhou, Xiaosong Jia, Xudong Lu, Jingdong Chen, Xiang Li, Junchi Yan, Yansheng Li

Project

  • This work presents an extended conference version of PointOBB, which incorporates a novel Scale-Sensitive Feature Fusion (SSFF) module to improve the model’s capability of perceiving object scales, and further proposes an end-to-end optimized framework.
arxiv 2025
sym

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Xiangyu Zhao, Peiyuan Zhang, Kexian Tang, Xiaorong Zhu, Hao Li, Wenhao Chai, Zicheng Zhang, Renqiu Xia, Guangtao Zhai, Junchi Yan, Hua Yang, Xue Yang, Haodong Duan

Project

  • This paper proposes RISEBench, the first benchmark for reasoning-informed visual editing, covering four core reasoning tasks—Temporal, Causal, Spatial, and Logical—and introducing a comprehensive evaluation framework with three key dimensions: Instruction Reasoning, Appearance Consistency, and Visual Plausibility.
arxiv 2025
sym

Partial Weakly-Supervised Oriented Object Detection

Mingxin Liu, Peiyuan Zhang, Yuan Liu, Wei Zhang, Yue Zhou, Ning Liao, Ziyang Gong, Junwei Luo, Zhirui Wang, Yi Yu, Xue Yang

Project

  • This paper proposes PWOOD, a cost-effective framework for oriented object detection that uses partially weak and unlabeled data through orientation- and scale-aware learning, achieving competitive performance with much lower annotation cost.

🎖 Honors and Awards

  • 2025.06 China College Students Computer System Design Competition Second Prize

  • 2024.12 National College Students Computer System Capability Competition First Prize

  • 2024.09 National College Students Mathematical Modeling Contest Third Prize

  • 2024.05 Huazhong Cup Undergraduate Mathematical Modeling Challenge Second Prize

  • 2024.04 The 15th National College Students Mathematics Competition Third Prize

  • 2023, 2024 Second-class Scholarship of School of Computer Science, Wuhan University

  • 2023, 2024 Lei Jun Computer Innovation and Development Fund Recipient

📖 Educations

  • 2022.09 - now, Wuhan University, School of Computer Science.

💬 Invited Talks

-

💻 Internships

-