avatar

Yang Fei

Computer Science and Mathematics undergraduate @ HKUST

Yang Fei (费阳)

I am a senior undergraduate at The Hong Kong University of Science and Technology (HKUST), majoring in Computer Science and Mathematics. I also spent time as an exchange student at the University of Washington (UW).

I have been fortunate to conduct research under the supervision of Prof. Qifeng Chen at HKUST and Prof. Ranjay Krishna at UW. My research centers on Computer Vision, with a particular focus on enhancing motion modeling in generative video models. I aim to build motion-centric world models where temporal consistency is prioritized, pushing video generation toward a digital-twin-like reality.

I am actively seeking CS PhD positions for Fall 2026. If my research interests in generative video and motion modeling align with your lab, please feel free to reach out.

News

  • [2025.12] Our new preprint, "Structure from Tracking" (first author), is now available! See the project page.
  • [2025.06] VideoVAE+ (co-first author) has been accepted to ICCV 2025! Code and weights are available here.
  • [2025.04] Received the The Hong Kong, China - Asia-Pacific Scholarship!
  • [2025.01] Started an exchange program at the University of Washington (Seattle).

Research

Building on my experience improving motion architectures, from the tokenizer to the diffusion stage, I am focused on advancing motion modeling in video generation.
Interactive Causal Motion: I aim to transform video generation models into interactive simulators that support active intervention rather than mere observation, creating environments where agents can effectively plan and act.
Geometrically Consistent Motion: I am focused on training efficiency and geometric fidelity. Specifically, I investigate methods to address the sample inefficiency of learning 3D consistency solely from 2D statistics.
Physically Grounded Motion: My goal is to bridge the gap between visual plausibility and physical correctness. I explore how to enforce fundamental constraints to handle complex physical phenomena often underrepresented in training data.

Selected Publications

Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation

Yang Fei, George Stoica, Jingyuan Liu, Qifeng Chen†, Ranjay Krishna*, Xiaojuan Wang*, Benlin Liu*†

Under Review

TL;DR: We introduce an algorithm to distill structure-preserving motion priors from an autoregressive video tracking model (SAM2) into a bidirectional video diffusion model.

Large Motion Video Autoencoding with Cross-modal Video VAE

Yazhou Xing*, Yang Fei*, Yingqing He*†, Jingye Chen, Jiaxin Xie, Xiaowei Chi, Qifeng Chen†

International Conference on Computer Vision (ICCV), 2025

TL;DR: We propose a video autoencoder that achieves high-fidelity video encoding by combining temporal-aware spatial compression, lightweight temporal compression, and textual guidance.

View All Publications →

Education

  • The Hong Kong University of Science and Technology (HKUST)
    B.Sc. in Computer Science and Mathematics | Sep 2022 - Jun 2026 (Expected)
    GPA: 4.06 / 4.30 (Rank: Top 2 in CS & Math)
  • University of Washington
    Exchange Student, College of Engineering | Jan 2025 - Mar 2025
    GPA: 3.97 / 4.00

Selected Honors

  • HKSAR Government Scholarship (Top 1%, 2023 - 2026)
  • The Hong Kong, China - Asia-Pacific Scholarship (2025)
  • Tse Cheuk Ng Tai Scholarship (2025)
  • Dean’s List (All active semesters)