Papers

I would have written a shorter letter, but I did not have the time.

—Blaise Pascal

  • X-Dancer: Expressive music to human dance video generation
    Zeyuan Chen, Hongyi Xu, Guoxian Song, You Xie, Chenxu Zhang, Xin Chen, Chao Wang, Di Chang, Linjie Luo.
    In ICCV 2025 (Highlight).
    [PDF],[Code],[Project page]
  • DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion
    Qingcheng Zhao*, Xiang Zhang*, Haiyang Xu, Zeyuan Chen, Jianwen Xie, Yuan Gao, Zhuowen Tu.
    In ICCV 2025.
    [PDF],[Code],[Project page]
  • YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
    Guanning Zeng, Xiang Zhang, Zirui Wang, Haiyang Xu, Zeyuan Chen, Bingnan Li, Zhuowen Tu.
    In ICCV 2025.
    [PDF],[Code],[Project page]
  • X-dyna: Expressive dynamic human image animation
    Di Chang, Hongyi Xu, You Xie, Yipeng Gao, Zhengfei Kuang, Shengqu Cai, Chenxu Zhang, Guoxian Song, Chao Wang, Yichun Shi, Zeyuan Chen, Shijie Zhou, Linjie Luo, Gordon Wetzstein, Mohammad Soleymani.
    In CVPR 2025 (Highlight).
    [PDF],[Code],[Project page]
  • Dolfin: Diffusion Layout Transformers without Autoencoder
    Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhizhou Sha, Zhuowen Tu.
    In ECCV 2024.
    [PDF],[Code],[Project page]
  • Bayesian Diffusion Models for 3D Shape Reconstruction
    Haiyang Xu*, Yu Lei*, Zeyuan Chen, Xiang Zhang, Yue Zhao, Yilin Wang, Zhuowen Tu.
    In CVPR 2024.
    [PDF],[Code],[Project page]
  • Bliva: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
    Wenbo Hu*, Yifan Xu*, Yi Li, Weiyue Li, Zeyuan Chen, Zhuowen Tu.
    In AAAI 2024.
    [PDF],[Code],[Project page]
  • Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction
    Xiang Zhang*, Zeyuan Chen*, Fangyin Wei, Zhuowen Tu.
    In ICCV 2023.
    [PDF],[Code],[Project page]
  • CASA: Category-agnostic Skeletal Animal Reconstruction
    Yuefan Wu*, Zeyuan Chen*, Shaowei Liu, Zhongzheng Ren, Shenlong Wang.
    In Neurips 2022.
    [PDF],[Code],[Project page]
  • VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
    Zeyuan Chen, Yinbo Chen, Jingwen Liu, Xingqian Xu, Vidit Goel, Zhangyang Wang, Humphrey Shi, Xiaolong Wang.
    In CVPR 2022.
    [PDF],[Code],[Project page]
  • PSD: Principled Synthetic-to-Real Dehazing Guided by Physical Priors
    Zeyuan Chen, Yangchao Wang, Yang Yang, Dong Liu.
    In CVPR 2021 (Oral).
    [PDF],[Code],[Project page]
  • CERL: A Unified Optimization Framework for Light Enhancement with Realistic Noise
    Zeyuan Chen, Yifan Jiang, Dong Liu, Zhangyang Wang.
    In IEEE Transactions on Image Processing (TIP).
    [PDF],[Code],[Project page]