Ruoyu Feng

Researcher

ByteDance Douyin Content Group

Research Interests

Diffusion Models

Image/Video Generation

AIGC

About

I am Ruoyu Feng (冯若愚), a Researcher at ByteDance Douyin Content Group. I received my Ph.D. from the MOE-Microsoft Key Laboratory of Multimedia Computing and Communication, University of Science and Technology of China (USTC), in June 2025, supervised by Zhibo Chen.

Before that, I spent my undergraduate years in the Automation Department of Southeast University, from 2016 to 2020, and received the National Scholarship in 2019.

I was a research intern in the Intelligent Multimedia Group at Microsoft Research Asia (MSRA) from March 2023 to September 2024, under the supervision of Chong Luo.

My research interests mainly focus on diffusion models, image/video generation, and AIGC.

Work Experience

Researcher | ByteDance

Douyin Content Group

Jun. 2025 - Present

Research Intern | Microsoft Research Asia (MSRA)

Intelligent Multimedia Group

Mar. 2023 - Sep. 2024

Education

Ph.D. | University of Science and Technology of China (USTC)

Information and Communication Engineering

Sep. 2020 - Jun. 2025

B.E. | Southeast University (SEU)

Automation

Sep. 2016 - Jun. 2020

News

2026

SFD accepted by CVPR 2026

2025-09

Diff-ICMH accepted by NeurIPS 2025

2024-02

CCEdit accepted by CVPR 2024

2024-02

MicroCinema accepted by CVPR 2024 as Highlight

2023-07

GIT-SSIC accepted by ICCV 2023

2022-07

Omni-ICM accepted by ECCV 2022

Selected Publications

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Yueming Pan^†, Ruoyu Feng^†, Qi Dai, Yuqi Wang, Wenfeng Lin, Mingyu Guo, Chong Luo, Nanning Zheng

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2026)

Harmonizing semantic and texture modeling with asynchronous latent diffusion, accepted by CVPR 2026.

Paper Project Code

Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior

Ruoyu Feng, Yunpeng Qi, Jinming Liu, Yixin Gao, Xin Li, Xin Jin, Zhibo Chen

The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS) (2025)

Harmonizing machine and human vision in image compression using generative priors from diffusion models.

Paper Code

HomoGen: Enhanced Video Inpainting via Homography Propagation and Diffusion

Ding Ding, Yueming Pan, Ruoyu Feng, Qi Dai, Kai Qiu, Jianmin Bao, Chong Luo, Zhenzhong Chen

Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) (2025)

Enhanced video inpainting method using homography propagation and diffusion models.

Paper

CCEdit: Creative and Controllable Video Editing via Diffusion Models

Ruoyu Feng, Wenming Weng, Yanhui Wang, Yuhui Yuan, Jianmin Bao, Chong Luo, Zhibo Chen, Baining Guo

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024)

Creative and controllable video editing framework using diffusion models.

Paper Project Code

Semantically Structured Image Compression via Irregular Group-Based Decoupling

Ruoyu Feng^†, Yixin Gao^†, Xin Jin, Runsen Feng, Zhibo Chen

International Conference on Computer Vision (ICCV) (2023)

Novel image compression approach using semantic structure and irregular group-based decoupling.

Paper Code

Friends

Tiankai Hang

Jinming Liu

Zongyu Guo

Ziqi Yin