About
This is Ruoyu Feng (冯若愚), a Researcher at ByteDance Douyin Content Group. I received my Ph.D. degree from MOE-Microsoft Key Laboratory of Multimedia Computing and Communication, University of Science and Technology of China (USTC) in June 2025, supervised by Zhibo Chen.
Before that, I spent my undergraduate years in the Automation Department of Southeast University, from 2016 to 2020, and received the National Scholarship in 2019.
I was a research intern at Intelligent Multimedia Group of MSRA from March 2023 to September 2024 under the supervision of Chong Luo.
My research interests mainly focus on diffusion models, image/video generation, and AIGC.
Work Experience
Researcher | ByteDance
Douyin Content Group
Jun. 2025 - Present
Research Intern | Microsoft Research Asia (MSRA)
Intelligent Multimedia Group
Mar. 2023 - Sep. 2024
Education
Ph.D. | University of Science and Technology of China (USTC)
Information and Communication Engineering
Sep. 2020 - Jun. 2025
B.E. | Southeast University (SEU)
Automation
Sep. 2016 - Jun. 2020
News
Selected Publications
View All →Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion
Yueming Pan†, Ruoyu Feng†, Qi Dai, Yuqi Wang, Wenfeng Lin, Mingyu Guo, Chong Luo, Nanning Zheng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2026)
Harmonizing semantic and texture modeling with asynchronous latent diffusion, accepted by CVPR 2026.
Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior
Ruoyu Feng, Yunpeng Qi, Jinming Liu, Yixin Gao, Xin Li, Xin Jin, Zhibo Chen
The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS) (2025)
Harmonizing machine and human vision in image compression using generative priors from diffusion models.
HomoGen: Enhanced Video Inpainting via Homography Propagation and Diffusion
Ding Ding, Yueming Pan, Ruoyu Feng, Qi Dai, Kai Qiu, Jianmin Bao, Chong Luo, Zhenzhong Chen
Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) (2025)
Enhanced video inpainting method using homography propagation and diffusion models.
CCEdit: Creative and Controllable Video Editing via Diffusion Models
Ruoyu Feng, Wenming Weng, Yanhui Wang, Yuhui Yuan, Jianmin Bao, Chong Luo, Zhibo Chen, Baining Guo
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024)
Creative and controllable video editing framework using diffusion models.
