Bingliang Li

Hello there! I am Bingliang Li. If everything goes with the plan, I will start my PhD journey at the University of New South Wales in late 2025, co-supervised by these legends: Dr Huadong Mo, Dr Dong Gong (Joint), Professor Daoyi Dong (Secondary), Dr Yawen Chen (Secondary). Previously, I obtained my master’s degree from The Chinese University of Hong Kong, Shenzhen, supervised by Professor Ruimao Zhang👍, and a bachelor’s degree from Lanzhou University. I also worked as an Algorithm Engineer at Xiaomi AI Lab.

Currently, my research focuses on open-world multimodal perception and generation, including image, audio, video, and more. My long-term research goal is to build an interactive system for high-quality video generation and editing. You are welcome to contact me via Email! bing.liang.li[at]outlook[dot]com

News

  • Tri-Ergon is accepted to AAAI 2025, work done at vivo.
  • Two papers accepted to CVPR 2024!
  • One paper accepted to ACM MM 2023.

Publications

Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control
Bingliang Li, Fengyu Yang, Yuxin Mao, Qingwen Ye, Hongkai Chen, Yiran Zhong
AAAI Conference on Artificial Intelligence ( AAAI ), 2025

Open-World Human-Object Interaction Detection via Multi-modal Prompts
Jie Yang* , Bingliang Li*, Ailing Zeng, Lei Zhang, Ruimao Zhang
IEEE Conference on Computer Vision and Pattern Recognition ( CVPR ), 2024

FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions
Jiong Wang*, Fengyu Yang*, Bingliang Li, Wenbo Gou, Danqi Yan, Ailing Zeng, Yijun Gao, Junle Wang, Yanqing Jing, Ruimao Zhang
IEEE Conference on Computer Vision and Pattern Recognition ( CVPR ), 2024

Dance with You: The Diversity Controllable Dancer Generation via Diffusion Models
Siyue Yao, Mingjie Sun, Bingliang Li, Fengyu Yang, Junle Wang, Ruimao Zhang
ACM Multimedia ( ACM MM ), 2023

Experiences

vivoResearch Intern2023 - 2024
Xiaomi AI LabAlgorithm Engineer2024 - 2025

Academic Activity

Reviewer for Conferences:

  1. Neural Information Processing Systems (NeurIPS) – 2025
  2. ACM Multimedia (ACM MM) – 2023, 2024, 2025