Lumina-DiMOO: An open-source discrete multimodal diffusion model

2025-09-1211:45422synbol.github.io

Yi Xin1,2,5,♣ Qi Qin1,4,♣ Siqi Luo1,3 Kaiwen Zhu1,3 Juncheng Yan1,7 Yan Tai3 Jiayi Lei1,3 Yuewen Cao1 Yuandong Pu1,3 Dengyang Jiang1 Le Zhuo1,6 Shenglong Ye1 Ming Hu1

Yi Xin1,2,5,♣ Qi Qin1,4,♣ Siqi Luo1,3 Kaiwen Zhu1,3 Juncheng Yan1,7 Yan Tai3 Jiayi Lei1,3 Yuewen Cao1 Yuandong Pu1,3 Dengyang Jiang1 Le Zhuo1,6 Shenglong Ye1 Ming Hu1 Junjun He1 Bo Zhang1 Gen Luo1 Chang Xu4 Wenhai Wang1 Hongsheng Li1,6 Guangtao Zhai1,3 Tianfan Xue6,1
Bin Fu1,† Xiaohong Liu3,2,† Yu Qiao1,† Yihao Liu1,†

(♣ Equal Contributions, † Corresponding Authors)

1 Shanghai AI Laboratory   2 Shanghai Innovation Institute   3 Shanghai Jiao Tong University   4 The University of Sydney  
5 Nanjing University   6 The Chinese University of Hong Kong   7 Tsinghua University ‌

Technical Report (Coming Soon) Code Model


Read the original article

Comments

  • By turnsout 2025-09-1213:22

    This looks fantastic, and in the same vein as nano banana. I wonder if this inspires any startup ideas…

  • By randomNumber7 2025-09-1217:22

    Since there is no paper yet, can someone explain what fully discrete diffusion modeling means?

HackerNews