| Time | Session | Speaker |
|---|---|---|
| 13:30 – 13:40 | Opening and Welcome | Baining Guo, Microsoft Research Asia |
| 13:40 – 14:10 | Invited MSRA Research Talks (2×15 mins) |
Jiaolong Yang, Microsoft Research Asia
Li Zhang, Microsoft Research Asia |
| 14:10 – 15:40 | Idea Sparks Panel I (From Vision to Deployment: Scaling Multimodal Foundation Models for Real-World Impact) | Chong Luo, Microsoft Research Asia (Host) |
| The Research Journey: From Multimedia to Multimodal, then Multi-what? |
Di Hu, Renmin University of China
Collaborative researcher: Jianlong Fu, Microsoft Research Asia |
|
| Towards reasoning multimodal LLMs in low-resource scenarioes |
Guanhua Chen, Southern University of Science and Technology
Collaborative researcher: Dongdong Zhang, Microsoft Research Asia |
|
| Towards Generalizable Human-Level Multimodal Generalist |
Hao Fei, National University of Singapore
Collaborative researcher: Lei Cui, Microsoft Research Asia |
|
| When Do Multimodal Foundation Models Need 3D Capabilities |
Pengshuai Wang, Peking University
Collaborative researcher: Jiaolong Yang, Microsoft Research Asia |
|
| Toward Self-Supervised Large Feedforward Systems |
Tong Zhang, University of Chinese Academy of Sciences
Collaborative researcher: Baining Guo, Microsoft Research Asia |
|
| Fostering Digital Trust: Combating Untrustworthy Information with Multimodal AI |
Yupeng Li, Hong Kong Baptist University
Collaborative researcher: Fangzhao Wu, Microsoft Research Asia |
|
| Information foraging with multimodal LLMs: opportunities and challenges |
Ziang Xiao, Johns Hopkins University
Collaborative researcher: Xiaoyuan Yi, Microsoft Research Asia |
|
| Beyond Behavioral Alignment: Toward Neural-Level Alignment in Multimodal Foundation Models |
Ziyu Jia, Institute of Automation of Chinese Academy of Sciences
Collaborative researcher: Yansen Wang, Microsoft Research Asia |
|
| Discussion | All | |
| 15:40 – 16:00 | Group Photo and Tea Break | All |
| 16:00 – 17:30 | Idea Sparks Panel II (Understanding efficiency through the lens of intelligence) | Fan Yang, Microsoft Research Asia (Host) |
| Observations on the Evolving of LLM Intelligence and Efficiency | Yuqing Yang, Microsoft Research Asia | |
| Balancing Efficiency and Intelligence in Speech Enhancement: Insights from Recent Advances |
Chenda Li, Shanghai Jiao Tong University
Collaborative researcher:Shujie Liu, Microsoft Research Asia |
|
| Designing system Software for Wafer-Scale AI computing |
Luo Mai, University of Edinburgh (Online)
Collaborative researcher: Fan Yang, Microsoft Research Asia |
|
| Flexible Sensing for Physical Perception: Gaining Efficiency in Embodied AI |
Minhui Xie, Renmin University of China
Collaborative researcher: Ran Shu, Microsoft Research Asia Baotong Lu, Microsoft Research Asia |
|
| Tackling Data Redundancy in the Generative AI Era |
Yihao Chen, Tsinghua University
Collaborative researcher: Zilong Wang, Microsoft Research Asia Lili Qiu, Microsoft Research Asia |
|
| Rethinking the efficiency of generative models |
Zhenghao Chen, The University of Newcastle, Australia (Online)
Collaborative researcher: Bin Li, Microsoft Research Asia |
|
| Discussion | All | |
| 17:30 – 17:40 | Closing Remarks | Lily Sun, Microsoft Research Asia |
| 18:00 – 20:00 | Dinner + Question Box Interaction |
