| OpenHOI: Open-World Hand-Object Interaction Synthesis with Multimodal Large Language Model |
Zhenhao Zhang, Ye Shi†, Lingxiao Yang, Suting Ni, Qi Ye, Jingya Wang† |
NeurIPS 2025 Oral |
Paper |
| Generalizable Operating Room Expert with Multimodal Enchance |
Peiqi He*, Zhenhao Zhang*, Yixiang Zhang, Jiaxin Liu, Xiongjun Zhao†, Shaoliang Peng† |
Preprint |
Paper |
| Diffusion Models are Open-World Affordance Learners: Leveraging Generative Priors for 3D Affordance Learning |
Hanqing Wang*, Zhenhao Zhang*, Kaiyang Ji*, Mingyu Liu, Wenti Yin, Yuchao Chen, Zhirui Liu, Xiangyu Zeng, Tianxiang Gui, Hangxing Zhang, Jiahao Yuan, Zhiqing Cui, Jiaxin Liu, Zhiyuan Ma, Hui Xiong† |
Preprint |
Paper |
| HOID-R1: Reinforcement Learning for Open-World Human-Object Interaction Detection Reasoning with Multimodal Large Language Model |
Zhenhao Zhang*, Hanqing Wang*, Xiangyu Zeng*, Ziyu Cheng, Jiaxin Liu, Haoyu Yan, Zhirui Liu, Kaiyang Ji, Tianxiang Gui, Ke Hu, Kangyi Chen, Yahao Fan, Mokai Pan |
Preprint |
Paper |