[ECCV 2024 BPC]PointLLM: Empowering Large Language Models to Understand Point Clouds Cited by 170
[ICLR2024 Spotlight]Uni3d: Exploring unified 3d representation at scale Cited by 105
[CVPR2024]SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities Cited by 235
[ECCV2024]ShapeLLM: Universal 3d object understanding for embodied interaction Cited by 62
[ARXIV24/09]LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities Cited by 31
[CVPR2025]LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
3D AffordanceLLM
SeqAfford
MotionGPT
MotionChain
SemGrasp
HOIGPT