LLM-Learning

Foundation MLLM

Foundation Work

[ECCV 2024 BPC]PointLLM: Empowering Large Language Models to Understand Point Clouds Cited by 170

[ICLR2024 Spotlight]Uni3d: Exploring unified 3d representation at scale Cited by 105

[CVPR2024]SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities Cited by 235

[ECCV2024]ShapeLLM: Universal 3d object understanding for embodied interaction Cited by 62

[ARXIV24/09]LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities Cited by 31

Recent Works

[CVPR2025]LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

AffordanceLLM

3D AffordanceLLM

SeqAfford

MotionLLM

MotionGPT

MotionChain

HandLLM

SemGrasp

HOIGPT