Generative 3D Object Classification

5 papers with code • 2 benchmarks • 2 datasets

Generative 3D object classification prompts a model to generate an object's type as text from its point cloud. This distinguishes it from discriminative classification, where a model assigns a class directly by comparing probabilities across candidate classes.
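To make the contrast concrete, here is a minimal sketch in Python. Everything in it is an illustrative assumption rather than the protocol of any listed paper: the label list, the stand-in model outputs, and the helper names `discriminative_classify` and `generative_classify` are hypothetical. Mapping the generated answer back to a label is done here by simple word matching; published evaluations may instead rely on a human or LLM judge.

```python
import re

# Hypothetical label set for illustration; real benchmarks define their own classes.
LABELS = ["airplane", "chair", "table", "lamp"]


def discriminative_classify(class_probs):
    """Closed-set decision: return the label with the highest probability."""
    return max(class_probs, key=class_probs.get)


def generative_classify(generated_text, labels=LABELS):
    """Map a free-form answer to a label by whole-word matching.

    Returns the label mentioned earliest in the text (singular or plural),
    or None if no known label appears.
    """
    text = generated_text.lower()
    best_label, best_pos = None, len(text)
    for label in labels:
        match = re.search(r"\b" + re.escape(label) + r"s?\b", text)
        if match and match.start() < best_pos:
            best_label, best_pos = label, match.start()
    return best_label


# Stand-in outputs; a real system would obtain these from a model given a point cloud.
probs = {"airplane": 0.05, "chair": 0.80, "table": 0.10, "lamp": 0.05}
answer = "This point cloud appears to be a wooden chair with four legs."

print(discriminative_classify(probs))
print(generative_classify(answer))
```

In the generative case the answer may name no known class at all, so the sketch returns `None` rather than forcing a label; how such cases are scored is a design choice of the evaluation.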

Benchmarks

These leaderboards are used to track progress in Generative 3D Object Classification.

Dataset	Best Model
Objaverse	MiniGPT-3D
ModelNet40	MiniGPT-3D

Libraries

Use these libraries to find Generative 3D Object Classification models and implementations.

qizekun/ShapeLLM

4 papers

Pointcept/GPT4Point

3 papers

266

Datasets

Most implemented papers

3D-LLM: Injecting the 3D World into Large Language Models

umass-foundation-model/3d-llm • NeurIPS 2023

Furthermore, experiments on our held-in datasets for 3D captioning, task composition, and 3D-assisted dialogue show that our model outperforms 2D VLMs.

Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following

ziyuguo99/point-bind_point-llm • 1 Sep 2023

We introduce Point-Bind, a 3D multi-modality model aligning point clouds with 2D image, language, audio, and video.

PointLLM: Empowering Large Language Models to Understand Point Clouds

openrobotlab/pointllm • 31 Aug 2023

The unprecedented advancements in Large Language Models (LLMs) have shown a profound impact on natural language processing but are yet to fully embrace the realm of 3D understanding.

ShapeLLM: Universal 3D Object Understanding for Embodied Interaction

qizekun/ShapeLLM • 27 Feb 2024

This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM) designed for embodied interaction, exploring a universal 3D object understanding with 3D point clouds and languages.

MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors

tangyuan96/minigpt-3d • 2 May 2024

Notably, MiniGPT-3D gains an 8.12 increase on GPT-4 evaluation score for the challenging object captioning task compared to ShapeLLM-13B, while the latter costs 160 total GPU-hours on 8 A800.

