CLIP-Mesh
Text-to-3D Without 3D Training Data: How DreamFusion generates 3D images from text
Researchers struggle to build models that can generate a three-dimensional scene from a text prompt largely because they lack sufficient paired text-3D training examples. A new approach works without any 3D data whatsoever.