trans 4dRecent advances in diffusion models have demonstrated exceptional capabilities in image and video generation, further improving the effectiveness of 4D synthesis. Existing 4D generationTo address these challenges, we propose a text-to-4D method Trans4D, which leverages multimodal large language models (MLLMs) for geometry-aware 4D scene planning,