Abstract: Large-scale multimodal generative modeling has reached milestones in text-to-image and text-to-video generation.