The model operates on a compressed and quantized latent space. It is conditioned on CLIP embeddings and uses an improved sampling function over ...
確定! 回上一頁