Scalable Diffusion Models with Transformers
abs: https://arxiv.org/abs/2212.09748
The largest DiT-XL/2 models outperform all prior diffusion models on the class-conditional ImageNet 512×512 and 256×256 benchmarks, achieving a state-of-the-art FID of 2.27 on the latter.