PoE-GAN: Generating Images from Multi-Modal Inputs
computervision
deeplearning
generative
2-minute-papers
PoE-GAN is a recent, fascinating paper where the authors generate images from multiple inputs like text, style, segmentation, and sketch. We dig into the architecture, the underlying math, and of course, generate some images along the way.