PoE-GAN: Generating Images from Multi-Modal Inputs

computervision

deeplearning

generative

2-minute-papers

PoE-GAN is a recent, fascinating paper in which the authors generate images from multiple input modalities such as text, style, segmentation, and sketch. We dig into the architecture and the underlying math, and of course generate some images along the way.

Author

GeekyRakshit

Published

February 25, 2022

Featured on Two Minute Papers