Geekyrakshit
  • About Me
  • Projects
  • Courses/Seminars
  • WandB Reports
  • Posts

PoE-GAN: Generating Images from Multi-Modal Inputs

computervision
deeplearning
generative
2-minute-papers
PoE-GAN is a recent, fascinating paper where the authors generate images from multiple inputs like text, style, segmentation, and sketch. We dig into the architecture, the underlying math, and of course, generate some images along the way.
Author

GeekyRakshit

Published

February 25, 2022

Featured on Two Minute Papers