Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data

Utkarsh Ojha
Krishna Kumar Singh
Cho-Jui Hsieh
Yong Jae Lee
[Paper]
[GitHub]
[Slides]



Abstract

We propose a novel unsupervised generative model that learns to disentangle object identity from other low-level aspects in class-imbalanced data. We first investigate the issues surrounding the assumptions about uniformity made by InfoGAN, and demonstrate its ineffectiveness to properly disentangle object identity in imbalanced data. Our key idea is to make the discovery of the discrete latent factor of variation invariant to identity-preserving transformations in real images, and use that as a signal to learn the appropriate latent distribution representing object identity. Experiments on both artificial (MNIST, 3D cars, 3D chairs, ShapeNet) and real-world (YouTube-Faces) imbalanced datasets demonstrate the effectiveness of our method in disentangling object identity as a latent factor of variation.


Model Architecture



Imbalanced MNIST



Imbalanced 3D Cars



Imbalanced 3D Chairs



Imbalanced ShapeNet



Imbalanced YouTube Faces



Paper and Supplementary Material

U. Ojha, K. K. Singh, C. Hsieh, Y. J. Lee.
Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data
In NeurIPS, 2020.
(hosted on ArXiv)


[Bibtex]


Acknowledgements

This template was originally made by Phillip Isola and Richard Zhang for a colorful ECCV project; the code can be found here.