Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data

Utkarsh Ojha

Krishna Kumar Singh

Cho-Jui Hsieh

Yong Jae Lee

[Paper]

[GitHub]

[Slides]

Abstract

We propose a novel unsupervised generative model that learns to disentangle object identity from other low-level aspects in class-imbalanced data. We first investigate the issues surrounding the assumptions about uniformity made by InfoGAN, and demonstrate its ineffectiveness to properly disentangle object identity in imbalanced data. Our key idea is to make the discovery of the discrete latent factor of variation invariant to identity-preserving transformations in real images, and use that as a signal to learn the appropriate latent distribution representing object identity. Experiments on both artificial (MNIST, 3D cars, 3D chairs, ShapeNet) and real-world (YouTube-Faces) imbalanced datasets demonstrate the effectiveness of our method in disentangling object identity as a latent factor of variation.

Model Architecture

Imbalanced MNIST

Imbalanced 3D Cars

Imbalanced 3D Chairs

Imbalanced ShapeNet

Imbalanced YouTube Faces

Paper and Supplementary Material

U. Ojha, K. K. Singh, C. Hsieh, Y. J. Lee.
Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data
In NeurIPS, 2020.
(hosted on ArXiv)

[Bibtex]

Acknowledgements

This template was originally made by Phillip Isola and Richard Zhang for a colorful ECCV project; the code can be found here.