Autoencoders, Unsupervised Learning, and Deep
Architectures
P. Baldi; JMLR W&CP 27:37–49,
2012.
Abstract
Autoencoders play a fundamental role in unsupervised learning and in
deep
architectures for transfer learning and other tasks. In spite of their
fundamental role, only linear
autoencoders over the real numbers have been solved analytically. Here
we present a general
mathematical framework for the study of both linear and non-linear
autoencoders.
The framework allows one to derive an analytical treatment for the most
non-linear
autoencoder, the Boolean autoencoder. Learning in the Boolean
autoencoder is equivalent
to a clustering problem that can be solved in polynomial time when the
number of
clusters is small and becomes NP complete when the number of clusters
is large. The
framework sheds light on the different kinds of autoencoders, their
learning complexity,
their horizontal and vertical composability in deep architectures,
their critical points,
and their fundamental connections to clustering, Hebbian learning, and
information
theory.