Top-level heading

Sparse dictionary learning and deep learning in practice and theory

Categoria
Seminari di Probabilità
Data e ora inizio evento
Data e ora fine evento
Aula
Sala di Consiglio
Sede

Dipartimento di Matematica, Sapienza Università di Roma

Speaker
Bin Yu (Statistics, EECS, Center for Computational Biology, and Simons Institute UC Berkeley)
Sparse dictionary learning has a long history and produces wavelet-like filters when fed with natural image patches, corresponding to the V1 primary visual cortex of the human brain. Wavelets as local Fourier Transforms are interpretable in physical sciences and beyond. In this talk, we will first describe adaptive wavelet distillation (AWD) to turn black-box deep learning models interpretable in cosmology and cellular biology problems while improving predictive performance. Then we present theoretical results that, under simple sparse dictionary models, gradient descent in auto-encoder fitting converges to one point on a manifold of global minima, and which minimum depends on the batch size. In particular, we show that when using a small batch-size as in stochastic gradient descent (SGD) a qualitatively different type of âfeature selectionâ occurs.