Deep Learning Class: Recurrent Neural Networks + AISTATS 2010

Monday, September 22, 2014

Graves, Mohamed and Hinton. Speech recognition with Deep Recurrent Neural Networks. [http://arxiv.org/abs/1303.5778]

Erhan, Courville, Bengio, Vincent. Why does unsupervised pre-training help deep learning?. AISTATS 2010 [PDF]

LerrelSeptember 30, 2014 at 11:12 PM
How exactly is unsupervised pre-training done. I am unable to understand how a unsupervised pre train could possibly help a different supervision task.

From what I deduce from the paper is that the pre training is done only for the first few layers. So my question is what is exactly is the structure of the unsupervised network used in pre training. Is it a sparse auto encoder; that would make sense given that the first few layers anyway encode low level features.
ReplyDelete
Replies
LerrelOctober 7, 2014 at 1:02 PM
Hi,
Could someone guide me to other regularization methods used in deep networks,
ReplyDelete
Replies