Paper
Quick Training of Probabilistic Neural Nets by Importance Sampling
Our previous work on statistical language modeling introduced the use of probabilistic feedforward neural networks to help dealing with the curse of dimensionality. Training this model by maximum likelihood however requires for each example to perform as many network passes as there are words in the vocabulary. Inspired by the contrastive divergence model, we propose and evaluate sampling-based methods which require network passes only for the observed "positive example" and a few sampled negative example words. A very significant speed-up is obtained with an adaptive importance sampling.
http://research.microsoft.com/conferences/AIStats2003/proceedings/164.psPublished 2003-01-03Paper link
Authors: Yoshua Bengio · Jean-Sébastien Senécal
Topics
Relevant entities
People
Related coverage
Linked coverage will appear here.
Related events
Linked events will appear here.
Related discussions
Related discussion nodes will appear here.