Paper

Replicated Softmax: an Undirected Topic Model

We introduce a two-layer undirected graphical model, called a Replicated Softmax, that can be used to model and automatically extract low-dimensional latent semantic representations from a large unstructured collection of documents. We present efficient learning and inference algorithms for this model, and show how a Monte Carlo based method, Annealed Importance Sampling, can be used to produce an accurate estimate of the log-probability that the model assigns to test data. This allows us to demonstrate that the proposed model generalizes much better than Latent Dirichlet Allocation in terms of both the log-probability of held-out documents and retrieval accuracy.
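The abstract does not spell out the model's equations, so the following is only a minimal sketch of the two conditional distributions a Replicated Softmax is generally described as using. It assumes binary hidden units whose bias is scaled by the document length, and a single softmax over the vocabulary shared by every word position. The function names (`hidden_posteriors`, `word_distribution`) and the parameter layout are hypothetical, chosen for illustration rather than taken from the authors' code.

```python
import math

def hidden_posteriors(counts, W, a):
    """Return p(h_j = 1 | document) for each hidden unit j.

    counts: word-count vector of length K (vocabulary size)
    W:      F x K weight matrix as a list of lists (F hidden units)
    a:      hidden biases, length F
    Assumption: the hidden bias is multiplied by the document length D.
    """
    D = sum(counts)  # total number of words in the document
    probs = []
    for j in range(len(W)):
        activation = D * a[j] + sum(W[j][k] * counts[k] for k in range(len(counts)))
        probs.append(1.0 / (1.0 + math.exp(-activation)))
    return probs

def word_distribution(h, W, b):
    """Return the softmax over the vocabulary given binary hidden states h.

    b: visible (word) biases, length K
    The same distribution is shared by every word position in the document.
    """
    K = len(b)
    scores = [b[k] + sum(h[j] * W[j][k] for j in range(len(h))) for k in range(K)]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Toy usage with made-up parameters: 3 vocabulary words, 2 hidden units.
W = [[0.5, -0.2, 0.1], [0.0, 0.3, -0.4]]
p_hidden = hidden_posteriors([2, 0, 1], W, [0.0, -0.1])
p_words = word_distribution([1, 0], W, [0.0, 0.0, 0.0])
```

Alternating between these two conditionals gives a Gibbs sampler, which is the building block that contrastive-divergence style training and Annealed Importance Sampling would both rely on under these assumptions.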

Neural Information Processing Systems · Published 2009-12-07

Authors: Geoffrey E. Hinton · Ruslan Salakhutdinov
