Efficient Learning for Undirected Topic Models

Efficient Learning for Undirected Topic Models
Prof. Victor Li
October 5, 2022
Research

Replicated Softmax model, a well-known undirected topic model, is powerful in extracting semantic representations of documents. Traditional learning strategies such as Contrastive Divergence are very inefficient. This paper provides a novel estimator to speed up the learning based on Noise Contrastive Estimate, extended for documents of variant lengths and weighted inputs. Experiments on two benchmarks show that the new estimator achieves great learning efficiency and high accuracy on document retrieval and classification.

Efficient Learning for Undirected Topic Models

S.B. (1977), S.M. (1979), E.E. (1980), Sc.D. (1981), EECS, MIT. Chair Professor in Information Engineering, HKU. Fellow of IEEE, HKIE, IAE, and HK Academy of Engineering Sciences. Awardee of the Bronze Bauhinia Star, Government of HKSAR.