Incorporating Semantic Metadata into Nonnegative Matrix Factorization Based Topic Modeling and its Application to Main Topic Extraction

Please use this identifier to cite or link to this item: http://paper.sci.ui.ac.id/jspui/handle/2808.28/195

Title:	Incorporating Semantic Metadata into Nonnegative Matrix Factorization Based Topic Modeling and its Application to Main Topic Extraction
Authors:	Murfi, Hendri
Keywords:	Topic Modeling Nonnegative Matrix Factorization Incorporating Metadata Nonnegative Least Squares Main Topic Extraction
Issue Date:	Sep-2014
Publisher:	AICIT (Advanced Institute of Convergence Information Technology, Republic of Korea)
Series/Report no.:	Volume 5;Issue 2
Abstract:	Topic modeling is a type of a statistical model that has been proven successful for tasks including discovering topics and their trends over time. In many applications, documents may be accompanied by metadata that is manually created by their authors to describe the semantic content of documents, e.g. titles and tags. A proper way of incorporating this metadata into topic modeling should improve its performance. In this paper, we examine the use of semantic metadata for nonnegative matrix factorization based topic modeling and its application to main topic extraction. Besides a simple onelevel learning hierarchy, we adapt a two-level learning hierarchy method for this task. Our experiments show that these methods improve interpretability or coherence scores of extracted topics. Moreover, the two-level learning hierarchy approach can achieve higher interpretability scores than the one-level learning hierarchy method.
URI:	http://paper.sci.ui.ac.id/jspui/handle/2808.28/195
ISSN:	2093-1964
Appears in Collections:	Journal Collection

Files in This Item:

File	Description	Size	Format
Hendri.IJIIP.2014.pdf		1,38 MB	Adobe PDF	View/Open Request a copy

DSpace JSPUI