Please use this identifier to cite or link to this item:
http://paper.sci.ui.ac.id/jspui/handle/2808.28/195
Title: | Incorporating Semantic Metadata into Nonnegative Matrix Factorization Based Topic Modeling and its Application to Main Topic Extraction |
Authors: | Murfi, Hendri |
Keywords: | Topic Modeling Nonnegative Matrix Factorization Incorporating Metadata Nonnegative Least Squares Main Topic Extraction |
Issue Date: | Sep-2014 |
Publisher: | AICIT (Advanced Institute of Convergence Information Technology, Republic of Korea) |
Series/Report no.: | Volume 5;Issue 2 |
Abstract: | Topic modeling is a type of a statistical model that has been proven successful for tasks including discovering topics and their trends over time. In many applications, documents may be accompanied by metadata that is manually created by their authors to describe the semantic content of documents, e.g. titles and tags. A proper way of incorporating this metadata into topic modeling should improve its performance. In this paper, we examine the use of semantic metadata for nonnegative matrix factorization based topic modeling and its application to main topic extraction. Besides a simple onelevel learning hierarchy, we adapt a two-level learning hierarchy method for this task. Our experiments show that these methods improve interpretability or coherence scores of extracted topics. Moreover, the two-level learning hierarchy approach can achieve higher interpretability scores than the one-level learning hierarchy method. |
URI: | http://paper.sci.ui.ac.id/jspui/handle/2808.28/195 |
ISSN: | 2093-1964 |
Appears in Collections: | Journal Collection |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Hendri.IJIIP.2014.pdf | 1,38 MB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.