Please use this identifier to cite or link to this item:
http://paper.sci.ui.ac.id/jspui/handle/2808.28/196
Title: | Soft Clustering for Optimal Topic Based Document Retrieval |
Authors: | Murfi, Hendri |
Keywords: | Topic Modeling Nonnegative Matrix Factorization Soft Clustering, Document Retrieval |
Issue Date: | Nov-2014 |
Publisher: | AICIT (Advanced Institute of Convergence Information Technology, Republic of Korea) |
Series/Report no.: | Volume 5;Issue 4 |
Abstract: | Topic modeling is a type of the statistical model that has been proven successful for discovering topics from document. The extracted topics have been used as alternative indexes to overcome the drawbacks of the standard word based document retrieval. The purpose of this paper is to examine the use of clustering methods for the topic-based document retrieval. This leads us to propose a soft clustering method that can be used for optimizing the performance of the topic-based document retrieval. We show that by applying the soft clustering method where a document is assigned to several significant clusters, we can improve the performance of hard clustering method while still reducing the number of considered documents to a reasonable size. |
URI: | http://paper.sci.ui.ac.id/jspui/handle/2808.28/196 |
ISSN: | 2093-4009 |
Appears in Collections: | Journal Collection |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Hendri.IJIPM.2014.pdf | 2,71 MB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.