Please use this identifier to cite or link to this item: http://paper.sci.ui.ac.id/jspui/handle/2808.28/196
Title: Soft Clustering for Optimal Topic Based Document Retrieval
Authors: Murfi, Hendri
Keywords: Topic Modeling
Nonnegative Matrix Factorization
Soft Clustering, Document Retrieval
Issue Date: Nov-2014
Publisher: AICIT (Advanced Institute of Convergence Information Technology, Republic of Korea)
Series/Report no.: Volume 5;Issue 4
Abstract: Topic modeling is a type of the statistical model that has been proven successful for discovering topics from document. The extracted topics have been used as alternative indexes to overcome the drawbacks of the standard word based document retrieval. The purpose of this paper is to examine the use of clustering methods for the topic-based document retrieval. This leads us to propose a soft clustering method that can be used for optimizing the performance of the topic-based document retrieval. We show that by applying the soft clustering method where a document is assigned to several significant clusters, we can improve the performance of hard clustering method while still reducing the number of considered documents to a reasonable size.
URI: http://paper.sci.ui.ac.id/jspui/handle/2808.28/196
ISSN: 2093-4009
Appears in Collections:Journal Collection

Files in This Item:
File Description SizeFormat 
Hendri.IJIPM.2014.pdf2,71 MBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.