Please use this identifier to cite or link to this item: http://repository.ipb.ac.id/handle/123456789/65299
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorAdisantoso, Julio
dc.contributor.authorDewi, Rahmatika
dc.date.accessioned2013-09-09T02:43:26Z
dc.date.available2013-09-09T02:43:26Z
dc.date.issued2013
dc.identifier.urihttp://repository.ipb.ac.id/handle/123456789/65299
dc.description.abstractThe field of document information retrieval has very diverse and rapidly-growing documents thereforethe need for methods to categorize documents effectively and efficiently increases. Categorizing documents can be performed using clustering techniques. This research uses the K-Means technique, one example of a partitioning clustering algorithm. K-Means is a simple algorithm that aims to get the appropriate grouping. Chi-square feature selection and the IDF were used to obtain the termsused as the unique identifiers of the documents. Clustering results with different feature selection techniques were made forcomparison to get the expected results.The accuracy values obtained for the IDF and the chi-square feature selection for data size 150 using rand index are26%, 75%, respectively.The accuracy values obtained for the IDF and the chi-square feature selection for data size 457 using rand index are31%, 37%, respectively. The accuracy values obtained for the IDF and the chi-square feature selection for data size 150 usingpurity measureare 97%, 96%, respectively. The accuracy values obtained for the IDF and the chi-square feature selection for data size 457 using rand index are 93%, 95%, respectively.en
dc.subjectBogor Agricultural University (IPB)en
dc.subjectFeature Selectionen
dc.subjectClusteringen
dc.subjectK-Means,en
dc.titlePemilihan Fitur Dokumen Bahasa Indonesia untuk Pengelompokan dengan Metode K-Meansen
Appears in Collections:UT - Computer Science

Files in This Item:
File SizeFormat 
G13rde.pdf
  Restricted Access
553.25 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.