Kajian Metode Penggerombolan Dua Tahap untuk Data yang Mengandung Pencilan

Nurwida, Arni

Please use this identifier to cite or link to this item: http://repository.ipb.ac.id/handle/123456789/69118

Title:	Kajian Metode Penggerombolan Dua Tahap untuk Data yang Mengandung Pencilan
Authors:	Sadik, Kusman Indahwati Nurwida, Arni
Issue Date:	2014
Abstract:	Cluster analysis is often encountered in various studies. Analysis of classical clusters, such as hierarchical clustering method and k-means clustering cannot handle categorical variables or a mixture of numerical and categorical. In addition, the determination of the optimal number of clusters are still dependent on the subjectivity of the researcher and cannot handle very large datasets, which is larger than 500. One approach to addressing this problem is to use a two-step clustering method. The accuracy of the two-step clustering method of predicting the number of clusters generated as well as the classification of cluster membership, especially in the data containing outliers is important to be studied. Outliers in the data containing a small (1%), this method provides more accurate compared with the results of data containing a large outliers (5% or 15%). Scale use of outliers handling in the data containing outliers must be greater than the amount of outliers itself. Two-step clustering method is very accurate in producing a number of clusters associated with the actual number of population clusters that do not contain data outliers, especially in the most variable of type numeric and categorical rest. Clustering villages in Indonesia by a factor of progress and backwardness villages using a two-step clustering method generates optimal cluster 7.
URI:	http://repository.ipb.ac.id/handle/123456789/69118
Appears in Collections:	UT - Statistics and Data Sciences

Files in This Item:

File	Description	Size	Format
G14anu.pdf Restricted Access	full text	1.68 MB	Adobe PDF	View/Open

Show full item record Recommend this item

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets