Please use this identifier to cite or link to this item: http://repository.ipb.ac.id/handle/123456789/61946
Title: Weighting in Indexing Process for Document in Bahasa Indonesia Using Indri Framework
Pembobotan dalam Proses Pengindeksan Dokumen Bahasa Indonesia dengan Menggunakan Framework Indri
Authors: Adisantoso, Julio
Herdi, Hendrex
Keywords: index
information search engine
tf-idf
BM25
Issue Date: 2012
Abstract: A very large amount of information has stimulated the development of information search engine to help users in finding information they need. To retrieve the information according to the user’s needs, information search engine should be able to work well. One of the factors that can affect the performance of search engines is indexing. The purpose of this research is to implement automatic indexing process using Indri framework with tf-idf and BM25 term weighting. This testing used 30 queries and 2000 documents. The testing result showed that the performance of information search engine is better when we use the BM25 term weighting than tf-idf term weighting. However, the performance of information search engine with BM25 term weighting and tf-idf term weighting gave good results with around 64% average precision. The number of indexed documents for indexing will affect the indexing time. Increasing of the number of indexed documents will increase the indexing time.
URI: http://repository.ipb.ac.id/handle/123456789/61946
Appears in Collections:UT - Computer Science

Files in This Item:
File SizeFormat 
G10hhe.pdf
  Restricted Access
600.83 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.