View Item 
      •   IPB Repository
      • Dissertations and Theses
      • Undergraduate Theses
      • UT - Faculty of Mathematics and Natural Sciences
      • UT - Computer Science
      • View Item
      •   IPB Repository
      • Dissertations and Theses
      • Undergraduate Theses
      • UT - Faculty of Mathematics and Natural Sciences
      • UT - Computer Science
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Pseudo-Relevance Feedback on Retrieval Using Document Segmentation.

      Thumbnail
      View/Open
      G09eda.ppt (3.251Mb)
      Abstract (279.9Kb)
      Postscript (407.0Kb)
      Cover (304.9Kb)
      Full Text (1.227Mb)
      Hasil dan Pembahasan (470.8Kb)
      Kesimpulan (285.0Kb)
      Lampiran (1.043Mb)
      Pendahuluan (615.5Kb)
      Date
      2009
      Author
      Anbiana, Elenur Dwi
      Metadata
      Show full item record
      Abstract
      Web is the largest information source in the world, but the storage and effective information retrieval on the web is still a problem in this day. Pseudo-relevance feedback is automatically local analysis technique (relevance feedback technique without explicit input user). This technique takes the top n-ranked documents as the relevant documents and takes the top x-ranked terms from relevant documents. Basically, a document consist of any topics, so in the research, relevant documents substituted by a segment which represents a topic in the relevant document. The segment is taken by XML document tag, are <TITLE>, <AUTHOR> and <P> tag, since the text from those tags are usually represent a document. The visual block extraction phase (first step in VIsion based Page Segmentation algorithm), used in segmented the document, so the title of the research is Pseudo-Relevance Feedback on Retrieval Using Document Segmentation. This research done in six phases, there are retrieval initialization, document segmentation, segments selection, terms selection, final retrieval and retrieval output evaluation. The result of system performance is good, that is 0.5214. The test results show that the performance of the PRF based segmentation of documents and without PRF retrieval was not found significant differences. It is because of the taken documents and segments for expansion terms selection are not relevant, expanded queries are not exact representing the segment, the member of documents that are used in retrieval is relatively small, that are 1000 documents. Keywords: pseudo-relevance feedback, relevance feedback, page segmentation, text processing.
      URI
      http://repository.ipb.ac.id/handle/123456789/12603
      Collections
      • UT - Computer Science [2482]

      Copyright © 2020 Library of IPB University
      All rights reserved
      Contact Us | Send Feedback
      Indonesia DSpace Group 
      IPB University Scientific Repository
      UIN Syarif Hidayatullah Institutional Repository
      Universitas Jember Digital Repository
        

       

      Browse

      All of IPB RepositoryCollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

      My Account

      Login

      Application

      google store

      Copyright © 2020 Library of IPB University
      All rights reserved
      Contact Us | Send Feedback
      Indonesia DSpace Group 
      IPB University Scientific Repository
      UIN Syarif Hidayatullah Institutional Repository
      Universitas Jember Digital Repository