Please use this identifier to cite or link to this item: http://repository.ipb.ac.id/handle/123456789/55944
Title: Temporal Question Answering System Bahasa Indonesia
Authors: Ridha, Ahmad
Darliansyah, Adi
Keywords: Bogor Agricultural University (IPB)
temporal question
temporal expression
question answering
bahasa Indonesia
Issue Date: 2012
Abstract: Time is an important dimension in information retrieval. Temporal expressions describe time information embedded in the documents. Therefore, extraction and normalization of temporal expressions from documents are crucial. In this research, a question answering system is implemented for temporal information processing from documents in Indonesian language based on four types of temporal question beginning with question words such as siapa (what), kapan (when), di mana (where), and berapa (how many). Implicit time references in document are first normalized and tagged manually into explicit time references. Complex temporal question is divided into simpler questions by using temporal signal detection for specific sequence of events. In order to obtain answer candidates, heuristic weighting is performed on the top passages. Answer extraction is performed using the smallest distance between query and answer candidates. A corpus containing 100 documents and 80 queries is used in this research. Answer evaluation is based on three criteria, namely, Right, Wrong, and Unsupported. The questions are used to evaluate the results of BM25 and Proximity ranking modes. The evaluation for simple temporal questions (Type 1 and 2) using BM25 and Proximity gave the same results at 85% Right answers for Type 1 and 75% for Type 2. The results for complex temporal questions (Type 3 and 4) indicated good performance. The best results were obtained by BM25 at 95% Right answers for Type 3 and 75% for Type 4, while using Proximity resulted in 85% Right answers for Type 3 and 80% for Type 4. We also used our corpus on a nontemporal question answering system by Umriadi in 2011. The results are 60%, 55%, 60%, and 40% Right answers for Type 1, 2, 3, and 4, respectively, much lower than our temporal question answering system. Therefore, temporal expression extraction and temporal signal identification are particularly important for handling questions containing temporal information. Our system is able to identify and answer the temporal questions in Indonesian language.
URI: http://repository.ipb.ac.id/handle/123456789/55944
Appears in Collections:UT - Computer Science

Files in This Item:
File Description SizeFormat 
G12ada.pdf
  Restricted Access
Full text1.33 MBAdobe PDFView/Open
G12ada_Abstrak.pdf
  Restricted Access
Abstrak292.5 kBAdobe PDFView/Open
G12ada_BAB I Pendahuluan.pdf
  Restricted Access
BAB I373.38 kBAdobe PDFView/Open
G12ada_BAB II Metode Penelitian.pdf
  Restricted Access
BAB II881.86 kBAdobe PDFView/Open
G12ada_BAB III Hasil dan Pembahasan.pdf
  Restricted Access
BAB III845.07 kBAdobe PDFView/Open
G12ada_BAB IV Simpulan dan Saran.pdf
  Restricted Access
BAB IV465.84 kBAdobe PDFView/Open
G12ada_Cover.pdf
  Restricted Access
Cover368.78 kBAdobe PDFView/Open
G12ada_Daftar Pustaka.pdf
  Restricted Access
daftar pustaka355.9 kBAdobe PDFView/Open
G12ada_Lampiran.pdf
  Restricted Access
Lampiran686.34 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.