
      Analisis Kemiripan Teks Terhadap Goal Standard Menggunakan Algoritme Levenshtein Distance dan Cosine Similarity (Studi Kasus: Data Hasil Survei Perusahaan X)

      Date
      2022
      Author
      Azzahra, Aulia
      Kurnia, Anang
      Rizki, Akbar
      Abstract
      Text mining is a set of techniques for processing text data to extract specific information. One text mining task is measuring the similarity between two strings. The Levenshtein Distance algorithm measures text similarity by computing the edit distance between two strings, that is, the minimum number of single-character insertions, deletions, and substitutions needed to turn one string into the other. The second algorithm used is Cosine Similarity, which represents the two strings as vectors and measures the cosine of the angle between them. The algorithms are applied together with various text pre-processing techniques, which can improve the results. This study uses the two algorithms to measure text similarity between two datasets: survey results and the master data in the Company X database. The survey data contain 24,659 responses, while the Company X database lists 725 online class program names. The objects compared are the online class program names in the survey data and those in the Company X database. The resulting similarity values are used to identify which online class programs the survey participants attended. Of the 24,659 survey responses, the identification yielded 322 online class program names with the Levenshtein Distance algorithm and 327 with the Cosine Similarity algorithm.
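The two similarity measures named in the abstract can be illustrated with a minimal Python sketch. This is not the thesis's implementation: the pre-processing steps in `normalize`, the length-normalized Levenshtein similarity, the word-count vectors used for cosine similarity, the `best_match` helper, and the 0.8 threshold are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import math
import re
from collections import Counter


def normalize(text):
    """Assumed pre-processing: lowercase, replace punctuation with spaces, collapse whitespace."""
    return " ".join(re.sub(r"[^a-z0-9\s]", " ", text.lower()).split())


def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn string a into string b."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def levenshtein_similarity(a, b):
    """Edit distance rescaled to [0, 1] by the longer string's length (an assumed normalization)."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def cosine_similarity(a, b):
    """Cosine of the angle between word-count vectors of a and b (assumed vectorization)."""
    va, vb = Counter(a.split()), Counter(b.split())
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = (math.sqrt(sum(c * c for c in va.values()))
            * math.sqrt(sum(c * c for c in vb.values())))
    return dot / norm if norm else 0.0


def best_match(response, master_names, score, threshold=0.8):
    """Hypothetical helper: return the master name most similar to a survey
    response, or None when the best score falls below the threshold."""
    cleaned = normalize(response)
    best_name, best_score = None, 0.0
    for name in master_names:
        s = score(cleaned, normalize(name))
        if s > best_score:
            best_name, best_score = name, s
    return (best_name, best_score) if best_score >= threshold else (None, best_score)


# Hypothetical master list and survey response, for illustration only.
master = ["Data Science Bootcamp", "Digital Marketing 101"]
print(best_match("data sciense bootcamp", master, levenshtein_similarity))
print(best_match("Bootcamp: Data Science", master, cosine_similarity))
```

The character-level measure tolerates spelling mistakes, while the word-level cosine measure tolerates reordered words, which is one plausible reason the two algorithms identify slightly different sets of program names.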
       
      URI
      http://repository.ipb.ac.id/handle/123456789/114650
      Collections
      • UT - Statistics and Data Sciences [2260]

      Copyright © 2020 Library of IPB University
      All rights reserved
      Contact Us | Send Feedback
      Indonesia DSpace Group 
      IPB University Scientific Repository
      UIN Syarif Hidayatullah Institutional Repository
      Universitas Jember Digital Repository
        

       
