Feature-based approach and sequential pattern mining to enhance quality of Indonesian automatic text summarization

Maylawati, Dian Sa'adillah and Kumar, Yogan Jaya and Kasmin, Fauziah Binti (2022) Feature-based approach and sequential pattern mining to enhance quality of Indonesian automatic text summarization. Indonesian Journal of Electrical Engineering and Computer Science, 30 (3). pp. 1795-1804. ISSN 2502-4760

[img]
Preview
Text (Artikel)
30114-63211-1-PB.pdf

Download (457kB) | Preview
[img]
Preview
Text (Turnitin)
IJEECS Turnitin Final Camera Ready.pdf

Download (2MB) | Preview
[img]
Preview
Text (Korespondensi)
Korespondensi IJEECS.pdf

Download (1MB) | Preview
Official URL: https://ijeecs.iaescore.com/index.php/IJEECS/artic...

Abstract

Indonesian automatic text summarization research is developed rapidly. The quality, especially readability aspect, of text summary can be reached if the meaning of the text can be maintained properly. Therefore, this research aims to enhance the quality of extractive Indonesian automatic text summarization with considering the quality of structured representation of text. This research uses sequential pattern mining (SPM) to produce This research use SPM to produce sequence of words (SoW) as structured text representation using PrefixSpan algorithm. Then, SPM is combined with feature-based approach using sentence scoring method to produce summary. The experiment result using IndoSum dataset shows that even though the combination of SPM and sentence scoring can increase the precision value of recall-oriented understudy for gisting evaluation (ROUGE)-1, ROUGE-2, and ROUGE-L, from 0.68 to 0.76, 0.54 to 0.69, and 0.51 to 0.72. Especially, combination of SPM and Sentence Scoring can enhance precision, recall, and f-measure of ROUGE-L that consider the order of word occurance in measurement. SPM increases ROUGE-L f-measure value of sentence scoring from 0.32 to 0.36. Moreover, combination of sentence scoring and SPM is better than SumBasic that used as feature-based approach in the previous Indonesian text summarization research.

Item Type: Article
Subjects: Technology, Applied Sciences
Divisions: Fakultas Sains dan Teknologi > Program Studi Teknik Informatika
Depositing User: Dian Sa'adillah Maylawati
Date Deposited: 04 Apr 2023 02:06
Last Modified: 04 Apr 2023 02:06
URI: https://digilib.uinsgd.ac.id/id/eprint/66648

Actions (login required)

View Item View Item