Analyzing PEGASUS Model Performance with ROUGE on Indonesian News Summarization

Kartamanah, Fatih Fauzan (2025) Analyzing PEGASUS Model Performance with ROUGE on Indonesian News Summarization. Analyzing PEGASUS Model Performance with ROUGE on Indonesian News Summarization, 9 (1). pp. 31-42. ISSN 2541-2019

[img]
Preview
Text
14303-Article Text-20847-1-10-20250108.pdf

Download (478kB) | Preview
Official URL: https://jurnal.polgan.ac.id/index.php/sinkron/arti...

Abstract

Text summarization technology has been rapidly advancing, playing a vital role in improving information accessibility and reducing reading time within Natural Language Processing (NLP) research. There are two primary approaches to text summarization: extractive and abstractive. Extractive methods focus on selecting key sentences or phrases directly from the source text, while abstractive summarization generates new sentences that capture the essence of the content. Abstractive summarization, although more flexible, poses greater challenges in maintaining coherence and contextual relevance due to its complexity. This study aims to enhance automated abstractive summarization for Indonesian-language online news articles by employing the PEGASUS (Pre-training with Extracted Gap-sentences Sequences for Abstractive Summarization) model, which leverages an encoder-decoder architecture optimized for summarization tasks. The dataset utilized consists of 193,883 articles from Liputan6, a prominent Indonesian news platform. The model was fine-tuned and evaluated using the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) metric, focusing on F-1 scores for ROUGE-1, ROUGE-2, and ROUGE-L. The results demonstrated the model's ability to generate coherent and informative summaries, achieving ROUGE-1, ROUGE-2, and ROUGE-L scores of 0.439, 0.183, and 0.406, respectively. These findings underscore the potential of the PEGASUS model in addressing the challenges of abstractive summarization for low-resource languages like Indonesian language, offering a significant contribution to summarization quality for online news content.

Item Type: Article
Subjects: Special Computer Methods
Special Computer Methods > Artificial Intelligence
Special Computer Methods > Blogs
Mathematics
Mathematics > Data Processing and Analysis of Mathematics
Mathematics > Research Methods of Mathematics
Technology, Applied Sciences
Engineering
Engineering > Engineers
Engineering > Other Engineering Materials
Divisions: Fakultas Sains dan Teknologi > Program Studi Teknik Informatika
Depositing User: Fatih Fauzan Kartamanah
Date Deposited: 11 Feb 2025 08:38
Last Modified: 11 Feb 2025 08:38
URI: https://digilib.uinsgd.ac.id/id/eprint/104259

Actions (login required)

View Item View Item