Modified Viterbi Algorithm for Religious Text: A Part-of-Speech Tagging for Waray-Waray
TENCON 2025 - 2025 IEEE Region 10 Conference (TENCON), (2026), pp. 953-957
Jeneffer A. Sabonsolin a, Robert R. Roxas b, Ace C. Lagman a
a FEU Institute of Technology, Manila, Philippines
b University of the Philippines Cebu, Cebu City, Philippines
Abstract: Part-of-speech (POS) tagging is a vital process in natural language processing, enabling the identification of grammatical categories within sentences. POS tagging for Asian languages, particularly Waray-waray, has received little attention. Limited studies on Waray-waray religious texts have hindered linguistic documentation and a deeper understanding of the language's grammar and vocabulary. To address this gap, the study introduces a POS tagging system for Waray-waray based on a Modified Viterbi Algorithm, which also incorporates a strategy for handling unfamiliar words. Evaluated on a corpus of 50,000 religious text datasets, the algorithm achieves an accuracy of 93%, precision of 90%, recall of 90.52%, and an F1 score of 92%. These results underscore the algorithm's effectiveness in navigating linguistic challenges across specialized genres. Beyond technical contributions, the study promotes linguistic diversity and fosters inclusive language technologies, advancing the Sustainable Development Goals (SDGs). Specifically, it enhances language learning and literacy among Waray-waray speakers, supports inclusive education through computational tools for minority languages, and aligns with SDG 4 by providing foundational resources for mother-tongue instruction and educational content development. Additionally, it offers new insights into Waray-waray's grammatical structures, laying a robust groundwork for future linguistic and computational research.
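The abstract does not spell out how the Viterbi algorithm is modified or how unfamiliar words are handled, so the following is only a minimal sketch of the general technique: a textbook bigram HMM tagger decoded with Viterbi, plus one assumed unknown-word strategy (a uniform emission score for unseen words, so that the tag-transition context decides). The function names `train`, `log_prob`, and `viterbi`, the add-one smoothing, and the toy English corpus are all illustrative assumptions, not the authors' method or data.

```python
import math
from collections import defaultdict

def train(tagged_sentences):
    """Count tag transitions and word emissions from lists of (word, tag) pairs."""
    trans = defaultdict(lambda: defaultdict(int))  # trans[prev_tag][tag]
    emit = defaultdict(lambda: defaultdict(int))   # emit[tag][word]
    vocab = set()
    for sentence in tagged_sentences:
        prev = "<s>"  # sentence-start pseudo-tag
        for word, tag in sentence:
            w = word.lower()
            trans[prev][tag] += 1
            emit[tag][w] += 1
            vocab.add(w)
            prev = tag
    return trans, emit, vocab

def log_prob(counts, key, n_outcomes, alpha=1.0):
    """Add-alpha smoothed log probability of `key` under a count table."""
    total = sum(counts.values())
    return math.log((counts.get(key, 0) + alpha) / (total + alpha * n_outcomes))

def viterbi(words, trans, emit, vocab):
    """Return the most probable tag sequence for `words`."""
    if not words:
        return []
    tags = sorted(emit)
    n_tags = len(tags)
    n_words = len(vocab) + 1  # one extra bucket for unseen words

    def emission(tag, word):
        w = word.lower()
        if w not in vocab:
            # ASSUMED unknown-word strategy: same score for every tag,
            # so only the transition probabilities influence the choice.
            return 0.0
        return log_prob(emit[tag], w, n_words)

    # best[i][t]: best log score of any path ending in tag t at position i
    best = [{t: log_prob(trans["<s>"], t, n_tags) + emission(t, words[0])
             for t in tags}]
    back = [{}]
    for i in range(1, len(words)):
        best.append({})
        back.append({})
        for t in tags:
            prev, score = max(
                ((p, best[i - 1][p] + log_prob(trans[p], t, n_tags)) for p in tags),
                key=lambda pair: pair[1],
            )
            best[i][t] = score + emission(t, words[i])
            back[i][t] = prev

    # Follow back-pointers from the best final tag.
    path = [max(best[-1], key=best[-1].get)]
    for i in range(len(words) - 1, 0, -1):
        path.append(back[i][path[-1]])
    return path[::-1]

# Toy English corpus, for illustration only (not Waray-waray data).
toy_corpus = [
    [("the", "DET"), ("dog", "NOUN"), ("runs", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("a", "DET"), ("dog", "NOUN"), ("sleeps", "VERB")],
]
trans, emit, vocab = train(toy_corpus)

# "bird" never appears in training, so its tag comes from context alone.
print(viterbi(["the", "bird", "runs"], trans, emit, vocab))
# ['DET', 'NOUN', 'VERB']
```

In this sketch the unseen word "bird" is tagged NOUN purely because DET is most often followed by NOUN in the toy counts. A real system would more plausibly use affix or morphological cues for unknown words, which matter for a language like Waray-waray, but the abstract gives no detail on which strategy the authors chose.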