Utilizing Modified Viterbi Algorithm for Religious Text: A Cebuano Part-of-Speech Tagging

2024 International Conference on IT Innovation and Knowledge Discovery (ITIKD)
(2025), pp. 1-6
Jeneffer A. Sabonsolin
a
,
Shaneth C. Ambat
a
,
Angelo C. Arguson
a
,
Fanny C. Almeniana
a
,
Elisa V. Malasaga
a
,
Hadji J. Tejuco
a
a Computer Science Department, FEU Institute of Technology, Metro Manila
Abstract: Part of speech tagging (POS) is crucial in natural language processing, identifying the grammatical categories of words in sentences. This research highlights the lack of focus on POS tagging for Asian languages, particularly Cebuano. Inadequate research on Cebuano religious text has hindered linguistic documentation and understanding its grammar and vocabulary. This study introduces a Parts-of-Speech Tagging for Cebuano utilizing a Modified Viterbi Algorithm. The researchers also apply a method for handling unfamiliar words. Results indicate that the algorithm performs exceptionally well on a religious text corpus comprising 50,000 datasets, achieving an accuracy of93%,precision of90%, recall of 90. 52%, and an F1-score of92%. These results highlight the algorithm's effectiveness in tackling language challenges within specific genres. Furthermore, the research supports the Sustainable Development Goals (SDGs) by promoting linguistic diversity and advancing inclusive language technologies. The study also provides valuable insights into Cebuano's linguistic characteristics and grammatical structures, laying a solid foundation for future research in natural language processing.