A Cebuano Parts-of-Speech(POS) Tagger Using Hidden Markov Model(HMM) Applied to News Text Genre

TENCON 2024 - 2024 IEEE Region 10 Conference (TENCON), (2024), pp. 940-943

Jeneffer A. Sabonsolin^a, Shaneth C. Ambat^a, Ace C. Lagman^a

^a Computer Science Department, FEU Institute of Technology

Abstract: Part-of-speech (POS) tagging, which identifies the grammatical category of each word in a sentence, is crucial in natural language processing. This research highlights the lack of focus on POS tagging for Asian languages, particularly Cebuano. Limited research on Cebuano has hindered linguistic documentation and understanding of its grammar and vocabulary. This study introduces a Cebuano POS tagger based on the Hidden Markov Model (HMM) to improve Cebuano text processing. The researchers also propose a method for handling unfamiliar words. Results show that the algorithm performs well on a news text corpus of 25,000 datasets, with an accuracy of 84%, precision of 80%, recall of 81.52%, and F1-score of 82%. These outcomes demonstrate the algorithm's effectiveness in addressing language challenges in a specific genre. Additionally, the research contributes to the Sustainable Development Goals (SDGs) by promoting linguistic diversity and fostering inclusive language technologies. The study provides insights into Cebuano's linguistic traits and grammatical structures, offering a foundation for further research in natural language processing.
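For readers unfamiliar with HMM tagging, the general technique named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the three-tag tagset, the two-sentence toy corpus, and the function names are assumptions made for the example, and add-one smoothing stands in for the paper's own unfamiliar-word method, which the abstract does not describe.

```python
import math
from collections import Counter, defaultdict

# Toy illustrative data (assumption): NOT taken from the paper's news corpus.
TOY_CORPUS = [
    [("Nikaon", "VERB"), ("ang", "DET"), ("bata", "NOUN")],
    [("Nidagan", "VERB"), ("ang", "DET"), ("iro", "NOUN")],
]

def train_hmm(tagged_sentences):
    """Count tag-to-tag transitions and tag-to-word emissions."""
    transitions = defaultdict(Counter)  # previous tag -> counts of next tag
    emissions = defaultdict(Counter)    # tag -> counts of emitted word
    vocab = set()
    for sentence in tagged_sentences:
        prev = "<s>"  # sentence-start pseudo-tag
        for word, tag in sentence:
            word = word.lower()
            transitions[prev][tag] += 1
            emissions[tag][word] += 1
            vocab.add(word)
            prev = tag
    return transitions, emissions, vocab

def log_prob(counter, key, num_outcomes):
    """Add-one smoothed log probability, so unseen events keep nonzero mass."""
    return math.log((counter[key] + 1) / (sum(counter.values()) + num_outcomes))

def viterbi(words, transitions, emissions, vocab):
    """Return the most probable tag sequence for the given words."""
    if not words:
        return []
    tags = sorted(emissions)
    n_tags = len(tags)
    n_words = len(vocab) + 1  # one extra slot reserved for unseen words
    words = [w.lower() for w in words]
    scores = [{
        t: log_prob(transitions["<s>"], t, n_tags)
           + log_prob(emissions[t], words[0], n_words)
        for t in tags
    }]
    backpointers = []
    for word in words[1:]:
        prev_scores = scores[-1]
        column, pointers = {}, {}
        for t in tags:
            # Pick the previous tag that maximizes the path score into t.
            best_prev = max(
                tags,
                key=lambda p: prev_scores[p] + log_prob(transitions[p], t, n_tags),
            )
            column[t] = (prev_scores[best_prev]
                         + log_prob(transitions[best_prev], t, n_tags)
                         + log_prob(emissions[t], word, n_words))
            pointers[t] = best_prev
        scores.append(column)
        backpointers.append(pointers)
    # Trace back from the best final tag.
    path = [max(tags, key=lambda t: scores[-1][t])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    return path[::-1]

transitions, emissions, vocab = train_hmm(TOY_CORPUS)
# "milukso" does not occur in the toy training data, so it is scored by smoothing.
print(viterbi(["milukso", "ang", "iro"], transitions, emissions, vocab))
```

In this sketch the unseen first word is tagged from transition evidence alone, since smoothing gives it the same emission probability under every tag. A real tagger would typically add stronger cues for unfamiliar words, such as affix features.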

Recommended Citation

Sabonsolin, J. A., Ambat, S. C., & Lagman, A. C. (2024). A Cebuano Parts-of-Speech(POS) Tagger Using Hidden Markov Model(HMM) Applied to News Text Genre. TENCON 2024 - 2024 IEEE Region 10 Conference (TENCON), 940-943. https://doi.org/10.1109/TENCON61640.2024.10902693

J. A. Sabonsolin, S. C. Ambat, and A. C. Lagman, "A Cebuano Parts-of-Speech(POS) Tagger Using Hidden Markov Model(HMM) Applied to News Text Genre," TENCON 2024 - 2024 IEEE Region 10 Conference (TENCON), pp. 940-943, 2024. doi: 10.1109/TENCON61640.2024.10902693.

Sabonsolin, Jeneffer A., et al. "A Cebuano Parts-of-Speech(POS) Tagger Using Hidden Markov Model(HMM) Applied to News Text Genre." TENCON 2024 - 2024 IEEE Region 10 Conference (TENCON), 2024, pp. 940-943. https://doi.org/10.1109/TENCON61640.2024.10902693.

Sabonsolin, J. A., Ambat, S. C., & Lagman, A. C. 2024. "A Cebuano Parts-of-Speech(POS) Tagger Using Hidden Markov Model(HMM) Applied to News Text Genre." TENCON 2024 - 2024 IEEE Region 10 Conference (TENCON): 940-943. https://doi.org/10.1109/TENCON61640.2024.10902693.
