Automatic Question Generation from Afaan Oromo Text UsingDeepLearning

Show simple item record

dc.contributor.author Jemal Abdela
dc.contributor.author Million Meshesha (PhD)
dc.contributor.author Jemal Abate (MSc)
dc.date.accessioned 2026-05-25T06:21:17Z
dc.date.available 2026-05-25T06:21:17Z
dc.date.issued 2025-03
dc.identifier.uri http://ir.haramaya.edu.et//hru/handle/123456789/8464
dc.description 138p. en_US
dc.description.abstract In the modern digital era, the availability of electronic content in multiple languages has grownconsiderably. Nevertheless, generating questions from these materials remains a labor-intensiveand time-consuming process, particularly for low-resource languages like Afaan Oromo. Whilesignificant progress has been made in Automatic Question Generation (AQG) for languages suchas English, Chinese, Amharic and Somali. there is a notable lack of technology with NLPonAfaan Oromo QG because Afaan Oromo has the unique linguistic characteristics. This studybridges this gap by developing an AQG system for Afaan Oromo using deep learning models, specifically LSTM, Bi-LSTM, and GRU. To build the model a dataset of 5,000 paragraph-question-answer triples was meticulouslycurated from Afaan Oromo textbooks and educational resources. The dataset underwent preprocessing steps such as tokenization, normalization, and word embedding using Word2Vec. A deep learning model with an attention mechanism was employed to generate questions bydesign science research methodology, Among the models evaluated, the Bi-LSTMdemonstratedthe highest performance, achieving a training accuracy of 95.3% and a validation accuracyof 92.5%. The LSTM model also performed well, with a training accuracy of 92.31%and a validation accuracy of 91.02%, while the GRU model showed marked improvement after hyperparameter tuning, reaching 88.0% training accuracy and 87.0% validation accuracy The results indicate that the Bi-LSTM model is the most effective for generating both factoidandnon-factoid questions in Afaan Oromo. Future research should explore transfer learning frompre-trained models, expand the dataset through collaborations with educational institutions, andintegrate advanced neural architectures like Transformers to further enhance performance andquestion quality. This study makes a significant contribution to the field of Natural LanguageProcessing (NLP) by pioneering AQG for Afaan Oromo, use as input for QA system, providingafoundation for future research and practical applications in education and language preservation. en_US
dc.description.sponsorship Haramaya University en_US
dc.language.iso en en_US
dc.publisher Haramaya University en_US
dc.subject Afaan Oromo, Question Generation, Deep Learning, BI-LSTM en_US
dc.title Automatic Question Generation from Afaan Oromo Text UsingDeepLearning en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search HU-IR System


Advanced Search

Browse

My Account