QUERY-BASED AMHARIC LEGAL DOCUMENT SUMMARIZATION

Show simple item record

dc.contributor.author Wolde, Sifrayenesh
dc.contributor.author Meshesha, Dr. Million
dc.contributor.author Delessa, MSc Tamrat
dc.date.accessioned 2021-06-29T03:28:31Z
dc.date.available 2021-06-29T03:28:31Z
dc.date.issued 2020-12
dc.identifier.uri http://localhost:8080/xmlui/handle/123456789/3791
dc.description 102p. en_US
dc.description.abstract Automatic text summarization is a natural language processing application that proposes the extraction of salient information from a source document to produce a condensed version for a particular user. Due to the increasing number of interested users in searching for Amharic documents, Automatic text summarization becomes a tool to help users through pointing which document is relevant to their need (search) by putting a summary for each particular document in a particular domain. However, most researches have been conducted for Amharic text summarization and almost all are done for generic summarization, but in this study query-oriented summarization have been done to extract query related sentences from the document based on their rank. In this study, two distinct approaches have been used to extract Amharic legal judgment summaries from a single legal document based on the incoming user query. Both TF*IDF weighting and TextRank approaches generate a summary based on the query with different performances results. The performance evaluation has been done objectively and subjectively by comparing manual summary with system generated summary. The objective evaluation result shows that Amharic document TextRank registers the best result of 71.36% F-measure at extraction rate 30%. The subjective evaluation also conducted by the experts to compare the coherence and informativeness of the summarizer. In terms of informativeness and coherency of the summary, 85.87% accuracy is registered at extraction rate of 30%. The experiment shows the study achieves promising result. However, TF*IDF weighting and TextRank approaches employed in this study are incapable of extracting a summary from documents which are too long this needs further investigation. One of the challenges of this study is when the document length increases the noise happen and important part of a summary may be missed. en_US
dc.description.sponsorship Haramaya University en_US
dc.language.iso en en_US
dc.publisher Haramaya university en_US
dc.title QUERY-BASED AMHARIC LEGAL DOCUMENT SUMMARIZATION en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search HU-IR System


Advanced Search

Browse

My Account