Combining Information Extraction and Text Segmentation methods in Greek Texts
Abstract
In this paper we examine the benefit of performing named entity recognition (NER) and co-reference resolution to a Greek corpus used for text segmentation. The aim here is to examine whether the combination of text segmentation and information extraction is beneficial for identifying various topics that appear in a document. NER was performed using an already existing tool for the Greek corpus. Produced annotations were manually corrected and enriched to cover four types of named entities. Co-reference resolution was subsequently performed manually. The evaluation, using four text segmentation algorithms leads to the conclusion that, information extraction techniques appear to be a promising solution in capturing semantic information for segmentation purposes.
Full Text:
PDFDOI: https://doi.org/10.5430/air.v7n1p23
Refbacks
- There are currently no refbacks.
Artificial Intelligence Research
ISSN 1927-6974 (Print) ISSN 1927-6982 (Online)
Copyright © Sciedu Press
To make sure that you can receive messages from us, please add the 'Sciedupress.com' domain to your e-mail 'safe list'. If you do not receive e-mail in your 'inbox', check your 'bulk mail' or 'junk mail' folders.