Combining Information Extraction and Text Segmentation methods in Greek Texts

Pavlina Fragkou

Abstract


In this paper we examine the benefit of performing named entity recognition (NER) and co-reference resolution on a Greek corpus used for text segmentation. The aim is to determine whether combining text segmentation with information extraction helps identify the various topics that appear in a document. NER was performed on the Greek corpus using an existing tool. The resulting annotations were manually corrected and enriched to cover four types of named entities. Co-reference resolution was then performed manually. The evaluation, which used four text segmentation algorithms, leads to the conclusion that information extraction techniques appear to be a promising way to capture semantic information for segmentation purposes.



DOI: https://doi.org/10.5430/air.v7n1p23


Artificial Intelligence Research

ISSN 1927-6974 (Print)   ISSN 1927-6982 (Online)

Copyright © Sciedu Press 