Text segmentation techniques: A critical review

Pak, Irina * and Teh, Phoey Lee * (2017) Text segmentation techniques: A critical review. In: Innovative, Computing, Optimization and its Applictions. Studies in Computational Intelligence Book Series (741). Springer, Cham, pp. 167-181. ISBN 978-3-319-66983-0

Teh Phoey Lee Text Segmentation Techniques A Critical Review edited.pdf

Download (3MB) | Preview
Official URL: https://link.springer.com/chapter/10.1007/978-3-31...


Text segmentation is widely used for processing text. It is a method of splitting a document into smaller parts, which is usually called segments. Each segment has its relevant meaning. Those segments categorized as word, sentence, topic, phrase or any information unit depending on the task of the text analysis. This study presents various reasons of usage of text segmentation for different analyzing approaches. We categorized the types of documents and languages used. The main contribution of this study includes a summarization of 50 research papers and an illustration of past decade (January 2007- January 2017)’s of research that applied text segmentation as their main approach for analysing text. Results revealed the popularity of using text segmentation in different languages. Besides that, the “word” seems to be the most practical and usable segment, as it is the smaller unit than the phrase, sentence or line.

Item Type: Book Section
Subjects: Q Science > QA Mathematics > QA76 Computer software
Divisions: Sunway University > School of Engineering and Technology [formerly School of Science and Technology until 2020] > Dept. Computing and Information Systems
Depositing User: Dr Janaki Sinnasamy
Related URLs:
Date Deposited: 18 Jun 2018 03:08
Last Modified: 30 Apr 2019 08:15
URI: http://eprints.sunway.edu.my/id/eprint/840

Actions (login required)

View Item View Item