Text segmentation for analysing different languages

Pak, Irina * and Teh, Phoey Lee * (2016) Text segmentation for analysing different languages. In: COMSE 2016 - First EAI International Conference on Computer Science and Engineering, 11-12 November 2016, Penang, Malaysia.

Teh Phoey Lee conf.pdf

Download (506kB) | Preview
Official URL: http://compse-conf.org/2016/show/home


Over the past several years, researchers have applied different methods of text segmentation. Text segmentation is defined as a method of splitting a document into smaller segments, assuming with its own relevant meaning. Those segments can be classified into the tag, word, sentence, topic, phrase and any information unit. Firstly, this study reviews the different types of text segmentation methods used in different types of documentation, and later discusses the various reasons for utilizing it in opinion mining. The main contribution of this study includes a summarisation of research papers from the past 10 years that applied text segmentation as their main approach in text analysing. Results show that word segmentation was successfully and widely used for processing different languages.

Item Type: Conference or Workshop Item (Paper)
Additional Information: Organised by European Alliance Innovation
Uncontrolled Keywords: text segmentation; text analysis; text processing; languages; online reviews; opinion mining
Subjects: Q Science > QA Mathematics > QA76 Computer software
Divisions: Sunway University > School of Engineering and Technology [formerly School of Science and Technology until 2020] > Dept. Computing and Information Systems
Depositing User: Dr Janaki Sinnasamy
Related URLs:
Date Deposited: 18 Jun 2018 03:21
Last Modified: 23 Jul 2019 01:41
URI: http://eprints.sunway.edu.my/id/eprint/841

Actions (login required)

View Item View Item