Pak, Irina * and Teh, Phoey Lee * (2016) Text segmentation for analysing different languages. In: COMSE 2016 - First EAI International Conference on Computer Science and Engineering, 11-12 November 2016, Penang, Malaysia.
Abstract
Over the past several years, researchers have applied different methods of text segmentation. Text segmentation is defined as a method of splitting a document into smaller segments, each assumed to carry its own relevant meaning. These segments can be classified as tags, words, sentences, topics, phrases, or any other information unit. This study first reviews the text segmentation methods used in different types of documentation, and then discusses the various reasons for using text segmentation in opinion mining. The main contribution of this study is a summary of research papers from the past 10 years that applied text segmentation as their main approach to text analysis. Results show that word segmentation has been widely and successfully used for processing different languages.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | Organised by the European Alliance for Innovation (EAI) |
Uncontrolled Keywords: | text segmentation; text analysis; text processing; languages; online reviews; opinion mining |
Subjects: | Q Science > QA Mathematics > QA76 Computer software |
Divisions: | Sunway University > School of Engineering and Technology [formerly School of Science and Technology until 2020] > Dept. Computing and Information Systems |
Depositing User: | Dr Janaki Sinnasamy |
Date Deposited: | 18 Jun 2018 03:21 |
Last Modified: | 23 Jul 2019 01:41 |
URI: | http://eprints.sunway.edu.my/id/eprint/841 |