e-ISSN:0976-5166
p-ISSN:2231-3850


INDIAN JOURNAL OF COMPUTER SCIENCE AND ENGINEERING

Call for Papers 2020

Jun 2020 - Volume 11, Issue 3
Deadline: 15 May 2020
Due to COVID-19 deadline extended to 31-May-2020
Notification: 15 Jun 2020
Publication: 30 Jun 2020

Aug 2020 - Volume 11, Issue 4
Deadline: 15 Jul 2020
Notification: 15 Aug 2020
Publication: 31 Aug 2020

More

Indexed in

IJCSE Indexed in Scopus

ABSTRACT

Title : WEKA FOR REDUCING HIGH - DIMENSIONAL BIG TEXT DATA
Authors : Kotonko Lumanga Manga Tresor, Professor Xu Dezhi
Keywords : Dimension Reduction; J48; WEKA; MATLAB
Issue Date : Oct-Nov 2018
Abstract :
In the current era, data usually has a high volume, variety, velocity, and veracity, these are known as 4 V’s of Big Data. Social media is considered as one of the main causes of Big Data which get the 4 V’s of Big Data beside that it has high dimensionality. To manipulate Big Data efficiently; its dimensionality should be decreased. Reducing dimensionality converts the data with high dimensionality into an expressive representation of data with lower dimensions. This research work deals with efficient Dimension Reduction processes to reduce the original dimension aimed at improving the speed of data mining. Spam-WEKA dataset; which entails twitter user information. The modified J48 classifier is applied to reduce the dimension of the data thereby increasing the accuracy rate of data mining. The data mining tool WEKA is used as an API of MATLAB to generate the J48 classifiers. Experimental results indicated a significant improvement over the existing J48 algorithm.
Page(s) : 124-129
ISSN : 0976-5166
Source : Vol. 9, No.5
PDF : Download
DOI : 10.21817/indjcse/2018/v9i5/180905016