SYSTEM FOR SENTIMENT ANALYSIS OF BIG TEXT DATA

  • 1 Department of Computer Science – University of Chemical Technology and Metallurgy, Bulgaria

Abstract

The importance of Big Data and Big Data mining has grown significantly in recent years. Different kinds of e-sources, such as social networks, e-commerce sites, e-mails, and sensors, generate large amounts of structured and unstructured numerical and text data. This data provides valuable information about customers' preferences and their ratings of products or commodities. This information is essential for making predictions based on sentiment analysis of the data. Sentiment analysis of large amounts of text data requires specialized big data and machine learning (ML) libraries. This paper proposes the implementation of a system for big data sentiment analysis using ML algorithms. It is based on the Naïve Bayes and Support Vector Machine (SVM) classification algorithms for text analysis. The system is implemented in Java and uses the Apache Spark ML libraries, which are flexible, fast, and scalable. The system is tested on a well-known Amazon dataset, and its performance is measured in terms of accuracy. The obtained results confirm the effectiveness of the big data sentiment analysis algorithms. The system can be applied to recommending products and services or to predicting customers' needs.
