Profile
International Journal of Computer & Software Engineering Volume 2 (2017), Article ID 2:IJCSE-125, 15 pages
https://doi.org/10.15344/2456-4451/2017/125
Review Article
Big Data Tools-An Overview

Rabie A. Ramadan

Computer Engineering Department, Cairo University, Giza, Egypt
Prof. Rabie A. Ramadan, Computer Engineering Department, Cairo University, Giza, Egypt; E-mail: rabie@rabieramadan.org
17 July 2017; 27 December 2017; 29 December 2017
Ramadan RA (2017) Big Data Tools – An Overview. Int J Comput Softw Eng 2: 125. doi: https://doi.org/10.15344/2456-4451/2017/125

Abstract

With the increasing of data to be analyzed either in social media, industry applications, or even science, there is a need for nontraditional methods for data analysis. Big data is a way for nontraditional strategies and techniques to organize, store, and process huge data collected from large datasets. Large dataset, in this context, means too large data that cannot be handled, stored, or processed using traditional tools and techniques or one computer. Therefore, there is a challenge to come up with different analytical approaches to analyze massive scale heterogeneous data coming with high speed. Consequently, big data has some characteristics that makes it different from any other data which are Veracity, Volume, Variety, Velocity, and Value. Veracity means variety of resources while Variety means data from different sources. Big data Value characteristic is one of the ultimate challenge that could be complex enough to be stored, extracted, and processed. The Volume deals with the size of the data and required storage while Velocity is related to data streaming time and latency. Throughout this paper, we review the state-of-the-art of big data tools. For the benefits of researchers, industry and practitioners, we review a large number of tools either commercial or free tools. The article also shows some of the important characteristics of the collected tools to make it easy on organizations and scientists to select the best tool for the type of data to be handled.