ANALYSING BIGDATA WITH CONTEXT TO HASH PARTITION USING METADATA
Keywords:
Big Data; Hive; MySQL; Pig; Partitioning; Bucketing; Hadoop framework.
Abstract
Streaming data analysis has attracted attention in applications such as financial records,
data analytics, and so forth. Such applications require continuous storage of large amounts of
data in a data warehouse while at the same time providing quick response times to queries against
the data stored in the system. The time taken to retrieve data varies depending on the kind of data
requested from the system. This paper presents performance estimates for MySQL partitioning, Hive
partitioning and bucketing, and the Apache Pig framework. The paper also describes big data ecosystems and a comparative
performance analysis of frequently used data retrieval techniques such as MySQL, Hive and Pig. From the work
presented in the paper, it is concluded that the execution time for extracting data becomes large as
the data size grows, especially in the case of MySQL. Compared with
MySQL, Hive and Pig take less time and give better results.