A REVIEW ON EFFICIENT ANALYSIS OF BIG DATA
Keywords:
Apache Spark, Big Data, ICPAbstract
A colossal measure of information containing helpful data, called Big Data, is produced regularly. For
handling such gigantic volume of information, there is a need of Big Data structures, for example, Hadoop Map Reduce,
Apache Spark and so on. Among these, Apache Spark performs up to 100 circumstances speedier than traditional
systems like Hadoop Map reduce. we concentrate on the plan of partition grouping calculation and its execution on
Apache Spark. This paper presents a viable handling structure designated ICP (Image Cloud Processing) to capably
adapt to the information blast in picture handling field and we propose a partition based grouping calculation called
Scalable Random Sampling with Iterative Optimization Fuzzy c-Means calculation (SRSIO-FCM) which is executed on
Apache Spark to handle the difficulties connected with Big Data Clustering.