A REVIEW OF ADAPTIVE REPLICATION MANAGEMENT IN HDFS BASED ON SUPERVISED LEARNING
Keywords:
Replication, HDFS, Proactive Prediction, Optimization, Bayesian Learning, Gaussian ProcessAbstract
Apache Hadoop is very popular now-a-days. Hadoop Distributed file system (HDFS) is that the Heart of
Apache Hadoop that is reliable and extremely accessible. At the heart of Apache Hadoop, the Hadoop Distributed file
system (HDFS) provides the dependability and high availability for computation by applying a static replication by
default. This paper explains the dynamic approach to replicate data Files supported supervised Learning. Hadoop, an
open source implementation of the MapReduce framework, has been wide used for process massive-scale information in
parallel. Since Hadoop uses a distributed file system, referred to as HDFS, the data locality drawback usually happens)
and this drawback leads to the decrease in performance.