A SURVEY ON HADOOP HBASE SYSTEM
Keywords:
Hbase, HDFS, Zookeper, Hadoop, Database, Communication.Abstract
In this paper, we will discuss that how HBase works and how it is better than RDBMS. Actually HBase is a distributed column oriented database in which the data can be fetched randomly. Apparently, it provides a very high performance, low latency access to single rows from billions of records and manages fault tolerance. Access patterns are well known and simple. It sits on top of HDFS(hadoop distributed file system). It deals with enormous tables. It stores data in indexed HDFS files for faster lookups. Though, hbase is a column oriented database but the data in it are sorted by row.It also includes the architecture comprising of master server, region, region servers and zookeeper. Moreover,HBase contains a shell in which we can communicate with HBase. There are some general commands for communication as well as for certain operations. Also, there is an admin API in which communication is done with HBase and manages HBase. It is extremely useful for heavy read- write and to access the randomly large data. Companies such as facebook, Yahoo etc use HBase internally.