Classification and Prediction of Diabetics using Weka and Hive Tool

Authors

  • Dr.C.S.Kanimozhi Selvi Department of Computer Science and Engineering Kongu Engineering College, Perundurai, Erode-638060, Tamilnad, India
  • Dr.S.V.Kogilavani Department of Computer Science and Engineering Kongu Engineering College, Perundurai, Erode-638060, Tamilnad, India
  • Dr.S.Malliga Department of Computer Science and Engineering Kongu Engineering College, Perundurai, Erode-638060, Tamilnad, India
  • D.Jayaprakash Department of Computer Science and Engineering Kongu Engineering College, Perundurai, Erode-638060, Tamilnad, India

Keywords:

Classification, Diabetics, Big data, Hadoop, Hive

Abstract

Data mining is one such field which tries to extract some interesting facts from huge data set. Since many
years ago, the scientific community is concerned about how to increase the accuracy of different classification methods,
and major achievements have been made so far. Besides this issue, the increasing amount of data that is being generated
every day by various data logging methods raises more challenges. The objective of this project is to train historical
diabetes data and classifies them. Classifiers like Naïve Bayes, Decision Tree, Decision Stump, K star and Random forest
algorithms are trained very efficiently in a supervised learning setting. Though the chosen dataset is small in size, to
learn the big data tools and to find the effectiveness of the tools on a small data set Hadoop and Hive is used in this
paper.

Published

2018-04-25

How to Cite

Dr.C.S.Kanimozhi Selvi, Dr.S.V.Kogilavani, Dr.S.Malliga, & D.Jayaprakash. (2018). Classification and Prediction of Diabetics using Weka and Hive Tool. International Journal of Advance Engineering and Research Development (IJAERD), 5(4), 105–111. Retrieved from https://www.ijaerd.org/index.php/IJAERD/article/view/5505