FEATURE SUBSET SELECTION FOR HIGH DIMENSIONAL DATA BASED ON CLUSTERING

Prof. S.N.Zaware; Heena Shaikh; Sheefa Shaikh; Asmita Orpe; Pooja Rokade

Authors

Prof. S.N.Zaware, Computer Department, AISSMS IOIT Pune
Heena Shaikh, Computer Department, AISSMS IOIT Pune
Sheefa Shaikh, Computer Department, AISSMS IOIT Pune
Asmita Orpe, Computer Department, AISSMS IOIT Pune
Pooja Rokade, Computer Department, AISSMS IOIT Pune

Keywords:

Markov Blanket, MST Creation, Gaussian Distribution, Shannon Infogain, Bayesian Probability, Fuzzy Logic

Abstract

Feature selection is the process of evaluating data and extracting the desired features, grouped into subsets
that retain the integrity of the original data. A feature selection algorithm should be both efficient and effective: efficient
means that it requires minimal time, and effective means that the quality of the generated subset is not compromised. Our system
proposes an algorithm consisting of the following steps: Markov Blanket, Shannon Infogain, Minimum Spanning Tree, Tree
Partition, Gaussian Distribution, and Bayesian Probability. Applying these steps yields the desired subset from the clusters.
Our system removes irrelevant data as well as redundant data, which most existing systems fail to do.
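
To make three of the steps named above more concrete, the following is a minimal sketch of Shannon information gain, minimum spanning tree construction, and tree partition for discrete features. It is not the authors' implementation: the abstract does not say how the steps are combined, so the function names, the thresholds `min_gain` and `max_dist`, and the use of symmetric uncertainty as the feature-to-feature similarity are all assumptions made for illustration. The Markov Blanket, Gaussian Distribution, and Bayesian Probability steps are left out because the abstract gives no detail on them.

```python
import math
from collections import Counter

def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())

def info_gain(feature, target):
    """Information gain: H(target) - H(target | feature)."""
    n = len(target)
    groups = {}
    for f, t in zip(feature, target):
        groups.setdefault(f, []).append(t)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(target) - conditional

def symmetric_uncertainty(x, y):
    """Mutual information normalised to [0, 1] (assumed similarity measure)."""
    denom = entropy(x) + entropy(y)
    return 2.0 * info_gain(x, y) / denom if denom > 0 else 0.0

def minimum_spanning_tree(names, dist):
    """Prim's algorithm on the complete graph; returns (u, v, weight) edges."""
    if not names:
        return []
    in_tree, edges = [names[0]], []
    while len(in_tree) < len(names):
        u, v = min(
            ((a, b) for a in in_tree for b in names if b not in in_tree),
            key=lambda e: dist(e[0], e[1]),
        )
        edges.append((u, v, dist(u, v)))
        in_tree.append(v)
    return edges

def partition(names, edges, max_dist):
    """Cut tree edges heavier than max_dist; return the remaining components."""
    parent = {n: n for n in names}

    def find(n):
        while parent[n] != n:
            n = parent[n]
        return n

    for u, v, w in edges:
        if w <= max_dist:
            parent[find(u)] = find(v)
    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)
    return list(clusters.values())

def select_features(features, target, min_gain=0.05, max_dist=0.5):
    """Drop irrelevant features, cluster the rest, keep one per cluster."""
    gains = {name: info_gain(col, target) for name, col in features.items()}
    relevant = [n for n in features if gains[n] > min_gain]  # irrelevant removed

    def dist(a, b):
        return 1.0 - symmetric_uncertainty(features[a], features[b])

    tree = minimum_spanning_tree(relevant, dist)
    clusters = partition(relevant, tree, max_dist)
    # One representative per cluster removes redundant features.
    return [max(c, key=gains.get) for c in clusters]

# Toy data (hypothetical): "a_copy" is redundant with "a", "noise" is irrelevant.
features = {
    "a":      [0, 0, 1, 1, 0, 0, 1, 1],
    "a_copy": [1, 1, 0, 0, 1, 1, 0, 0],
    "b":      [0, 1, 0, 1, 0, 1, 0, 1],
    "noise":  [0, 0, 0, 0, 1, 1, 1, 1],
}
target = [0, 1, 2, 3, 0, 1, 2, 3]
selected = select_features(features, target)
print(selected)
```

In this toy run the irrelevant feature is filtered out by the information gain threshold, and the two mutually redundant features fall into one cluster from which a single representative is kept. Real high dimensional data would first need continuous features discretised, and the quadratic cost of the complete graph would have to be considered.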

Published

2015-12-25

Issue

Vol. 2 No. 12 (2015): Volume 2 Issue 12, December 2015

Section

Articles

How to Cite

FEATURE SUBSET SELECTION FOR HIGH DIMENSIONAL DATA BASED ON CLUSTERING. (2015). International Journal of Advance Engineering and Research Development (IJAERD), 2(12), 105-107. https://www.ijaerd.org/index.php/IJAERD/article/view/5271
