Classification and Feature Selection Approaches by Machine Learning Techniques: Heart Disease Prediction

Authors

  • N. Satish Chandra Reddy School of Computer Sciences, 11800, Universiti Sains Malaysia, Pulau Pinang, Malaysia
  • Song Shue Nee School of Computer Sciences, 11800, Universiti Sains Malaysia, Pulau Pinang, Malaysia
  • Lim Zhi Min School of Computer Sciences, 11800, Universiti Sains Malaysia, Pulau Pinang, Malaysia
  • Chew Xin Ying School of Computer Sciences, 11800, Universiti Sains Malaysia, Pulau Pinang, Malaysia

DOI:

https://doi.org/10.11113/ijic.v9n1.210

Keywords:

Machine Learning, Feature Selection, R tool, Classification, Prediction

Abstract

The heart disease has been one of the major causes of death worldwide. The heart disease diagnosis has been expensive nowadays, thus it is necessary to predict the risk of getting heart disease with selected features. The feature selection methods could be used as valuable techniques to reduce the cost of diagnosis by selecting the important attributes. The objectives of this study are to predict the classification model, and to know which selected features play a key role in the prediction of heart disease by using Cleveland and statlog project heart datasets. The accuracy of random forest algorithm both in classification and feature selection model has been observed to be 90–95% based on three different percentage splits. The 8 and 6 selected features seem to be the minimum feature requirements to build a better performance model. Whereby, further dropping of the 8 or 6 selected features may not lead to better performance for the prediction model.

Author Biographies

N. Satish Chandra Reddy, School of Computer Sciences, 11800, Universiti Sains Malaysia, Pulau Pinang, Malaysia

School of Computer Sciences

Song Shue Nee, School of Computer Sciences, 11800, Universiti Sains Malaysia, Pulau Pinang, Malaysia

School of Computer Sciences

Lim Zhi Min, School of Computer Sciences, 11800, Universiti Sains Malaysia, Pulau Pinang, Malaysia

School of Computer Sciences

Chew Xin Ying, School of Computer Sciences, 11800, Universiti Sains Malaysia, Pulau Pinang, Malaysia

School of Computer Sciences

Downloads

Published

2019-05-31

How to Cite

Chandra Reddy, N. S., Shue Nee, S., Zhi Min, L., & Xin Ying, C. (2019). Classification and Feature Selection Approaches by Machine Learning Techniques: Heart Disease Prediction. International Journal of Innovative Computing, 9(1). https://doi.org/10.11113/ijic.v9n1.210

Issue

Section

Computer Science