Comparative Study on Perturbation Techniques in Privacy Preserving Data Mining on Two Numeric Datasets

Authors

  • Desmond Ko Khang Siang Faculty of Computing Universiti Teknologi Malaysia 81310 UTM Johor Bahru, Johor, Malaysia
  • Siti Hajar Othman Faculty of Computing Universiti Teknologi Malaysia 81310 UTM Johor Bahru, Johor, Malaysia
  • Raja Zahilah Raja Mohd Radzi Faculty of Computing Universiti Teknologi Malaysia 81310 UTM Johor Bahru, Johor, Malaysia

DOI:

https://doi.org/10.11113/ijic.v8n1.161

Abstract

Data Mining is a computational process that able to identify patterns, trends and behaviour from large datasets. With this advantages, data mining has been applied in many fields such as finance, healthcare, retail and so on. However, information disclosure become one of an issue during data mining process. Therefore, privacy protection is needed during data mining process which known as Privacy Preserving Data Mining (PPDM). There are several techniques available in PPDM and each of the techniques has its’ own benefits and drawbacks. In this research, perturbation technique is selected as privacy preserving technique. Perturbation technique is a method that alters the original data value before the application of data mining. In PPDM applications, perturbation technique able to provide a protection of data privacy but the accuracy of data should not be ignored too. In this research, three perturbation techniques are selected which are additive noise, data swapping and resample. For data mining techniques, two methods of classification are selected which are Naïve Bayes and Support Vector Machines (SVM). With the selection of these techniques, the experimental results are evaluated based on the hiding failure, accuracy and precision. For overall result, resample is selected as the best perturbation technique in naïve bayes and SVM classification for both glass and ionosphere datasets.

Downloads

Published

2018-05-21

How to Cite

Ko Khang Siang, D., Othman, S. H., & Raja Mohd Radzi, R. Z. (2018). Comparative Study on Perturbation Techniques in Privacy Preserving Data Mining on Two Numeric Datasets. International Journal of Innovative Computing, 8(1). https://doi.org/10.11113/ijic.v8n1.161

Issue

Section

Computer Science