Beta
147564

A Proposed Model to Allow Data Mining Classification Avoiding Privacy Concerns

Article

Last updated: 27 Dec 2024

Subjects

-

Tags

-

Abstract

Data Mining aims to discover hidden facts that exist in the databases and data
warehouses. The discovered data should not reveal secrets that are considered
private for individuals or groups. In recent years, there have been privacy concerns
over the increase of gathering personal data by various institutions and merchants
over the Internet. There has been increasing interest in the problem of building
accurate data mining models over aggregate data while protecting privacy at the
level of individual records. One approach for this problem is to randomize the
values in individual records, and only disclose the randomized values. This method
is able to retain privacy while accessing the information implicit in the original
attributes. The distribution of the original data set is important and estimating it is
one of the goals of the data mining algorithms.
This paper introduces the privacy concerns and the obvious conflict between
privacy and data mining. Then, two approaches to resolve this conflict are
introduced, namely: the randomization approach and the cryptographic approach.
We consider the case of performing data mining classification for randomized
data. Two proposed algorithms for data mining classification of randomized data
,with high accuracy compared to classification algorithms for non perturbed data,
based on Bayes rules will be introduced (Step-Class, and Global-Decision).
These two algorithms are experimentally tested to measure the classification
accuracy of each of them. Our empirical results show that the Step-Class algorithm
has better performance results (classification accuracy ratio) than the Global
decision algorithm.

DOI

10.21608/asc.2007.147564

Keywords

Knowledge Discovery and Data Mining (KDDM), Bayes classifiers, privacy

Volume

1

Article Issue

1

Related Issue

21708

Issue Date

2007-06-01

Receive Date

2021-02-10

Publish Date

2007-06-01

Page Start

95

Page End

118

Print ISSN

1687-8515

Online ISSN

2682-3578

Link

https://asc.journals.ekb.eg/article_147564.html

Detail API

https://asc.journals.ekb.eg/service?article_code=147564

Order

8

Type

Original Article

Type Code

1,549

Publication Type

Journal

Publication Title

Journal of the ACS Advances in Computer Science

Publication Link

https://asc.journals.ekb.eg/

MainTitle

A Proposed Model to Allow Data Mining Classification Avoiding Privacy Concerns

Details

Type

Article

Created At

23 Jan 2023