Department of Computer Science
A privacy-aware service-oriented platform for distributed data mining
Customer data privacy is known to be a factor which makes just-in-time data sharing and mining among enterprises challenging. Learning-from-abstraction is a recently proposed paradigm for privacy preserving distributed data mining where distributed local data sources are protected by probabilistic data abstraction. In this paper, we investigate the use of a normalized negative log likelihood together with the paradigm for quantifying the level of privacy protection, and studied theoretically the change of the privacy levels of the local data abstractions after being aggregated for global data analysis. Experiments on distributed data clustering with a synthetic data set were conducted on a service-oriented BPEL platform. The promising results obtained demonstrates the effectiveness of the adopted privacy measure.
Data mining, Data privacy, Service oriented architecture, Protection, Data analysis, Communication system control, Covariance matrix, Computer science, Distributed computing, Medical services
Source Publication Title
Proceedings of the 8th IEEE International Conference on E-Commerce Technology and the 3rd IEEE International Conference on Enterprise Computing, E-Commerce, and E-Services (CEC/EEE’06)
San Francisco, United States
Copyright © 2006 by The Institute of Electrical and Electronics Engineers, Inc.
This work is partially supported by RGC Central Allocation HKBU 2/03C.
Link to Publisher's Edition
Zhang, Xiaofeng, Ho-fai Wong, and William K. Cheung. "A privacy-aware service-oriented platform for distributed data mining." Proceedings of the 8th IEEE International Conference on E-Commerce Technology and the 3rd IEEE International Conference on Enterprise Computing, E-Commerce, and E-Services (CEC/EEE’06) (2006).