Department of Computer Science
Learning global models based on distributed data abstractions
Due to the increasing demand of massive and distributed data analysis, achieving highly accurate global data analysis results with local data privacy preserved becomes an increasingly important research issue. In this paper, we propose to adopt a model-based method (Gaussian mixture model) for local data abstraction and aggregate the local model parameters for learning global models. To support global model learning based on solely local GMM parameters instead of virtual data generated from the aggregated local model, a novel EM-like algorithm is derived. Experiments have been performed using synthetic datasets and the proposed method was demonstrated to be able to achieve the global model accuracy comparable to that of using the data regeneration approach at a much lower computational cost.
Source Publication Title
Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence
International Joint Conferences on Artificial Intelligence
This work has been partially supported by RGC Central Allocation Group Research Grant (HKBU 2/03/C).
Link to Publisher's Edition
Zhang, Xiaofeng, and William K. Cheung. "Learning global models based on distributed data abstractions." Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence (2005): 1645.