Data Mining Challenges in the Context of Data Retention
Data Mining Challenges in the Context of Data Retention
Retaining electronic communication and internet traffic data imposes novel technical and organisational challenges for internet service providers as well as for government authorities. ISP companies are not only burdened by storing extraordinary amounts of data, but also must develop and adhere to data protection and data security policies in order to protect the data against unauthorised access or disclosure and against accidental destruction. The authors present distributed, horizontally partitioned data warehouse architecture for retaining data at each internet service provider separately. Moreover, they elaborate a data warehouse schema for storing e-mail data according to the European data retention directive which facilitate parameterised data retrieval. The authors show how their system allows for applying various types of data mining techniques to both internet access and communication data. Finally, they discuss issues related to data security, cost and performance, and reveal limitations of data retention systems.
CITATION: Gansterer, Wilfried N.. Data Mining Challenges in the Context of Data Retention edited by Syvajarvi, Antti . Hershey, PA : IGI Global , 2010. Data Mining in Public and Private Sectors - Available at: https://library.au.int/data-mining-challenges-context-data-retention