透過交易新增來隱藏敏感的頻繁項目集 = Hiding Sensitive...
國立高雄大學資訊工程學系碩士班

 

  • 透過交易新增來隱藏敏感的頻繁項目集 = Hiding Sensitive Frequent Itemsets through Transaction Addition
  • 紀錄類型: 書目-語言資料,印刷品 : 單行本
    並列題名: Hiding Sensitive Frequent Itemsets through Transaction Addition
    作者: 張家境,
    其他團體作者: 國立高雄大學
    出版地: [高雄市]
    出版者: 撰者;
    出版年: 民100
    面頁冊數: 55葉部份彩圖,表格 : 30公分;
    標題: 資料探勘
    標題: Data mining
    電子資源: http://handle.ncl.edu.tw/11296/ndltd/92009588771079774289
    附註: 參考書目:葉43-45
    附註: 內容為英文
    摘要註: 資料探勘主要是用來從大量的資料中擷取出有用的知識,以用來幫助公司做出有效的決策。然而,在資料蒐集與數據傳播的過程中,卻可能引發隱私資料外洩風險。有關於個人、企業或組織的敏感資訊,在分享或是發佈時,應該以隱匿資訊的方式受到保護。也因此,隱私保護之探勘成為近年來重要的研究議題之一。在這篇論文裡,我們提出兩種透過新增交易資料的方法來隱藏敏感項目集。第一個方法以貪婪法為基礎,首先計算出可以用來完全的隱藏住敏感項目集的最大的新增交易筆數,再進行資料隱藏的動作。其中每筆新增交易資料的長度與新增的項目集取決於經驗法則裡的標準常態分佈,如此,也能進一步減少隱匿敏感項目時所產生的副作用。在第二個方法裡,我們提出一個演化式的隱私保護之探勘法,用來找出最適合的項目集以加入新增交易中,進一步得以隱藏住敏感項目集。此方法設計了三個變數來設計一個彈性的評估函數,並可根據使用者的喜好彈性地分配此三個變數的權重。此外,準大項目集的概念也被用來減少重新掃描資料庫的成本,以加快評估染色體的過程。最終,我們將透過實驗結果來評估演算法的效能。 Data mining technology is used to derive useful knowledge from large databases, thus being able to help make effective decision for companies. The process of data collection and dissemination may, however, cause the risk of privacy threats. Sensitive or personal information and knowledge of individuals, industries and organizations are required to be kept as private information before they are shared or published. Thus, the privacy-preserving data mining (PPDM) has become an important issue in recent years. In this thesis, two approaches for hiding sensitive itemsets by inserting new transactions are proposed. The first one is a greedy-based approach, which computes the maximal number of transactions to be inserted into the original database for totally hiding sensitive itemsets. The length of each inserted transaction and the itemsets within it are decided by some heuristic rules, such that the side effects from hiding sensitive itemsets could be reduced. The second one is an evolutionary privacy-preserving data mining method to find appropriate itemsets within inserted transactions for hiding sensitive itemsets. It uses a flexible evaluation function with three factors. Different weights are then assigned to three factors depending on users' preference. The concept of pre-large itemsets is also used in the GA-based approach to reduce the cost of rescanning databases, thus speeding up the evaluation process of chromosomes. Experimental results are finally performed to evaluate the performance of the proposed two approaches.
館藏
  • 2 筆 • 頁數 1 •
 
310002134669 博碩士論文區(二樓) 不外借資料 學位論文 TH 008M/0019 464103 1134 2011 一般使用(Normal) 在架 0
310002134651 博碩士論文區(二樓) 不外借資料 學位論文 TH 008M/0019 464103 1134 2011 c.2 一般使用(Normal) 在架 0
  • 2 筆 • 頁數 1 •
評論
Export
取書館別
 
 
變更密碼
登入