標題: 利用FID3於網站登入資料分析
Web Log-File Data Mining using FID3
作者: 許書淵
Shu-Yuan Hsu
張志永
Jyh-Yeong Chang
電控工程研究所
關鍵字: 網站登入資料;Data Mining;FID3
公開日期: 2001
摘要: 本篇論文應用FID3於網站紀錄之資料探勘(data mining)的分析系統。我們選取一個B2C電子商務型網站的連線資料(log-file)和使用者的資料做為資料探勘的資料庫,並且開發一個關於網站商品內容的資料探勘分析系統(Web log-file mining system)。這一個資料探勘分析系統的架構分成三個步驟:第一步為資料準備(data preparation),主要的負責將連線資料和使用者資料由文字模式轉換成資料庫並且除去多餘不必要的資料,第二步為資料引擎(data engine),這一部份為資料探勘的核心,其中包括建立各個資料庫的連結和執行資料探勘演算法,且根據知識庫(knowledge base)所提供的資料來將資料引擎所分析而得的結果轉換成規則(rules)。而第三步為資料分析(data analysis),此一階段是藉由這一套系統所得出的規則可做出兩種應用。第一種,我們可以針對電子商務的業者一些建構網站和維護的依據,以增進電子商務的實質效益。第二種,我們可針對網站登入使用者的背景,來提出針對其個人最佳的路徑,增加使用者於瀏覽此網站的效率。
The goal of data mining process is knowledge discovery. This thesis applies Fuzzy ID3 to develop a web log-file mining system to analyze the log-file and users’ profile of a B2C website. The web log-file mining system can be divided into three components: data preparation, data engine, and data analysis. The function of data preparation is to convert the log-file from text file to ACCESS database and remove all of the redundance in log-file. The data engine, the kernel of data mining process, is designed to combine the metadata of pages, log-file, and users’ profile. Then the combined database can be the input patterns of fuzzy ID3. The third module of web log-file mining system is data analysis and the data analysis is the final procedure of data mining process. Based on the decision tree constructed by fuzzy ID3, the system will build the fuzzy “IF-THEN” rules and extract information from them. According to these fuzzy rules, the system can realize two applications. One is to provide information about the behavior of the user for web master to maintain and promote the web site. One is to suggest the better browsing path to the user who visits the web site.
URI: http://140.113.39.130/cdrfb3/record/nctu/#NT900591070
http://hdl.handle.net/11536/69439
顯示於類別:畢業論文