標題: 一個具使用者圖形介面之網頁資料萃取系統
A GUI Based Enviroment For Web Data Extraction
作者: 黃敬堯
John Huang
吳毅成
I-Chen Wu
資訊學院資訊學程
關鍵字: 網頁資料萃取;Web data extraction
公開日期: 2000
摘要: 網際網路與電子商務的快速發展,給企業以及個人均帶來了許多的好處,人們花費在上網瀏覽網頁的時間也越來越長。但是有時候卻感覺到迷失在眾多各式各樣不同的資訊之中,因此必須有一套可以快速且有系統的幫使用者蒐集資訊的機制。 一些網頁查詢語言,例如XML-QL、WIDL、GIDL,就是這樣用來提供使用者自動化萃取網頁的工具,但是針對設計上太複雜的網頁,要很快寫出其查詢語法卻有困難。因此本論文以GIDL網頁查詢語言為基礎,提出並設計一套使用者易於上手的網頁資料萃取系統。有了這個結合圖形操作介面的資料萃取工具,可以加速網頁資料之萃取與蒐集,並且提昇網頁資料萃取之效率與降低人力上的成本。
The rapid growth of Internet and Electronic-Commerce has the potential to provide enormous benefits to business and consumers. People spending more and more time navigating the Web sites. But some times they feel they were lost when dealing with large amount of data and information. There must be a mechanism to help people acquiring data more systematically and much more quickly. The Web query languages such as XML-QL, WIDL, GIDL provide users with collecting data from the Web server automatically. But it is difficult to write sophisticated syntax for specific Web sites. In this thesis we introduce a user-friendly tool based on GIDL for Web data extraction. The GUI based Web data extraction tool will accelerating the process of extracting data from the Web. The utility will also increase the performance for Web data extraction and reduce the cost.
URI: http://140.113.39.130/cdrfb3/record/nctu/#NT891706008
http://hdl.handle.net/11536/68031
Appears in Collections:Thesis