Deep Web Query Interface Understanding and Integration (Paperback)

Eduard C. Dragut, Weiyi Meng, Clement T. Yu

相關主題

商品描述

There are millions of searchable data sources on the Web and to a large extent their contents can only be reached through their own query interfaces. There is an enormous interest in making the data in these sources easily accessible. There are primarily two general approaches to achieve this objective. The first is to surface the contents of these sources from the deep Web and add the contents to the index of regular search engines. The second is to integrate the searching capabilities of these sources and support integrated access to them. In this book, we introduce the state-of-the-art techniques for extracting, understanding, and integrating the query interfaces of deep Web data sources. These techniques are critical for producing an integrated query interface for each domain. The interface serves as the mediator for searching all data sources in the concerned domain. While query interface integration is only relevant for the deep Web integration approach, the extraction and understanding of query interfaces are critical for both deep Web exploration approaches.

This book aims to provide in-depth and comprehensive coverage of the key technologies needed to create high quality integrated query interfaces automatically. The following technical issues are discussed in detail in this book: query interface modeling, query interface extraction, query interface clustering, query interface matching, query interface attribute integration, and query interface integration.

Table of Contents: Introduction / Query Interface Representation and Extraction / Query Interface Clustering and Categorization / Query Interface Matching / Query Interface Attribute Integration / Query Interface Integration / Summary and Future Research

商品描述(中文翻譯)

網絡上有數百萬個可搜索的數據源,而這些數據源的內容往往只能通過它們自己的查詢界面進行訪問。人們對於使這些數據源的數據易於訪問非常感興趣。主要有兩種方法來實現這一目標。第一種方法是從深網中提取這些數據源的內容,並將其添加到常規搜索引擎的索引中。第二種方法是整合這些數據源的搜索功能,並支持對它們進行集成訪問。在本書中,我們介紹了提取、理解和整合深網數據源的查詢界面的最新技術。這些技術對於為每個領域創建集成查詢界面至關重要。該界面作為在相關領域中搜索所有數據源的中介。雖然查詢界面整合只與深網整合方法相關,但查詢界面的提取和理解對於兩種深網探索方法都至關重要。

本書旨在深入全面地介紹創建高質量集成查詢界面所需的關鍵技術。本書詳細討論了以下技術問題:查詢界面建模、查詢界面提取、查詢界面聚類、查詢界面匹配、查詢界面屬性整合和查詢界面整合。

目錄:引言 / 查詢界面表示和提取 / 查詢界面聚類和分類 / 查詢界面匹配 / 查詢界面屬性整合 / 查詢界面整合 / 總結和未來研究