Modeling the Internet and the Web: Probabilistic Methods and Algorithms (Hardcover)
Pierre Baldi, Paolo Frasconi, Padhraic Smyth
- 出版商: Wiley
- 出版日期: 2003-07-07
- 售價: $1,007
- 語言: 英文
- 頁數: 306
- 裝訂: Hardcover
- ISBN: 0470849061
- ISBN-13: 9780470849064
-
相關分類:
Algorithms-data-structures
已絕版
買這商品的人也買了...
-
$1,200$1,176 -
$749Disappearing Cryptography, 2/e
-
$450$360 -
$990Mining the Web: Discovering Knowledge for Hypertext Data
-
$680$537 -
$650$514 -
$680$537 -
$590$466 -
$690$538 -
$780$616 -
$640$576 -
$690$587 -
$620$490 -
$580$458 -
$750$675 -
$490$382 -
$560$504 -
$720$569 -
$720$569 -
$860$679 -
$550$468 -
$390$304 -
$880$695 -
$480$379 -
$1,058Software Engineering: A Practitioner's Approach, 6/e
商品描述
- Provides a comprehensive introduction to the modeling of the Internet and Web at the information level.
- Takes a modern approach based on mathematical, probabilistic and graphical modeling.
- Provides an integrated presentation of theory, examples, exercies and applications.
- Covers key topics such as text analysis, link analysis, crawling techniques, human behaviour, and commerce on the Web.
Interdisciplinary in nature, Modeling the Internet and the Web will be of interest to students and researchers from a variety of disciplines including computer science, machine learning, engineering, statistics, economics, business and the social sciences.
Table of Contents
Preface.
1 Mathematical Background.
1.1 Probability and Learning from a
Bayesian Perspective.
1.2 Parameter Estimation from Data.
1.3 Mixture
Models and the Expectation Maximization Algorithm.
1.4 Graphical
Models.
1.5 Classification.
1.6 Clustering.
1.7 Power-Law
Distributions.
1.8 Exercises.
2 Basic WWW Technologies.
2.1 Web Documents.
2.2 Resource
Identifiers: URI, URL, and URN.
2.3 Protocols.
2.4 Log Files.
2.5
Search Engines.
2.6 Exercises.
3 Web Graphs.
3.1 Internet and Web Graphs.
3.2 Generative Models
for theWeb Graph and Other Networks.
3.3 Applications.
3.4 Notes and
Additional Technical References.
3.5 Exercises.
4 Text Analysis.
4.1 Indexing.
4.2 Lexical Processing.
4.3
Content-Based Ranking.
4.4 Probabilistic Retrieval.
4.5 Latent Semantic
Analysis.
4.6 Text Categorization.
4.7 Exploiting Hyperlinks. 4.8
Document Clustering.
4.9 Information Extraction.
4.10 Exercises.
5 Link Analysis.
5.1 Early Approaches to Link Analysis.
5.2
Nonnegative Matrices and Dominant Eigenvectors.
5.3 Hubs and Authorities:
HITS.
5.4 PageRank.
5.5 Stability.
5.6 Probabilistic Link
Analysis.
5.7 Limitations of Link Analysis.
6 Advanced Crawling Techniques.
6.1 Selective Crawling.
6.2
Focused Crawling.
6.3 Distributed Crawling.
6.4 Web Dynamics.
7 Modeling and Understanding Human Behavior on the Web.
7.1
Introduction.
7.2 Web Data and Measurement Issues.
7.3 Empirical
Client-Side Studies of Browsing Behavior.
7.4 Probabilistic Models of
Browsing Behavior.
7.5 Modeling and Understanding Search Engine
Querying.
7.6 Exercises.
8 Commerce on the Web: Models and Applications.
8.1
Introduction.
8.2 Customer Data on theWeb.
8.3 Automated Recommender
Systems.
8.4 Networks and Recommendations.
8.5 Web Path Analysis for
Purchase Prediction.
8.6 Exercises.
Appendix A Mathematical Complements.
A.1 Graph Theory.
A.2
Distributions.
A.3 Singular Value Decomposition.
A.4 Markov Chains.
A.5
Information Theory.
Appendix B List of Main Symbols and Abbreviations.
References.
Index.