The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data

Ronen Feldman, James Sanger

  • 出版商: Cambridge
  • 出版日期: 2006-12-11
  • 售價: $1,400
  • 貴賓價: 9.8$1,372
  • 語言: 英文
  • 頁數: 424
  • 裝訂: Hardcover
  • ISBN: 0521836573
  • ISBN-13: 9780521836579
  • 相關分類: Text-mining 文字探勘





Text mining is a new and exciting area of computer science research that tries to solve the crisis of information overload by combining techniques from data mining, machine learning, natural language processing, information retrieval, and knowledge management. Similarly, link detection – a rapidly evolving approach to the analysis of text that shares and builds upon many of the key elements of text mining – also provides new tools for people to better leverage their burgeoning textual data resources. The Text Mining Handbook presents a comprehensive discussion of the state-of-the-art in text mining and link detection. In addition to providing an in-depth examination of core text mining and link detection algorithms and operations, the book examines advanced pre-processing techniques, knowledge representation considerations, and visualization approaches. Finally, the book explores current real-world, mission-critical applications of text mining and link detection in such varied fields as M&A business intelligence, genomics research and counter-terrorism activities.

• The first comprehensive compilation of algorithms, methodologies, practical approaches and applications

 • Co-authored by one of the founding figures in the field of text mining

 • Detailed description of core text mining algorithms for identifying patterns such as frequent sets, distributions and proportions and associations

Table of Contents

1. Introduction to text mining; 2. Core text mining operations; 3. Text mining preprocessing techniques; 4. Categorization; 5. Clustering; 6. Information extraction; 7. Probabilistic models for Information extraction; 8. Preprocessing applications using probabilistic and hybrid approaches; 9. Presentation-layer considerations for browsing and query refinement; 10. Visualization approaches; 11. Link analysis; 12. Text mining applications; Appendix; Bibliography.