Apache Solr Beginner's Guide(Paperback)

Alfredo Serafini

  • 出版商: Packt Publishing
  • 出版日期: 2013-12-28
  • 售價: $1,710
  • 貴賓價: 9.5$1,625
  • 語言: 英文
  • 頁數: 324
  • 裝訂: Paperback
  • ISBN: 1782162526
  • ISBN-13: 9781782162520
  • 相關分類: 全文搜尋引擎 Full-text-search
  • 海外代購書籍(需單獨結帳)

買這商品的人也買了...

商品描述

Where do you start with Apache Soir? We'd suggest with this book, which assumes no prior knowledge and takes you step by careful step through all the essentials, putting you on the road towards successful implementation.

Overview

  • Learn to use Solr in real-world contexts, even if you are not a programmer, using simple configuration examples
  • Define simple configurations for searching data in several ways in your specific context, from suggestions to advanced faceted navigation
  • Teaches you in an easy-to-follow style, full of examples, illustrations, and tips to suit the demands of beginners

In Detail

With over 40 billion web pages, the importance of optimizing a search engine's performance is essential.

Solr is an open source enterprise search platform from the Apache Lucene project. Full-text search, faceted search, hit highlighting, dynamic clustering, database integration, and rich document handling are just some of its many features. Solr is highly scalable thanks to its distributed search and index replication.

Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Apache Tomcat or Jetty. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it usable with most popular programming languages. Solr's powerful external configuration allows it to be tailored to many types of application without Java coding, and it has a plugin architecture to support more advanced customization.

With "Apache Solr Beginner's Guide" you will learn how to configure your own search engine experience. Using real data as an example, you will have the chance to start writing step-by-step, simple, real-world configurations and understand when and where to adopt this technology.

"Apache Solr Beginner's Guide" will start by letting you explore a simple search over real data. You will then go through a step-by-step description that gives you the chance to explore several practical features. At the end of the book you will see how Solr is used in different real-world contexts.

Using data from public domains like DBpedia, you will define several different configurations, exploring some of the most interesting Solr features, such as faceted search and navigation, auto-suggestion, and rich document indexing. You will see how to configure different analysers for handling different data types, without programming.

You will learn the basics of Solr, focusing on real-world examples and practical configurations.

What you will learn from this book

  • Understand what is full-text search and a faceted navigation are and when to use them
  • Install and use Solr for testing
  • Write your own configurations for the Solr index incrementally and test them with the Solr web UI
  • Learn how to test a Solr running instance using cURL with different formats, like XML, JSON, and so on
  • Construe your data and define the entities to be indexed in Solr
  • Examine text and make auto-suggestions
  • Index data using various formats and various data sources, and learn how to expose data in various formats
  • Start using Solr in contexts like Open Data and Linked Data
  • Use Solr for expanding your data with resources from public, well-known knowledge bases

Approach

Written in a friendly, example-driven format, the book includes plenty of step-by-step instructions and examples that are designed to help you get started with Apache Solr.

商品描述(中文翻譯)

如何開始使用Apache Solr?我們建議您從這本書開始,它不需要任何先備知識,並且逐步引導您了解所有基本知識,讓您成功實施。

概述

- 學習在現實世界中使用Solr,即使您不是一名程式設計師,也可以使用簡單的配置示例。
- 在您特定的情境中,使用簡單的配置方式以多種方式搜索數據,從建議到高級分面導航。
- 以易於遵循的風格進行教學,充滿了例子、插圖和技巧,以滿足初學者的需求。

詳細內容

隨著超過400億個網頁,優化搜索引擎的性能至關重要。

Solr是Apache Lucene項目的開源企業搜索平台。全文搜索、分面搜索、高亮顯示、動態分群、數據庫集成和豐富的文檔處理只是其中的一些功能。由於其分佈式搜索和索引複製,Solr具有高度可擴展性。

Solr使用Java編寫,作為獨立的全文搜索服務器運行在像Apache Tomcat或Jetty這樣的Servlet容器中。Solr在其核心使用Lucene Java搜索庫進行全文索引和搜索,並具有類似REST的HTTP/XML和JSON API,可與大多數流行的編程語言一起使用。Solr強大的外部配置使其能夠在不需要Java編程的情況下適應多種應用類型,並且具有插件架構以支持更高級的自定義。

通過《Apache Solr初學者指南》,您將學習如何配置自己的搜索引擎體驗。通過使用真實數據作為示例,您將有機會開始逐步編寫簡單、實用的現實世界配置,並了解何時何地應用這項技術。

《Apache Solr初學者指南》將讓您探索對真實數據進行簡單搜索。然後,您將進行逐步描述,有機會探索幾個實用功能。在本書結尾,您將看到Solr在不同現實世界情境中的應用。

使用來自公共領域(如DBpedia)的數據,您將定義幾種不同的配置,探索一些最有趣的Solr功能,例如分面搜索和導航、自動建議和豐富文檔索引。您將了解如何配置不同的分析器來處理不同的數據類型,無需編程。

您將學習Solr的基礎知識,重點放在現實世界的例子和實用配置上。

從本書中您將學到:

- 瞭解全文搜索和分面導航的概念,以及何時使用它們。
- 安裝和使用Solr進行測試。
- 逐步編寫自己的Solr索引配置,並使用Solr Web UI進行測試。
- 學習如何使用cURL以不同格式(如XML、JSON等)測試運行中的Solr實例。
- 解釋數據並定義要在Solr中索引的實體。
- 檢查文本並進行自動建議。
- 使用各種格式和數據源索引數據,並學習如何以各種格式公開數據。
- 開始在開放數據和關聯數據等情境中使用Solr。
- 使用Solr從公共知識庫擴展您的數據。

這本書以友好的、以例子為驅動的格式撰寫,包含大量的逐步指示和例子,旨在幫助您開始使用Apache Solr。