Instant Apache Solr for Indexing Data How-to

Alexandre Rafalovitch

  • Publisher: Packt Publishing
  • Publication date: 2013-06-18
  • List price: $940
  • VIP price: $893 (5% off)
  • Language: English
  • Pages: 90
  • Binding: Paperback
  • ISBN: 1782164847
  • ISBN-13: 9781782164845
  • Related categories: Full-text search engines
  • Ordered from the supplier once you place your order (about 3-4 weeks)

Product description

Nobody pretends indexing data with Apache Solr is a walk in the park, but this book eases the path with plain-language explanations and engaging projects. Perfect for developers with sophisticated indexing ambitions.

Overview

  • Learn something new in an Instant! A short, fast, focused guide delivering immediate results
  • Take the most basic schema and extend it to support multi-lingual, multi-field searches
  • Make Solr pull data from a variety of existing sources
  • Discover different pathways to acquire and normalize data and content

In Detail

Searching content and data is an important part of the modern user experience, and before something can be searched, it has to be indexed. Indexing is a hidden part of the process that has a surprisingly strong impact on the overall user experience. From speed to faceting to multilingual support, everything depends on correct indexing.

Instant Apache Solr for Indexing Data How-to is an example-driven guide that will take you on a journey from the basic collection of data to a multi-lingual, multi-field, multi-type schema. By the end of the book, you will know how to get your data ready for searches and how to tune the indexing process to support the search use cases you need.

Instant Apache Solr for Indexing Data How-to is a friendly, practical guide that will show you how to index your data with Solr. This book will explain how Solr’s basic blocks actually work and fit together. You will then explore additional settings, pipelines, and configuration changes to achieve ever more complex goals. Next, you will cover how to push data into Solr and when to get Solr to pull the data. Finally, you will master indexing textual and binary content before enabling multilingual content to be searched.

What you will learn from this book

  • Produce a basic Solr schema ready for experimentation and exploration
  • Run several collections on one Solr server
  • Import, search, and facet simple and multi-valued fields
  • Create your own field type analyzer chains for ultimate indexing flexibility
  • Detect, index, and partition multi-lingual content
  • Use CSV, XML, JSON, and binary formats to get data into Solr
  • Pull data from external files and databases using DataImportHandler
  • Write a Java client using the SolrJ library in both remote and embedded mode
  • Change data already indexed using atomic updates
  • Reshape incoming data with UpdateRequestProcessors
  • Control the visibility of data with soft and hard commits
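
To give a flavor of the last few topics, here is a minimal Python sketch (not taken from the book) of what an atomic update and a commit request might look like when pushing JSON to Solr's update handler. The host, port, collection name, document ID, and field name are all assumptions made for illustration; the sketch only builds the request URL and body and does not contact a server.

```python
import json
from urllib.parse import urlencode

# Hypothetical values for illustration only; adjust to your own installation.
SOLR_BASE = "http://localhost:8983/solr"
COLLECTION = "collection1"


def update_url(soft_commit=False, hard_commit=False):
    """Build the URL of the collection's /update handler with a commit flag."""
    params = {}
    if hard_commit:
        # Hard commit: persist pending changes to stable storage.
        params["commit"] = "true"
    elif soft_commit:
        # Soft commit: make pending changes searchable without a hard commit.
        params["softCommit"] = "true"
    query = "?" + urlencode(params) if params else ""
    return f"{SOLR_BASE}/{COLLECTION}/update{query}"


def atomic_set(doc_id, field, value):
    """Build a JSON body that replaces one field of an already indexed document."""
    # The {"set": value} modifier tells Solr to change only this field.
    return json.dumps([{"id": doc_id, field: {"set": value}}])


if __name__ == "__main__":
    print(update_url(soft_commit=True))
    print(atomic_set("book-1", "title", "Indexing Data How-to"))
```

The body would typically be sent as an HTTP POST with a `Content-Type: application/json` header. Whether an atomic update succeeds depends on the schema and configuration of the target collection, so treat this as a starting point for experimentation rather than a recipe.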

Approach

This book is filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks. It is written in a friendly, practical manner, with recipes covering important indexing techniques and methods using Apache Solr.

Who this book is written for

This book is for developers who want to dive deeper into Solr. Regardless of whether you are just starting with Solr or have already built your first collection by copying and modifying examples, this book will take you through the complicated steps of indexing your data with Solr.