Natural Language Processing with Spark NLP: Learning to Understand Text at Scale

Thomas, Alex

  • Publisher: O'Reilly
  • Publication date: 2020-07-21
  • List price: $2,140
  • VIP price: $2,033 (5% off)
  • Language: English
  • Pages: 350
  • Binding: Quality Paper - also called trade paper
  • ISBN: 1492047767
  • ISBN-13: 9781492047766
  • Related categories: Spark, Text mining
  • Ships immediately (fewer than 3 in stock)




Want to build an application that uses natural language text, but aren't sure where to start or what tools to use? This practical book gets you started with natural language processing from the basics to powerful modern techniques. Data scientists will learn how to build enterprise-quality NLP applications using deep learning and the Apache Spark distributed processing framework.

This guide includes concrete examples, practical and theoretical explanations, and hands-on exercises for NLP on Spark. You'll understand why these techniques work from machine learning, linguistic, and practical points of view.

This book shows you how to:

  • Process text in a distributed environment using Spark NLP, a production-ready library for NLP built on Spark
  • Create, tune, and deploy your own word embeddings
  • Adapt your NLP applications to multiple languages
  • Use text in machine learning and deep learning


Alex Thomas is a data scientist at Indeed. He has used natural language processing (NLP) and machine learning with clinical data, identity data, and now employer and jobseeker data. He has worked with Apache Spark since version 0.9, and has worked with NLP libraries and frameworks including UIMA and OpenNLP.