Taming Text Book (code)

Code project for the book

Apache-licensed Java code for using Solr and OpenNLP is at https://github.com/tamingtext/book

Taming Text, by Grant Ingersoll, Thomas Morton and Drew Farris is designed to teach software engineers the basic concepts of working with text to solve search and Natural Language Processing problems. The book focuses on teaching using existing open source libraries like Apache Solr, Apache Mahout and Apache OpenNLP to manipulate text.  To learn more, visit http://www.manning.com/ingersoll.
RELATED ARTICLESExplain
OpenSherlock Project
Resources
Harvesting Process Support
Taming Text Book (code)
HTML Processing
NLP - Natural Language Processing
Topic Modeling
Word Meaning Analysis
ACE - Automatic Content Extraction
Berkeley Data Analytics Stack (BDAS)
Domeo Annotation Toolkit
FreeEed Open-source eDiscovery engine
H2O Big Data Prediction Engine
LanguageTool Style and Grammar Checker
Lingpipe
Link Grammar Parser
nlp2rdf
OpenDMAP
OpenSextant
RelEx Dependency Relationship Extractor
ReVerb (Github)
SIREn Semantic Information Retrieval Engine
SketchEngine
Triplify
Graph of this discussion
Enter the title of your article


Enter a short (max 500 characters) summation of your article
Enter the main body of your article
Lock
+Comments (0)
+Citations (0)
+About
Enter comment

Select article text to quote
welcome text

First name   Last name 

Email

Skip