A text search solution is only as powerful as the text analysis capabilities it offers. Lucene is an open source information retrieval library that offers many text analysis options. In this post, we will cover some of the main text analysis features offered by ElasticSearch, which is built on Lucene, that you can use to enrich your search content.
Content Enrichment
Take a typical eCommerce site as an example: serving the right content in search results to the end customer is very important for the business. The text analysis strategy provided by a search solution plays a big role in this. As a search user, I would like some typical search behavior to be applied to my query automatically. The search:
- should look for synonyms matching my query text
- should match singular and plural words, or words that sound similar to the entered query text
- should not allow searching on protected words
- should allow search for words mixed with numeric or special characters
- should not allow search on HTML tags
- should match text based on the proximity of the letters and the number of matching letters
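To make a few of these expectations concrete, here is a minimal sketch of how they might map onto index settings and a query. It only builds the JSON bodies as Python dictionaries and prints them; it does not talk to a cluster. The analyzer name `product_text`, the filter names, the `title` field, and the sample synonyms are all hypothetical and chosen for illustration. The sketch covers only part of the list: HTML stripping (the built-in `html_strip` character filter), synonyms (a `synonym` token filter), singular and plural matching (a `stemmer` token filter), and tolerance for a few wrong letters (the `fuzziness` option of a `match` query).

```python
import json


def build_index_settings():
    """Return a hypothetical index body with a custom analyzer.

    All names below (product_text, product_synonyms, english_stemmer,
    title) are illustrative assumptions, not taken from a real index.
    """
    return {
        "settings": {
            "analysis": {
                "filter": {
                    # Treat the listed words as equivalent at analysis time.
                    "product_synonyms": {
                        "type": "synonym",
                        "synonyms": ["tv, television", "laptop, notebook"],
                    },
                    # Reduce words to a stem so "shoes" can match "shoe".
                    "english_stemmer": {
                        "type": "stemmer",
                        "language": "english",
                    },
                },
                "analyzer": {
                    "product_text": {
                        "type": "custom",
                        # Strip HTML tags before tokenizing.
                        "char_filter": ["html_strip"],
                        "tokenizer": "standard",
                        "filter": [
                            "lowercase",
                            "product_synonyms",
                            "english_stemmer",
                        ],
                    }
                },
            }
        },
        "mappings": {
            "properties": {
                "title": {"type": "text", "analyzer": "product_text"}
            }
        },
    }


def build_fuzzy_query(text, field="title"):
    """Return a match query that tolerates small spelling differences."""
    return {
        "query": {
            "match": {
                field: {"query": text, "fuzziness": "AUTO"}
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(build_index_settings(), indent=2))
    print(json.dumps(build_fuzzy_query("televsion"), indent=2))
```

The first body would be sent when creating the index and the second when searching. Doing the synonym and stemming work in the analyzer, and the fuzzy matching at query time, is one possible split and not the only one.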