Elastic Stack

Elasticsearch Scoring Changes In Action

January 15, 2020

For those who started their journey with Elasticsearch before version 5.x, upgrading to newer versions such as 6.x or 7.x can bring many challenges: data type changes, index structure changes, deprecations, the move from the Transport client to the REST client, and so on. One change that can influence your system is the evolution of the scoring algorithm. In this blog post, we demonstrate the impact on relevance and scoring when Elasticsearch changes its algorithms under the hood between major versions. We will show it in practice, using a small dataset. If the scoring feature of Elasticsearch…

你们好 - Elasticsearch and the Chinese language

December 19, 2019

Today we are looking into Elasticsearch's language support for Chinese. Chinese is spoken by the ethnic Chinese majority and many minority ethnic groups in China. About 1.2 billion people (around 16% of the world's population) speak some form of Chinese as their first language. We are an international company, so having customers in Singapore or Hong Kong makes this topic especially interesting. Chinese consists of many dialects and mainly two written forms. In the first section, I will clarify which region uses which dialect and written form. After that, we look at what is supported by…

Mirror, Mirror on the Wall

November 11, 2019

Reading the title of this blog post, you will likely associate it with the fairy tale Snow White and the Seven Dwarfs. An association is a mental connection between two related terms. It is a creative process that the human brain is very good at. Another creative process is using synonyms.

Okapi BM25 with Game of Thrones

September 1, 2019

Since Elasticsearch 5, Okapi BM25 has been the default similarity algorithm. A similarity (scoring/ranking model) defines how matching documents are scored. Performing a search against a set of documents gives you results sorted by relevance. BM25 was already mentioned in one of our previous blog posts, by Rocco Schulz. In this blog article, we look into the inner workings of the Okapi BM25 algorithm.
