同步操作将从 Hutool/elasticsearch-definitive-guide-cn 强制同步,此操作会覆盖自 Fork 仓库以来所做的任何修改,且无法恢复!!!
确定后同步将在后台操作,完成时将刷新页面,请耐心等待。
Elasticsearch ships with a collection of language analyzers which provide good, basic, out-of-the-box support for a number of the world’s most common languages:
Arabic, Armenian, Basque, Brazilian, Bulgarian, Catalan, Chinese, Czech, Danish, Dutch, English, Finnish, French, Galician, German, Greek, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Korean, Kurdish, Norwegian, Persian, Portuguese, Romanian, Russian, Spanish, Swedish, Turkish, and Thai.
These analyzers typically perform four roles:
Tokenize text into individual words:
The quick brown foxes
→ [The
, quick
, brown
, foxes
]
Lowercase tokens:
The
→ the
Remove common stopwords:
[`The`, quick
, brown
, foxes
] → [quick
, brown
, foxes
]
Stem tokens to their root form:
foxes
→ fox
Each analyzer may also apply other transformations specific to its language in order to make words from that language more searchable:
the english
analyzer removes the possessive 's
:
John’s
→ john
the french
analyzer removes elisions like l'
and qu'
and
diactrics like ¨
or ^
:
l’église
→ eglis
the german
analyzer normalizes terms, replacing ä
and ae
with a
, or
ß
with ss
, among others:
äußerst
→ ausserst
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。