You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We fixed the elision/contraction management for French, Italian and Catalan when indexing/searching content.
Historically the text analyzers for those languages were supposed to ignore the elision/contraction of some articles or pronouns but didn't due to an improper order of text transformation components.
This could be problematic if you had a Minimum Should Match lower than 100%
For instance :
in French: l'avion (mandatory form of "le avion" - "the plane") was supposed to be indexed as avion (plane) but was still indexed as l avion, so searching for l'avion bleu vole could match l'écureuil bleu vole with a 75% minimum should match (matches on l, bleu and vole)
in Italian: comedia dell'arte was supposed to be indexed as comedia arte but was indexed as comedia dell arte
in Catalan: l'avi (the grandfather) was supposed to be indexed as avi (grandfather) but was indexed as l avi
This release fixes that issues and now the following elisions/contractions will be removed as intended :
Should this changes affect you negatively, please be aware that you can override those settings (as all of those in the elasticsuite_analysis.xml configuration file) in a custom module.
Feel also free to let us know, especially about the configuration Italian and Catalan since we are not native speakers and we are just following official Elasticsearch recommendations.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Main changes
We fixed the elision/contraction management for French, Italian and Catalan when indexing/searching content.
Historically the text analyzers for those languages were supposed to ignore the elision/contraction of some articles or pronouns but didn't due to an improper order of text transformation components.
This could be problematic if you had a Minimum Should Match lower than 100%
For instance :
l'avion
(mandatory form of "le avion" - "the plane") was supposed to be indexed asavion
(plane) but was still indexed asl avion
, so searching forl'avion bleu vole
could matchl'écureuil bleu vole
with a 75% minimum should match (matches onl
,bleu
andvole
)comedia dell'arte
was supposed to be indexed ascomedia arte
but was indexed ascomedia dell arte
l'avi
(the grandfather) was supposed to be indexed asavi
(grandfather) but was indexed asl avi
This release fixes that issues and now the following elisions/contractions will be removed as intended :
l, m, t, qu, n, s, j, d, c
as defined in that language specific elision filter configurationc, l, all, dall, dell, nell, sull, coll, pell, gl, agl, dagl, degl, negl, sugl, un, m, t, s, v, d
as defined in that language specific elision filter configurationd, l, m, n, s, t
as defined in that language specific elision filter configurationShould this changes affect you negatively, please be aware that you can override those settings (as all of those in the elasticsuite_analysis.xml configuration file) in a custom module.
Feel also free to let us know, especially about the configuration Italian and Catalan since we are not native speakers and we are just following official Elasticsearch recommendations.
📦 Features
🐛 Fixes
Full Changelog: 2.11.5...2.11.5.1
This discussion was created from the release 2.11.5.1.
Beta Was this translation helpful? Give feedback.
All reactions