Hi,
I'm trying to figure out if it's intentional or a bug that truncated search in
fields with field type text_da only works for 4-6 characters - longer queries
gives 0 results.
I've tried starting up both solr 9.4 and 9.6 using the tutorial for launching
Solr in SolrCloud mode, but instead of choosing the techproducts configset I've
used the _default configset, because it has in the managed-schema.xml a
dynamic field for Danish.
<!-- Danish -->
<dynamicField name="*_txt_da" type="text_da" indexed="true"
stored="true"/>
I've then posted a modified record (like the ones in the books.json example) to
my collection with this data:
{
"id" : "978-8776075224",
"cat" : ["book","paperback"],
"name" : "Lægen",
"author" : "Kirsten Ahlburg",
"sequence_i" : 1,
"genre_s" : "fiction",
"inStock" : true,
"price" : 30.50,
"pages_i" : 36,
"abstract_txt_da": "Da Lisa skal opereres, møder hun lægen Jacob. Hun
forelsker sig i ham, men kan man blive kæreste med sin læge?"
}
If I in the solr GUI send of this query (expected to hit "læge"/"lægen")
abstract_txt_da:læg*
Numfound: 1
If I add a letter, I get no results (expected to hit "lægen")
abstract_txt_da:læge*
Numfound: 0
The same can be seen with this query (expected to hit "opereres")
abstract_txt_da:oper*
Numfound: 1
abstract_txt_da:opere* (expected to hit "opereres")
Numfound: 0
If I try truncating the word 'kæreste' I however get to add a few more
characters before it gives no results
abstract_txt_da:kærest* (expected to hit "kæreste")
Numfound: 1
abstract_txt_da:kæreste* (expected to hit "kæreste")
Numfound: 0
I doesn't seem to be a problem to truncate a word that doesn't really need
truncating in the genre_s field
genre_s:fiction*
Numfound: 1
Is this working as intended, and if yes, then why?
Or is this a bug?
Link to the tutorial I used.
https://solr.apache.org/guide/solr/latest/getting-started/tutorial-techproducts.html
Best regards,
Kirstine Christensen,
Developer at Danish company