Re: Different analyzers for different documents in different languages?

2010-09-22 Thread Andrzej Bialecki
On 2010-09-22 15:30, Bernd Fehling wrote: Actually, this is one of the biggest disadvantages of Solr for multilingual content. Solr is field-based, which means you have to know the language _before_ you feed the content to a specific field and process the content for that field. This results in ha

Re: Different analyzers for different documents in different languages?

2010-09-22 Thread Bernd Fehling
Actually, this is one of the biggest disadvantages of Solr for multilingual content. Solr is field-based, which means you have to know the language _before_ you feed the content to a specific field and process the content for that field. This results in having separate fields for each language. E.g.

Re: Different analyzers for different documents in different languages?

2010-09-22 Thread Jan Høydahl / Cominvent
See this thread: http://search-lucene.com/m/FgbDS1JL3J1 Basically, what we normally do is to rename the fields with a language suffix, so if you have language=en and text="A red fox", then you would index it as text_en="A red fox". You would either have to do this outside Solr or write an Updat
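
A minimal sketch of the renaming step described above, done outside Solr before the document is sent for indexing. The function name add_language_suffix and its parameters are hypothetical, not part of any Solr API; it also assumes the schema already defines a field (or dynamic field) per language suffix, each with its own analyzer.

```python
def add_language_suffix(doc, text_fields=("text",), lang_field="language"):
    """Return a copy of doc with each text field renamed to <field>_<lang>.

    Hypothetical helper: the field names and the suffix convention are
    assumptions taken from the example in the message above.
    """
    lang = doc.get(lang_field)
    renamed = {}
    for name, value in doc.items():
        if name in text_fields and lang:
            # e.g. text + language=en -> text_en
            renamed[f"{name}_{lang}"] = value
        else:
            # Leave other fields (and documents with no language) unchanged.
            renamed[name] = value
    return renamed


doc = {"id": "1", "language": "en", "text": "A red fox"}
renamed = add_language_suffix(doc)
print(renamed)
```

Documents without a language value are passed through untouched here; whether to route them to a default field instead is a design choice the thread does not settle.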

Different analyzers for different documents in different languages?

2010-09-22 Thread Andy
I have documents that are in different languages. Each document has a field specifying what language it's in. Is it possible to index the documents such that, based on what language a document is in, a different analyzer will be used on that document? What is the "normal" way to handle do