Hi Hoss.
I've tried that yesterday using the same approach you just said (I've
created the base fields for any language with basic analyzers) and it worked
alright.
Thanks again for you time.
Regards,
Daniel
On 20/6/07 21:00, Chris Hostetter [EMAIL PROTECTED] wrote:
: So far it sounds
: So far it sounds good for my needs, now I'm going to try if my other
: features still work (I'm worried about highlighting as I'm going to return a
: different field)...
i'm not really a highlighting guy so i'm not sure ... but if you're okay
with *simple* highlighting you can probably just
: range wouldn't be a problem in this case. The real issue I can see in this
: approach, is related to Analyzers... How to make them deal with different
: languages properly using one Solr instance with the same set of fields being
: used by documents in different languages
i would still use
: One bad thing in having fields specific for your language (in my point of
: view) is that you will have to re-index your content when you add a new
: language (some will need to start with one language and in future will have
: others added). But OK, let's say the indexing is done.
i don't see
Hi Hoss
One bad thing in having fields specific for your language (in my point of
view) is that you will have to re-index your content when you add a new
language (some will need to start with one language and in future will have
others added). But OK, let's say the indexing is done.
So using
Daniel,
I was reading your email and responses to it with great
interest.
I was aware that Solr has an implicit assumption that
a field is mono-lingual per system. But your mail and
its correspondence made me wonder if this limitation
is practical for multi-lingual search applications. For
On 6/12/07, Teruhiko Kurosaka [EMAIL PROTECTED] wrote:
For bi-lingual
or tri-lingual search, we can have parallel fields (title_en,
title_fr, title_de, for example) but this wouldn't scale well.
Due to search across multiple fields, or due to increased index size?
Lucene and Solr
requires
Hi Yonik,
On 6/12/07, Teruhiko Kurosaka [EMAIL PROTECTED] wrote:
For bi-lingual
or tri-lingual search, we can have parallel fields (title_en,
title_fr, title_de, for example) but this wouldn't scale well.
Due to search across multiple fields, or due to increased index size?
Due to the
: Due to the prolification of number of fields. Say, we want
: to have the field title to have the title of the book in
: its original language. But because Solr has this implicit
: assumption of one language per field, we would have to have
: the artifitial fields title_fr, title_de, title_en,
This sounds OK.
I can create a field name mapping structure to change the requests /
responses in a way my client doesn't need to be aware of different fields.
Thanks for this directions,
Daniel
On 8/6/07 21:32, Chris Hostetter [EMAIL PROTECTED] wrote:
: Can't I have the same index, using
Hi Henri,
Thanks again, your considerations will sure help on my decision.
Now I'll do my homework to check document volume / growth - expected index
sizes and query load.
Regards,
Daniel Alheiros
On 9/6/07 10:53, Henrib [EMAIL PROTECTED] wrote:
Hi Daniel,
Trying to recap: you are
on it and notify the sender immediately.
Please note that the BBC monitors e-mails sent or received.
Further communication will signify your consent to this.
--
View this message in context:
http://www.nabble.com/Multi-language-indexing-and-searching-tf3885324
way nor act in
reliance on it and notify the sender immediately.
Please note that the BBC monitors e-mails sent or received.
Further communication will signify your consent to this.
--
View this message in context:
http://www.nabble.com/Multi-language
Hi Henri.
Thanks for your reply.
I've just looked at the patch you referred, but doing this I will lose the
out of the box Solr installation... I'll have to create my own Solr
application responsible for creating the multiple cores and I'll have to
change my indexing process to something able to
: Can't I have the same index, using one single core, same field names being
: processed by language specific components based on a field/parameter?
yes, but you don't really need the complexity you describe below ... you
don't need seperate request handlers per language, just seperate fields
Hi,
I'm just starting to use Solr and so far, it has been a very interesting
learning process. I wasn't a Lucene user, so I'm learning a lot about both.
My problem is:
I have to index and search content in several languages.
My scenario is a bit different from other that I've already read in
16 matches
Mail list logo