On Monday 18 April 2011 12:47:25 Klaus Tachtler wrote:
> Hi Markus,
> 
> I'm using Nutch 1.2 for crawling. Later I will use Luke to inspect the
> Lucene index.
> 
> But I'm surprised, because I don't think I'm the only one who wants to
> search across different languages.

I partially indexed Wikipedia once and used the languageidentifier to add the
lang field. I had no trouble, most likely because I use omitNorms.
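
As a rough sketch of what omitting norms on the lang field can look like,
assuming the index is defined by a Solr schema.xml (the thread does not say
which schema was used, and the field type name "string" is an assumption):

```xml
<!-- Hypothetical schema.xml entry: the "lang" field written by the
     language identifier, with norms omitted so that no index-time
     boost or length normalization is stored for this field. -->
<field name="lang" type="string" stored="true" indexed="true" omitNorms="true"/>
```

With norms omitted, an index-time boost on this field is not stored at all,
so it should not be able to pull a document's score for lang queries to zero.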

> 
> Now I'm frustrated that solving the problem will be more difficult
> than I expected!
> 
> But thank you very much for your help, and for the answer a few months
> ago, when I first asked about crawling different languages and
> you told me that I must have different URLs. Thank you for that too!

No problem.

> 
> 
> Klaus.
> 
> >> What about this configuration in nutch-default.xml:
> >> 
> >> ...
> >> <property>
> >> 
> >>    <name>query.lang.boost</name>
> >>    <value>0.0</value>
> >>    <description> Used as a boost for lang field in Lucene query.
> >>    </description>
> >> 
> >> </property>
> >> ...
> >> 
> >> Could this be helpful?
> > 
> > Only if you're sure it's the lang field that has an index-time boost of 0.
> > You can use Luke to inspect the documents for index-time boosts. Also, the
> > query.* configuration should only be used with the Nutch <= 1.2 search.
> > 
> >> Klaus.
> >> 
> >> 
> >> --
> >> 
> >> ------------------------------------------------
> >> e-Mail  : [email protected]
> >> Homepage: http://www.tachtler.net
> >> DokuWiki: http://www.dokuwiki.tachtler.net
> >> ------------------------------------------------
> > 
> > --
> > Markus Jelsma - CTO - Openindex
> > http://www.linkedin.com/in/markus17
> > 050-8536620 / 06-50258350
> 
> ----- End of message from [email protected] -----
> 
> 
> 
> Regards
> Klaus.

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350
