On Monday 18 April 2011 12:47:25 Klaus Tachtler wrote: > Hi Markus, > > I'm using nutch 1.2 for crawling. Later i will use luke to inspect the > lucene index. > > But i wonder, because i don't think i'm the only one who will make a > search for different languages.
I partially indexed wikipedia once and used the languageidentifier to add the lang field. I had no trouble, most likely because i omitNorms. > > Now I'm frustrated that so solve the problem, will be more difficult > as a expected! > > But, thank you very much for your help und for the answer a few months > ago, when I ask the first time for crawling different languages and > you said to me that i must have different urls, thank you for that too! no problem > > > Klaus. > > >> what about the configuration in nutch-default.xml --> > >> > >> ... > >> <property> > >> > >> <name>query.lang.boost</name> > >> <value>0.0</value> > >> <description> Used as a boost for lang field in Lucene query. > >> </description> > >> > >> </property> > >> ... > >> > >> could this be helpful ?!? > > > > Only if you're sure it's the lang field with a 0 index-time boost. > > You can use > > Luke to inspect the documents for index-time boosts. Also, this query.* > > configuration should not be used when not using Nutch =< 1.2 search. > > > >> Klaus. > >> > >> > >> -- > >> > >> ------------------------------------------------ > >> e-Mail : [email protected] > >> Homepage: http://www.tachtler.net > >> DokuWiki: http://www.dokuwiki.tachtler.net > >> ------------------------------------------------ > > > > -- > > Markus Jelsma - CTO - Openindex > > http://www.linkedin.com/in/markus17 > > 050-8536620 / 06-50258350 > > ----- Ende der Nachricht von [email protected] ----- > > > > Grüße > Klaus. > > -- > > ------------------------------------------------ > e-Mail : [email protected] > Homepage: http://www.tachtler.net > DokuWiki: http://www.dokuwiki.tachtler.net > ------------------------------------------------ -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

