Re: Preserving original HTML file offsets for highlighting

2011-01-25 Thread Karolina Bernat
Hi Uwe, thanks for this hint. I'm not sure, how much of the Solr functionality do I need to implement for using the HTTPStripCharFilter. I'm using Apache Tika for HTML parsing. Furthermore I use the StandardAnalyzer to initialize my IndexWriter. I don't use a Tokenizer - this would be the Solr app

RE: Preserving original HTML file offsets for highlighting

2011-01-25 Thread Uwe Schindler
Hi Karolina, for this no Solr is needed at all. The CharFilter is simply placed outside Lucene, but you can use without anything else from Solr. You can copy the java file from Solr's source, choose another package name and you are finished. About Tokenizer and Analyzer: StandardAnalyzer does the

Could not find implementing class

2011-01-25 Thread Alex vB
Hello everybody, I used a small indexing example from "Lucene in Action" and can run and compile the program under eclipse. If I want to compile and run it by console I get this error: java.lang.IllegalArgumentException: Could not find implementing class for org.apache.lucene.analysis.tokenattr

Re: Lucene , hits per document

2011-01-25 Thread Grant Ingersoll
With a little logic on your size to count, you can use SpanQueries to do that. -Grant On Jan 21, 2011, at 4:03 PM, Sharma Kollaparthi wrote: > Hi , > > I have started to use Lucene for searching in HTML files. Is it > possible to get Hits per document, when we search for phrases like "Hel

Re: Could not find implementing class

2011-01-25 Thread Alexander Aristov
Set correct classpath. you may want to compare which libraries eclipse includes in classpath. Best Regards Alexander Aristov On 25 January 2011 18:12, Alex vB wrote: > > Hello everybody, > > I used a small indexing example from "Lucene in Action" and can run and > compile the program under ec

AssertionError

2011-01-25 Thread Anuj Shah
Hi, I've upgraded from 3.00 to 3.0.3 and am now hitting assertion errors from IndexWriter.ReaderPool.commit, at this line: // We invoke deleter.checkpoint below, so we must be > // sync'd on IW: > assert Thread.holdsLock(IndexWriter.this); > Has anyone encountered this before?

Lucene on Multi-Processor/Core machines

2011-01-25 Thread Siraj Haider
Hello there, I was looking for best practices for indexing/searching on a multi-processor/core machine but could not find any specific material on that. Do you think it is a good idea to create a guide/how-to for that purpose? It would be very helpful for many people in todays world, where a

Re: Could not find implementing class

2011-01-25 Thread Alex vB
Hello Alexander, isn't it enough to add the classpath through -cp? If I don't use -cp I can't compile my project. I thought after compiling without errors all sources are correctly added. In Eclipse I added Lucene sources the same way(which works) and I also tried using the jar file. Therefore I

RE: Could not find implementing class

2011-01-25 Thread Uwe Schindler
You have to also pass the classpath to java runtime, not only to javac. If you get this problem within Eclipse, have you tried to cleanup the project, maybe not all files are compiled? Are you also compiling Lucene from source? If yes, make sure that all Lucene classes are compiled (see cleanup a

RE: AssertionError

2011-01-25 Thread Uwe Schindler
Can you post your code? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Anuj Shah [mailto:anujshahw...@gmail.com] > Sent: Tuesday, January 25, 2011 5:18 PM > To: java-user@lucene.apache.org > Subject: As

Re: Lucene on Multi-Processor/Core machines

2011-01-25 Thread Glen Newton
This is some older stuff I have done, likely still fairly relevant. I would say that today things are _better_ than these results for Lucene multithreading / multicore. :-) http://zzzoot.blogspot.com/2008/06/lucene-concurrent-search-performance.html http://zzzoot.blogspot.com/2008/06/simultaneous-t

RE: Could not find implementing class

2011-01-25 Thread Alex vB
Hello Uwe, I recompiled some classes manually in Lucene sources. No it's running fine! Something went wrong there. Thank you very much! Best regards Alex -- View this message in context: http://lucene.472066.n3.nabble.com/Could-not-find-implementing-class-tp2330598p2332141.html Sent from the

Highlight Wildcard Queries

2011-01-25 Thread Wulf Berschin
Hi, I'm just migrating our small search customization from Lucene version 2.3 to the current version (3.0.3) and wonder why, in contrast to the old version, we no longer get the Wildcard Queries (which are default, since surround the search string with asterisks) highlighted. We're using the

Re: AssertionError

2011-01-25 Thread Michael McCandless
... and also the full exception? Mike On Tue, Jan 25, 2011 at 11:52 AM, Uwe Schindler wrote: > Can you post your code? > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > >> -Original Message- >> From: Anuj Shah [mailto:

Re: Highlight Wildcard Queries

2011-01-25 Thread Alexander Kanarsky
Hi Wulf, Check the http://www.lucidimagination.com/blog/2009/06/08/bringing-the-highlighter-back-to-wildcard-queries-in-solr-14/ this may help. I do not know what of Mark's changes are in Lucene 3.x, but most likely you will just need to set a proper RewriteMethod for the MultiTermQuery somewhere

RE: Highlight Wildcard Queries

2011-01-25 Thread Uwe Schindler
You can set setExpandMultiTermQuery(true) on QueryScorer (or WeightedSpanTermExtractor), which is needed by the Highlighter CTOR - that would do exactly what you want with the standard highlighter. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaph

RE: Highlight Wildcard Queries

2011-01-25 Thread Uwe Schindler
And: you don't need to rewrite queries before highlighting, highlighter does this automatically internally if needed. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Uwe Schindler [mailto:u...@thetaphi.d

CFP - Berlin Buzzwords 2011 - Search, Score, Scale

2011-01-25 Thread Isabel Drost
This is to announce the Berlin Buzzwords 2011. The second edition of the successful conference on scalable and open search, data processing and data storage in Germany, taking place in Berlin. Call for Presentations Berlin Buzzwords http://berlinbuz