Good example scientific lucene sites?

2005-02-08 Thread Fred Toth
Hi, I'm going to be demonstrating some of our work with lucene to a prospective customer this week, and I'm wondering if any of you have suggestions for other relevant sites that use lucene. In particular, I'm interested in scientific or technical sites, perhaps with use of the highlighter, and per

Re: Faster highlighting with TermPositionVectors

2004-10-28 Thread Fred Toth
Hi, We are very interested in highlighting, but haven't gotten around to reviewing the state of the highlighting mechanisms. Could someone possibly give me the "big picture" on highlighting? What code is available? How does it work? What are the current issues? Many thanks, Fred At 07:16 PM 10/28/2

Re: Index + Searching

2004-10-15 Thread Fred Toth
direction that would really be appreciated. I am thinking about editing the shipped HTMLDocument.java and HTMLParser.java files but not sure what I need to add. Can you please explain a little more. TIA. -H Fred Toth wrote: Hi, Could be your best bet is to use HTML tags. Create tags for name, description,

Re: single quote unicode character

2004-10-11 Thread Fred Toth
Hi Chris, Getting unicode to pass cleanly through to the browser can be tricky. A couple of questions: What do you see instead of the incorrect character? What happens when you tell your browser to display UTF-8? (In IE, it's in View->Encoding->Unicode) Does your character display properly? Are you

RE: demo IndexHTML parser breaks unicode?

2004-09-24 Thread Fred Toth
ce the fis with this: fis = new InputStreamReader(new FileInputStream(f), "UTF-16"); -----Original Message- From: Fred Toth [mailto:[EMAIL PROTECTED] Sent: Friday, September 24, 2004 9:25 PM To: Lucene Users List Subject: Re: demo IndexHTML parser breaks unicode? Sorry, that didn't c

Re: demo IndexHTML parser breaks unicode?

2004-09-24 Thread Fred Toth
Sorry, that didn't cure it. Again, anyone want to point me to the quickest replacement HTML parser (that's unicode clean)? Thanks, Fred At 03:17 PM 9/24/2004, you wrote: On Friday 24 September 2004 19:58, Fred Toth wrote: > I've got unicode in my source HTML. In particular, withi

demo IndexHTML parser breaks unicode?

2004-09-24 Thread Fred Toth
Hi, I was hoping it wouldn't come to this: I've got unicode in my source HTML. In particular, within meta tags, and it's getting broken by the indexer. Note that I'm not trying to query on any of this, just store and retrieve document titles with unicode characters. Has anyone else experienced this

Keyword query confusion

2004-09-24 Thread Fred Toth
Hi all, I'm trying to understand what's going on with the query parser and keyword fields. I've got a large subset of my documents which are "publications". So as to be able to query these, I've got this in the indexer: doc.add(Field.Keyword("is_pub", "1")); However, if I run a query: is_pu

demo HTML parser question

2004-09-22 Thread Fred Toth
Hi, I've been working with the HTML parser demo that comes with Lucene and I'm trying to understand why it's multi-threaded, and, more importantly, how to exit gracefully on errors. I've discovered if I throw an exception in the front-end static code (main(), etc.), the JVM hangs instead of exiting

RE: indexes won't close on windows - solved

2004-09-20 Thread Fred Toth
t was serving your servlet with constructed index searcher while your command line application wanted to recreate the index. Are you protected against this situation? Jiri. -Original Message- From: Fred Toth [mailto:[EMAIL PROTECTED] Sent: Monday, September 20, 2004 3:40 PM To: Lucene Users

Re: indexes won't close on windows

2004-09-20 Thread Fred Toth
ing environment, in this case you have to make indexing, delition and search synchronized So ... Have fun, Sergiu PS: I think that I'll submit some code with synchronized index/delete/search operations and to tell why I need to use it. Fred Toth wrote: Hi Sergiu, My searches take place in to

Re: indexes won't close on windows

2004-09-20 Thread Fred Toth
r process has it open. I am not certain where exactly this error comes from in Lucene (exception stack trace?), but I thought the Lucene code included work-arounds for this. Otis --- Fred Toth <[EMAIL PROTECTED]> wrote: > Hi Sergiu, > > My searches take place in tomcat, in a struts

Re: indexes won't close on windows

2004-09-20 Thread Fred Toth
thanks, Fred At 02:32 AM 9/20/2004, you wrote: Hi Fred, I think that we can help you if you provide us your code, and the context in which it is used. we need to see how you open and close the searcher and the reader, and what operations are you doing on index. All the best, Sergiu Fred To

indexes won't close on windows

2004-09-19 Thread Fred Toth
Hi, I have built a nice lucene application on linux with no problems, but when I ported to windows for the customer, I started experiencing problems with the index not closing. This prevents re-indexing. I'm using lucene 1.4.1 under tomcat 5.0.28. My search operation is very simple and works great: