Hi,
I'm going to be demonstrating some of our work with lucene to
a prospective customer this week, and I'm wondering if
any of you have suggestions for other relevant sites that
use lucene.
In particular, I'm interested in scientific or technical sites,
perhaps with use of the highlighter, and per
Hi,
We are very interested in highlighting, but haven't gotten around
to reviewing the state of the highlighting mechanisms.
Could someone possibly give me the "big picture" on highlighting?
What code is available?
How does it work?
What are the current issues?
Many thanks,
Fred
At 07:16 PM 10/28/2
direction that would really be appreciated. I am
thinking about editing the shipped HTMLDocument.java and HTMLParser.java
files but not sure what I need to add. Can you please explain a little more.
TIA.
-H
Fred Toth wrote:
Hi,
Could be your best bet is to use HTML tags. Create tags
for name, description,
Hi Chris,
Getting unicode to pass cleanly through to the browser can
be tricky. A couple of questions:
What do you see instead of the incorrect character?
What happens when you tell your browser to display UTF-8?
(In IE, it's in View->Encoding->Unicode) Does your character
display properly?
Are you
ce the fis with this:
fis = new InputStreamReader(new FileInputStream(f), "UTF-16");
-----Original Message-
From: Fred Toth [mailto:[EMAIL PROTECTED]
Sent: Friday, September 24, 2004 9:25 PM
To: Lucene Users List
Subject: Re: demo IndexHTML parser breaks unicode?
Sorry, that didn't c
Sorry, that didn't cure it.
Again, anyone want to point me to the quickest replacement
HTML parser (that's unicode clean)?
Thanks,
Fred
At 03:17 PM 9/24/2004, you wrote:
On Friday 24 September 2004 19:58, Fred Toth wrote:
> I've got unicode in my source HTML. In particular, withi
Hi,
I was hoping it wouldn't come to this:
I've got unicode in my source HTML. In particular, within meta tags,
and it's getting broken by the indexer. Note that I'm not trying to
query on any of this, just store and retrieve document titles with
unicode characters.
Has anyone else experienced this
Hi all,
I'm trying to understand what's going on with the query parser
and keyword fields.
I've got a large subset of my documents which are "publications".
So as to be able to query these, I've got this in the indexer:
doc.add(Field.Keyword("is_pub", "1"));
However, if I run a query:
is_pu
Hi,
I've been working with the HTML parser demo that comes with
Lucene and I'm trying to understand why it's multi-threaded,
and, more importantly, how to exit gracefully on errors.
I've discovered if I throw an exception in the front-end static
code (main(), etc.), the JVM hangs instead of exiting
t was
serving your servlet with constructed index searcher while your command
line application wanted to recreate the index. Are you protected against
this situation?
Jiri.
-Original Message-
From: Fred Toth [mailto:[EMAIL PROTECTED]
Sent: Monday, September 20, 2004 3:40 PM
To: Lucene Users
ing environment, in this case you
have to make indexing, delition and search synchronized
So ...
Have fun,
Sergiu
PS: I think that I'll submit some code with synchronized
index/delete/search operations and to tell why I need to use it.
Fred Toth wrote:
Hi Sergiu,
My searches take place in to
r process has it open. I am not certain
where exactly this error comes from in Lucene (exception stack trace?),
but I thought the Lucene code included work-arounds for this.
Otis
--- Fred Toth <[EMAIL PROTECTED]> wrote:
> Hi Sergiu,
>
> My searches take place in tomcat, in a struts
thanks,
Fred
At 02:32 AM 9/20/2004, you wrote:
Hi Fred,
I think that we can help you if you provide us your code, and the context
in which it is used.
we need to see how you open and close the searcher and the reader, and
what operations are you doing on index.
All the best,
Sergiu
Fred To
Hi,
I have built a nice lucene application on linux with no problems,
but when I ported to windows for the customer, I started experiencing
problems with the index not closing. This prevents re-indexing.
I'm using lucene 1.4.1 under tomcat 5.0.28.
My search operation is very simple and works great:
14 matches
Mail list logo