On Tuesday 13 April 2004 15:31, Holger Klawitter wrote:
> Hi Erik,
>
> > What is wrong with simply creating a new token that replaces an
> > incoming one for synonyms?
> > I'm just playing devil's advocate here since you can already get
> > the termText() through the public _method_.
>
> Well,
The underlying assumption was that File.isDirectory() returns false
for symlinks, but we never tested under UNIX or Linux, and the JavaDoc is
not very explicit about this (as is so often the case). If that assumption
is wrong, can someone send me a hint on how to do it properly? I assume it
involves getCanonicalPath().
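For what it's worth, File.isDirectory() follows symbolic links, so it returns true for a link that points at a directory, which is consistent with the loop reported elsewhere in this thread. Below is a minimal sketch of one way to avoid it, not the code under discussion. It assumes a JDK with java.nio.file (Java 7 or later, so newer than this thread), and the class and method names (SymlinkAwareWalker, listFiles) are made up for illustration. It skips symlinks outright and also remembers the canonical path of every directory it has entered.

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.Files;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper, not part of Lucene or of the tool discussed in the thread.
final class SymlinkAwareWalker {

    /** Collects regular files under root, skipping symlinks and directories already visited. */
    static List<File> listFiles(File root) {
        List<File> out = new ArrayList<File>();
        walk(root, new HashSet<String>(), out);
        return out;
    }

    private static void walk(File f, Set<String> seen, List<File> out) {
        // File.isDirectory() follows links, so links are filtered out explicitly first.
        if (Files.isSymbolicLink(f.toPath())) {
            return;
        }
        if (f.isDirectory()) {
            String canonical;
            try {
                canonical = f.getCanonicalPath();
            } catch (IOException e) {
                return; // unreadable path: skip it rather than abort the whole walk
            }
            // Second line of defence: never enter the same real directory twice.
            if (!seen.add(canonical)) {
                return;
            }
            File[] children = f.listFiles();
            if (children == null) {
                return; // not listable (permissions or I/O error)
            }
            for (File child : children) {
                walk(child, seen, out);
            }
        } else if (f.isFile()) {
            out.add(f);
        }
    }
}
```

On older JDKs without Files.isSymbolicLink, comparing getCanonicalPath() with getAbsolutePath() is a common heuristic, though it can misfire when a parent directory is itself a link.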
I've uploaded it to the wiki:
http://wiki.apache.org/jakarta-lucene/HttpUnitExample
It's not anywhere close to production quality, especially since it's based
on a unit test framework.
sv
On Tue, 13 Apr 2004, Stephane James Vaucher wrote:
> I'm wondering if there is interest for a simple sp
Hi Erik,
> What is wrong with simply creating a new token that replaces an
> incoming one for synonyms?
> I'm just playing devil's advocate here since you can already get
> the termText() through the public _method_.
Well, you're right; I forgot
Looks cool, but I've got a question:
How do you handle symlinks on *nix? I think it's stuck in a loop.
When indexing my home dir, I see it indexing:
/home/vauchers/.Cirano-gnome/.gnome-desktop/Home directory/.Cirano-gnome/...
cheers,
sv
On Wed, 14 Apr 2004, Peter Becker wrote:
> Hello,
>
> we
Hi,
Does anyone know the difference between i411 Faceted Metadata Search and
the Lucene search engine?
Thanks,
William.
I'm wondering if there is interest in a simple spider demo.
I've got an example of how to use HttpUnit to spider a web site and
index it on disk (only HTML pages for now). I can send it to the list
if anyone is interested (it's one class, < 200 LOC).
cheers,
sv
---
Hello,
We have released Docco 0.3 along with two updates for its plugins.
Docco is a personal document retrieval tool based on Apache's Lucene
indexing engine and Formal Concept Analysis. It allows you to create an
index for files on your file system which you can then search for
keywords. It can i
What is wrong with simply creating a new token that replaces an
incoming one for synonyms?
I'm just playing devil's advocate here since you can already get
the termText() through the public _method_.
Erik
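The "create a new token" approach can be sketched roughly as follows. This is a self-contained illustration, not Lucene code: SimpleToken is a hypothetical stand-in that only borrows the termText() accessor name mentioned above, and SynonymReplacer and its map are likewise assumptions made for the example.

```java
import java.util.Map;

/** Hypothetical stand-in for a token (text plus character offsets); not Lucene's Token class. */
final class SimpleToken {
    private final String text;
    private final int start;
    private final int end;

    SimpleToken(String text, int start, int end) {
        this.text = text;
        this.start = start;
        this.end = end;
    }

    String termText() { return text; }
    int startOffset() { return start; }
    int endOffset() { return end; }
}

/** Hypothetical filter step: swaps a token for a new one instead of mutating its text. */
final class SynonymReplacer {
    private final Map<String, String> synonyms;

    SynonymReplacer(Map<String, String> synonyms) {
        this.synonyms = synonyms;
    }

    /** Returns a new token carrying the mapped term, or the input token itself if nothing is mapped. */
    SimpleToken replace(SimpleToken in) {
        String mapped = synonyms.get(in.termText());
        if (mapped == null) {
            return in;
        }
        // Offsets are copied so the new token still points at the original span of text.
        return new SimpleToken(mapped, in.startOffset(), in.endOffset());
    }
}
```

Because only the public accessor is read, a step like this can live in any package, which is the point of the devil's-advocate question.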
On Apr 13, 2004, at 9:52 AM, Holger Klawitter wrote:
-BEGIN PGP SIGNED MESSAGE--
Hi there,
Just a short suggestion:
It would be useful to make Token.termText public (or to provide a
reader/writer pair).
That way one can create TokenFilters that alter termText (for synonyms,
for example) in packages other than org.apache.lucene.anal
I'm actually pretty lazy about index updates, and haven't had the need for
efficiency, since my requirement is that new documents should be
available on a next-working-day basis.
I reindex everything from scratch every night (400,000 docs) and store it
in a timestamped index. When the reindexin
If you close an IndexWriter more than once, releasing the writeLock
throws a NullPointerException.
You should clean up your code and close your writer only once. Anyway, I
don't know why there's no check on the 'writeLock' like the one in the
'finalize' method.
I think it's a minor bug, so I suggest
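The suggestion itself is cut off in this snippet. Independently of what was proposed, a common way to make a close() method safe to call twice is to guard the release with a null check. The sketch below assumes nothing about Lucene's internals: DemoLock and DemoWriter are hypothetical stand-ins that only reuse the writeLock field name from the message.

```java
/** Hypothetical stand-in for a lock object; not Lucene's Lock class. */
final class DemoLock {
    private boolean held = true;

    void release() { held = false; }
    boolean isHeld() { return held; }
}

/** Hypothetical writer whose close() tolerates repeated calls. */
final class DemoWriter {
    private DemoLock writeLock = new DemoLock();

    /** Releases the lock on the first call; later calls are no-ops. */
    synchronized void close() {
        if (writeLock != null) {
            writeLock.release();
            writeLock = null; // mark as released so a second close() skips the release
        }
    }

    synchronized boolean isClosed() {
        return writeLock == null;
    }
}
```

Closing the writer exactly once remains the cleaner fix in calling code; the guard only keeps an accidental second call from failing.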
On Apr 13, 2004, at 02:45, Kevin A. Burton wrote:
He mentioned that I might be able to squeeze 5-10% out of index merges
this way.
Talking of which... what strategy(ies) do people use to minimize
downtime when updating an index?
My current "strategy" is as follows:
(1) use a temporary RAMDirect
13 matches