Re: Tool for analyzing analyzers

2004-06-02 Thread Leo Galambos


Zilverline <[EMAIL PROTECTED]> wrote:
__

>get more out of  lucene, such as incremental indexing, to name one. On 

Hello,

as far as I know, the incremental indexing
could be a real bottleneck if you implemented
your system without some knowledge
about Lucene internals.

The respective test is here:
http://www.egothor.org/twiki/bin/view/Know/LuceneIssue

Cheers,
Leo



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Tool for analyzing analyzers

2004-06-02 Thread Zilverline
Hi Erik,
Thanks for your reply. Have you tried it on a collection yet? I'd love 
the get some of your feedback. I have limited knowledge of the 
underlying capabilities of the lucene library, which is a complement to 
you, since it was extremely easy to integrate lucene. But I'd like to 
get more out of  lucene, such as incremental indexing, to name one. On 
the otherhand I'm interested in general requirements and wishes for the app.

regards,
 Michael Franken
Erik Hatcher wrote:
On May 28, 2004, at 6:50 AM, Zilverline info wrote:
But I'd love to build a Lucene "demo" application that is powerful 
enough to be used as a foundation for folks to use out-of-the-box.

That's just what I thought. Here's one: http://www.zilverline.org

Michael - zilverline is nicely done!  I downloaded it and dropped it 
into Tomcat and it came right up.  I did not actually configure a 
collection yet, but from the docs on the website it looks like you 
have built something quite nice.  Maybe you could embed a built-in 
collection of the zilverline docs so something comes up right away and 
is searchable :)

Nice work.  I'll definitely stay tuned into your project.
Erik
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Re: Tool for analyzing analyzers

2004-05-30 Thread Erik Hatcher
On May 28, 2004, at 6:50 AM, Zilverline info wrote:
But I'd love to build a Lucene "demo" application that is powerful 
enough to be used as a foundation for folks to use out-of-the-box.
That's just what I thought. Here's one: http://www.zilverline.org
Michael - zilverline is nicely done!  I downloaded it and dropped it 
into Tomcat and it came right up.  I did not actually configure a 
collection yet, but from the docs on the website it looks like you have 
built something quite nice.  Maybe you could embed a built-in 
collection of the zilverline docs so something comes up right away and 
is searchable :)

Nice work.  I'll definitely stay tuned into your project.
Erik
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Re: Tool for analyzing analyzers

2004-05-28 Thread Zilverline info
Hi Erik,
Erik Hatcher wrote:
[snip]
But I'd love to build a Lucene "demo" application that is powerful 
enough to be used as a foundation for folks to use out-of-the-box.
That's just what I thought. Here's one: http://www.zilverline.org
Erik
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Cheers,
   Michael Franken
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Re: Tool for analyzing analyzers

2004-05-28 Thread Erik Hatcher
On May 28, 2004, at 2:46 AM, [EMAIL PROTECTED] wrote:
Hi Erik,
I've had this running OK from the command line and in Eclipse on XP.
I suspect it might be because you're running a different OS? The 
"Classfinder" tries to split the system property
"java.class.path"  on the ";" character but I forgot different OSes 
have different seperators.
There is another OS other than Mac OS X?  :)
There is a File constant that gives you the OS-specific separator.  
File.pathSeparatorChar.

Using a plug-in architecture (once we get classloading sorted!) you 
could define interfaces for
things such as fetchers (db/file/web) and parsers (PDF/Word..) and 
configure them to create indexes
using a GUI like this, or a web-based interface. People could then 
contribute plug-in implementations
as Jars that you could just drop in to the workbench.
Sounds like we'd be re-inventing Nutch :)
But I'd love to build a Lucene "demo" application that is powerful 
enough to be used as a foundation for folks to use out-of-the-box.

Erik
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Re: Tool for analyzing analyzers

2004-05-28 Thread Morus Walter
Hi Mark,

> I've had this running OK from the command line and in Eclipse on XP.
> I suspect it might be because you're running a different OS? The "Classfinder" tries 
> to split the system property
> "java.class.path"  on the ";" character but I forgot different OSes have different 
> seperators.
> 
> Let me know your setup details and I'll try fix the classloader issue.
> 
I have the same problems and am running on linux using ':' to separate
the class path...

BTW: I tried to compile your sources but you left out the part in thinlet.
  2928 Sun Oct 12 19:47:56 CEST 2003 thinlet/AppletLauncher.class
  2643 Sun Oct 12 19:47:56 CEST 2003 thinlet/FrameLauncher.class
 74823 Sun Oct 12 19:47:56 CEST 2003 thinlet/Thinlet.class
Was that intentional?

Morus

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Tool for analyzing analyzers

2004-05-27 Thread markharw00d
Hi Erik,
I've had this running OK from the command line and in Eclipse on XP.
I suspect it might be because you're running a different OS? The "Classfinder" tries 
to split the system property
"java.class.path"  on the ";" character but I forgot different OSes have different 
seperators.

As for Luke etc - I had a vague notion that this could be extended into a more 
generalised workbench
for Lucene that could also help with indexing.
Using a plug-in architecture (once we get classloading sorted!) you could define 
interfaces for 
things such as fetchers (db/file/web) and parsers (PDF/Word..) and configure them to 
create indexes
using a GUI like this, or a web-based interface. People could then contribute plug-in 
implementations 
as Jars that you could just drop in to the workbench. 

Let me know your setup details and I'll try fix the classloader issue.

Cheers
Mark

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Tool for analyzing analyzers

2004-05-27 Thread Erik Hatcher
Mark,
Nice idea!  (I've had this type of thing on my to-do list for the 
Lucene demo refactoring that I *promise* I'll eventually get around 
to).

I tried to get it to work, though, and was unsuccessful.  It did not 
show me any Analyzers in the drop down (I have the latest CVS version 
of Lucene in my classpath).  Maybe this could be added into Luke as a 
new tab?  You can sort of fake this with Luke now, by entering your 
text as a query and seeing what it parses to, and select an Analyzer.

Erik
On May 27, 2004, at 6:45 PM, [EMAIL PROTECTED] wrote:
I've knocked together this tool which automatically discovers 
Analyzers on the classpath and provides a GUI to allow you to try out 
different Analyzers and see their effects:

http://www.inperspective.com/lucene/Viewer.zip
This needs JDK1.4 and you'll need to define  the classpath to include 
Lucene and any of your custom analyzers.

Paste in some example text, take your pick of analyzer and hit the 
"Analyze" button to see the results.

Cheers
Mark

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]