Zilverline Search Engine version 1.0-final released

2004-11-27 Thread Zilverline info
All, I've just released Zilverline version 1.0. New features include incremental indexing and scheduling of the indexing process, as well as a few minor updates. The source will also be made available very soon. Zilverline is protected by a Collaborative Source License. You can read more on this type

Re: Search PDF ???

2004-10-25 Thread Zilverline info
Hi Eric, Try Zilverline. Michael Eric Chow wrote: Hello, 1. Is it possible to use Lucene to search PDF contents? 2. Can it search PDF files with Chinese contents??? Eric

Zilverline release candidate 1.0-rc7 available

2004-10-18 Thread Zilverline info
All, I've just released a new candidate (*1.0-rc7*). New features include highlighting and 'on-the-fly' extraction of archives. Zilverline is a search engine based on Lucene that's ready to roll, and can be simply dropped in a Servlet Engine. It runs out of the box, and supports PDF, WORD, HTM, TX

Zilverline release candidate 1.0-rc6 available

2004-10-02 Thread Zilverline info
All, I've just released a new candidate (*1.0-rc6*). New features include a command line indexer and support for Chinese and Cyrillic. Zilverline is a free search engine based on Lucene that's ready to roll, and can be simply dropped in a Servlet Engine. It runs out of the box, and supports PDF,

lucene 1.4 in maven repository

2004-08-25 Thread Zilverline info
Hi, Can anyone tell me why there is no lucene 1.4 jar in the maven repository @ http://www.ibiblio.org/maven/lucene/jars/ ? Who makes them available? It would be very convenient to be able to get the latest version from there (or anywhere else) regards, Michael Franken

Re: searchhelp

2004-08-19 Thread Zilverline info
The PDF and WORD stuff has been done too: have a look at http://www.zilverline.org. Michael Franken Chandan Tamrakar wrote: For PDF you need to extract the text from PDF files using the PDFBox library, and for Word documents you can use the Apache POI APIs. There are messages posted on the lucene list rel

Re: Weighted queries

2004-08-06 Thread Zilverline info
Hi Eric, I have implemented this in Zilverline. What I do is the following: subclass QueryParser and override getFieldQuery: protected Query getFieldQuery(String field, Analyzer analyzer, String queryText) throws ParseException { // for field that contain 'contents' add boostfactors f
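
The preview above is cut off before the boost logic, so the actual Zilverline code is not shown here. As a rough illustration of the same idea (weighting some fields more than others), the sketch below takes a different, simpler route than subclassing QueryParser: it expands one search term into per-field clauses using Lucene's `field:term^boost` query syntax. The class name, method name, field names, and boost values are all hypothetical, and the term is assumed to contain no characters that need escaping.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not Zilverline's code: builds a Lucene query
// string that searches one term in several fields with different boosts.
public class BoostedQueryBuilder {

    // Expands a single term into "field:term^boost" clauses separated by
    // spaces. Assumes the term has no special query-syntax characters.
    public static String expand(String term, Map<String, Float> boosts) {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, Float> e : boosts.entrySet()) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(e.getKey()).append(':').append(term)
              .append('^').append(e.getValue());
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Insertion order is kept so the clauses appear in a stable order.
        Map<String, Float> boosts = new LinkedHashMap<>();
        boosts.put("title", 3.0f);     // assumed field name and weight
        boosts.put("contents", 1.0f);  // assumed field name and weight
        System.out.println(expand("lucene", boosts));
    }
}
```

The resulting string can be handed to a standard query parser. Overriding getFieldQuery, as described in the message, keeps the boosting inside the parser instead, which is the better fit when users type arbitrary queries.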

Zilverline release candidate 1.0-rc4 available

2004-07-26 Thread Zilverline info
All, I've just released a new candidate (*1.0-rc4*). New features include Spanish GUI, RTF support, searching on date range, customizable boosting factors, and configurable analyzers per collection. Zilverline now generates an MD5 hash per file, and prevents duplicate files from being added more tha
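
How Zilverline implements the MD5-based duplicate check is not shown in the announcement. A minimal sketch of the general technique, using only the Java standard library, might look like the following. The class and method names are hypothetical, and the set of seen hashes is kept in memory here, whereas a real indexer would presumably persist it alongside the index.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch, not Zilverline's code: skips content whose MD5
// hash has already been seen.
public class DuplicateFilter {

    // Hex-encoded MD5 hashes of everything accepted so far.
    private final Set<String> seenHashes = new HashSet<>();

    // Returns the MD5 digest of the content as lowercase hex.
    public static String md5Hex(byte[] content) {
        try {
            MessageDigest md = MessageDigest.getInstance("MD5");
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest(content)) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // Every Java platform is required to provide MD5.
            throw new IllegalStateException("MD5 not available", e);
        }
    }

    // Returns true if the content is new (and records it),
    // false if identical content was added before.
    public boolean addIfNew(byte[] content) {
        return seenHashes.add(md5Hex(content));
    }

    public static void main(String[] args) {
        DuplicateFilter filter = new DuplicateFilter();
        byte[] doc = "same text".getBytes(StandardCharsets.UTF_8);
        System.out.println(filter.addIfNew(doc)); // first copy is accepted
        System.out.println(filter.addIfNew(doc)); // identical copy is rejected
    }
}
```

Hashing file content rather than comparing file names means the same document stored under two different paths is still detected as a duplicate.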

Re: PDFBox problem.

2004-07-23 Thread Zilverline info
new PDFTextStripper(); pdftext = stripper.getText(pdDoc); writer.write(pdftext +" "); PDDocumentInformation info = pdDoc.getDocumentInformation(); pdftitle = info.getTitle(); } catch(Exception err) {

Re: Extracting Lucene onto Tomcat

2004-07-21 Thread Zilverline info
ted directories onto the Tomcat server, if so what should my root directory be? Say for example the extracted directories org/apache/lucene/ Should I have that as public_html/WEB-INF/org/apache/lucene? Ian --- Zilverline info <[EMAIL PROTECTED]> wrote: Hi Ian, Depending on what you want to do,

Re: Extracting Lucene onto Tomcat

2004-07-21 Thread Zilverline info
Hi Ian, Depending on what you want to do, you could also follow the installation instructions on http://www.zilverline.org. It describes how to install zilverline, but the same goes for the lucene war. Hope this helps, Michael Franken Ian McDonnell wrote: Also another silly question, do i need

Re: Anyone use MultiSearcher class

2004-07-12 Thread Zilverline info
Hi Don, Yes, I'm using the MultiSearcher (in Zilverline), and have seen no serious performance issues with it. The app performs well with multiple indexes; it responds so quickly (with 100k+ documents) that I haven't even taken the time to measure the difference from a single index search. Michael

Re: upgrade from Lucene 1.3 final to 1.4rc3 problem

2004-07-07 Thread Zilverline info
This is a bug (see posting 'Lockfile Problem Solved'); upgrade to 1.4-final, and you'll be fine. Alex Aw Seat Kiong wrote: Hi! I'm using Lucene 1.3 final currently, all things were working fine. But, after I upgraded from Lucene 1.3 final to 1.4rc3 (simply overwrite the lucene-1.4-final.jar to

Zilverline release candidate 1.0-rc3 available

2004-06-07 Thread Zilverline info
All, I've just released a new candidate (*1.0-rc3*) that now supports plugins. You can create your own extractors for various file formats. I've provided Extractors for Text, PDF, Word, and HTML. It's also possible to specify your own handlers for archives. Say you have a RAR archive, and you h

Re: Tool for analyzing analyzers

2004-05-28 Thread Zilverline info
Hi Erik, Erik Hatcher wrote: [snip] But I'd love to build a Lucene "demo" application that is powerful enough to be used as a foundation for folks to use out-of-the-box. That's just what I thought. Here's one: http://www.zilverline.org Erik -

Re: Help with web Lucene Interface

2004-05-24 Thread info
Satish Plakote wrote: Hello, today is my second day with Lucene. I want to make a search engine with Lucene for our website. I tried the docs that come with Lucene, but I am not able to generate the index for my site, nor am I able to load the war file that is given into Tomcat. I need some b

Re: How do i prevent the HTML tags being added to Lucene Index..

2004-05-20 Thread info
Or use a ready web application such as http://www.zilverline.org Michael Mahesh wrote: I am using Lucene 1.4 to index the information. I have a lot of HTML tags in the information that I will be indexing, so let me know if there is any way of removing the HTML tags from being indexed. MAHESH --

Zilverline webapplication

2004-04-30 Thread info
All, For those that are interested, I've created a web application based on Lucene that's ready to roll, and can be simply dropped in a Servlet Engine. It runs out of the box, doing PDF, WORD, HTML, TXT, and can (on Windows for now) index zip, rar, and CHM. I've just put up a website for it,