Thanks Andrzej and Pasha for your prompt replies and suggestions.
I will try everything you have suggested and report back on the findings!
regards
-pedja
Pasha Bizhan said the following on 2/25/2005 6:32 PM:
Hi,
whole document was indexed or not.
Luke can help you to give an answer the question
pedja
[EMAIL PROTECTED] said the following on 2/24/2005 2:08 PM:
Hi everyone
I'm having a bizzare problem with a few of the documents here that do
not seem to get indexed entirely.
I use textmining WordExtractor to convert M$ Word to plain text and
then index that text.
For example one docu
indexed.
You could also try the extreme case and set that max value to the max
Integer.
Otis
--- "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> wrote:
Hi everyone
I'm having a bizzare problem with a few of the documents here that do
not seem to get indexed entirely.
I use textmining WordE
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
f-written command-line search program and it outputs
its results to the
standard output.
I guess your solution must be better ;)
If the "communication parts" of your code aren't top secret, can you please
share them with me/us?
face to
that. And some similar comments. But I'm a bit surprised there's not
a bit more in terms of use of the official java extension to php.
Thanks for the great package!
Owen
-----
To unsubscribe, e-mail: [EMAIL PROTECTE
Morus Walter said the following on 1/21/2005 2:14 AM:
No. You could do a ( ( french-query ) or ( english-query ) ) construct
using
one query. So query construction would be a bit more complex but querying
itself wouldn't change.
The first thing I'd do in your case would be to look at the differen
EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
to get the language for the particular document before creating the
analyzer.
regards
Bernhard
[EMAIL PROTECTED] schrieb:
Greetings everyone
I wonder is there a solution for analyzing both English and French
documents using the same analyzer.
Reason being is that we have predominantly English
thanks
-pedja
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
ry time and tacking on a rare book charge. Amazon.com are quoting shipping in 24hrs. Is this a new 'Boston Tea Party'?
cheers
David
-----
To unsubscribe, e-mail: [EMAIL PROTECTED]
For addit
e if I've a forum with
Mysql and a lot of files on my web, for every search I've to select
the index that I want use in my search, true? But I don't know how to
do that Lucene writes an index about the information of the DB
supported indexable filetype-collection (XML, HTML, PDF,
MSWord-DOC, RTF, Plaintext).
WBR,
Tom.
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED
eader.
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Friday, December 10, 2004 2:59 PM
To: Lucene Users List
Subject: Re: No of docs using IndexSearcher
numDocs()
http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/index/IndexR
eader.html#numDocs()
Ravi
.
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
ne support conditional operator? Like retrieve all documents where
age is greater than 21, how do I compose a query like this in Lucene is
there a different Query object to use?
Thanks,
Ramon
-
To unsubscribe, e-mail: [EMAIL PROT
ectly fine technically.
Otis
--- "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> wrote:
Here's probably a silly question, very newbish, but I had to ask.
Since I have mysql documents that contain over 30 fields each and
most of them
are added to the index, is it a common practice to add fields
nks
-pedja
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Hi Otis
I did try, here's what I get:
[EMAIL PROTECTED] tmp]# time java MemoryVsDisk 1 1 10 -r
Docs in the RAM index: 1
Docs in the FS index: 0
Total time: 142 ms
real0m0.322s
user0m0.268s
sys 0m0.033s
I tried other combinations but they dont seem to affect the outcome
e
------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
cs to fsWriter
directly, you were using an IndexReader you had opened prior to calling
fsWriter.close() to check the number of docs ... that won't work for hte
same reason.
-Hoss
---------
To unsubscr
r = new RAMDirectory();
IndexWriter ramWriter= new IndexWriter(ramDir, analyzer, true);
addDoc(ramWriter, doctype, crofileno);
System.out.println("Docs In the RAM index: " + ramWriter.docCount());
IndexWriter fsWriter = new IndexWriter(indexDir, analyzer, true);
//fsWriter.setUseCompoundFile(false);
//fsWriter.mergeFactor = 1000;
//fsWriter.maxMergeDocs = 10;
fsWriter.addIndexes(new Directory[] { ramDir });
//fsWriter.optimize();
System.out.println("Docs in the FS index: " + fsWriter.docCount());
ramWriter.close();
fsWriter.close();
Date end = new Date();
System.out.println("Lucene Added OK: " + Long.toString(end.getTime() -
start.getTime()) + " total milliseconds");
} catch (IOException e) {
throw new Exception("Something bad happened: " + e.getClass() + " with
message: " + e.getMessage());
} catch (Exception e) {
throw new Exception(" caught a " + e.getClass() + "\n with message: " +
e.getMessage());
}
}
}
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
hi morus & company;
On Thursday 18 November 2004 12:49, Morus Walter wrote:
> [EMAIL PROTECTED] writes:
> > i need to solve this search:
> > number: -10
> > range: -50 TO 5
> >
> > i need help..
> > i dont find anything using google..
>
> If your
.
but then another problem starts:
i need to use negative numbers and then all becomes crazy for me...
i need to solve this search:
number: -10
range: -50 TO 5
i need help..
i dont find anything using google..
thanks
d2clon
-----
im sorry friends.. i put the title incorrectly for two times
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
61.2372
house455.9017
house254.1266
house144.1942
house037.5
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
use455.9017
house254.1266
house144.1942
house037.5
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
c wrote:
>
> > You need your own Similarity implementation and you need to set it as
> > shown in this javadoc:
> > http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/
> > Similarity.html
> >
> > Otis
> >
> > --- "[EMAIL PROTECT
norm in
Simliarity. Should I do anything about it? or does'nt it matter?
/William
> You need your own Similarity implementation and you need to set it as
> shown in this javadoc:
>
http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/Similarit
y.html
>
> Otis
&
orted after popularity (a field) and
not by anything else. How can I do this? What classes and methods do I have
to change?
thanks,
William
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
it is not about analyzer ,i need to read text from pdf file first.
- Original Message -
From: "Chandan Tamrakar" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Wednesday, September 08, 2004 4:15 PM
Subject: Re: pdf in Chinese
&
Hi all,
i use pdfbox to parse pdf file to lucene document.when i parse Chinese
pdf file,pdfbox is not always success.
Is anyone have some advice?
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e
java.io.IOException: Lock obtain timed out
I was trying to create two instance of IndexSearcher with different index files
Is there something i've missed?
tia,
buics
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For addit
I also have problems regarding my application,
what would be the ideal memory allocation for lucene
considering my application will serve at least 20 transactions per second?
tia
--buics
On Fri, 3 Sep 2004 15:20:45 +0200, [EMAIL PROTECTED]
<[EMAIL PROTECTED]> wrote:
> Terence,
>
2web.com/ .
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Received your mail we will get back to you shortly
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
; Date sent:Wed, 24 Apr 2002 11:02:32 +0200
> From: "[EMAIL PROTECTED]" <[EMAIL PROTECTED]>
> Subject: Italian web sites
> To: [EMAIL PROTECTED]
> Send reply to:Lucene Users List
>
> > Hi
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]
> > Sent: 23. huhtikuuta 2002 10:06
> > To: [EMAIL PROTECTED]
> > Subject: Re: Lucene in action at www.mil.fi
> >
> > Hi Jari
> >
> > whre do you build your index? On filesystem? Do
top-
word list to run statistics
> on a page :-) ?!?
>
> On Wednesday 24 April 2002 11:02, [EMAIL PROTECTED] wrote:
> > Hi all,
> >
> > I'm using Jobo for spidering web sites and lucene for indexing. The
> > problem is that I'd like spidering only I
Hi all,
I'm using Jobo for spidering web sites and lucene for indexing. The
problem is that I'd like spidering only Italian web sites.
How can I see discover the country of a web site?
Dou you know some method that tou can suggest me?
Thanks
Laura
ot; at the "Powered by
"
> -section of the Lucene web site.
>
> Thanks go to all the Lucene developers - it's great stuff :D
>
> Jari Aarniala
>
> --
> Jari Aarniala
> [EMAIL PROTECTED] "death is the
> Vantaa, .fi last dance eternal"
>
>
>
>
> --
> To unsubscribe, e-mail: <mailto:lucene-user-
[EMAIL PROTECTED]>
> For additional commands, e-mail: <mailto:lucene-user-
[EMAIL PROTECTED]>
>
>
> http://www.matuschek.net/software/jobo/
>
> Otis
>
> --- "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> wrote:
> > Hi Otis,
> >
> > thanks for your reply. I have been looking for Spindle and Mojo for
2
> >
> > hours but I don
>
> > > Its easy to write a Visitor which extracts the links; should take
> > abou
> > t ten
> > > lines of code.
>
>
> __
> Do You Yahoo!?
> Yahoo! Games - play chess, backgammon, pool and more
>
e following...here
's a
> >good example of link extraction.
>
> Try http://www.quiotix.com/opensource/html-parser
>
> Its easy to write a Visitor which extracts the links; should take abou
t ten
> lines of code.
>
>
>
> --
> Brian Goetz
>
Hi all,
my name is Laura and I'm a new member of this list. I'm a long date
user of tomcat and I'm also a meber of tomcat user list.
Yesterday looking at the jakarta menu I saw lucene and I said:"What is
this?"
Reading lucene home page I understood that Lucene is a very interesting
and
57 matches
Mail list logo