Re: Transaction in Lucene

2004-11-08 Thread Bernhard Messer
The message No tvx file can appear, if you have term vectors enabled during 
index and the documents you are adding have empty fields. As an example, if you try to 
index html documents, where many of them don't have a valid html title, the message will 
raise up. Looking at the term vector relevant code, this is nothing you have to worry 
about, it is just a status message. Otis is right, it is planned for future releases to 
avoid System.out.println() statements within lucene.
regards
Bernhard 

Otis Gospodnetic schrieb:
I'm not sure about the tvx error, but I think I recall somebody
changing some code around it a month or two ago.  I also believe
System.out.println is on the TODO list for elimination.
Otis
--- commandor [EMAIL PROTECTED] wrote:
 

Hello,
I came across the following problem with No tvx file. 

How could I manage to get it?
I like to have transaction processes in Lucene.
After my reading dev-lucene and user-lucene lists and analysing what
people suggested
I made up my own.
The problem in my case is that I had to make several changes and only
than make commit.
That's why I did the following:
1. Turn off Lucene lock (setting the corresponding system variable =
false)
2. Start the loop (from the first document to the last one to change
in the index)
2.1. Open IndexReader 
2.2. Get a document by its id
2.3. Store it as local variable
2.4. IndexReader.delete(document id)
2.5. IndexReader.close()
2.6. Merge new Terms (changes) and old ones in the document I
retrieved
2.6. Open IndexWriter
2.7. Add a new made document

3. end of loop
4. Waiting for other action ends in my programm I close IndexWriter.
The Result:
Everything works fine but I had No tvx file
I really worried about it cause I read what for tvx file...
Might anybody explain me what I did wrong?
In spite of your answer I did like the following: the way of logging
messages
This message appeared with the help of System.out.println()
Investigating the code of Lucene I found a lot of places of using
System.out 

I guess it is not a very good solution espessially in so beautiful
search/indexing API. 
I guess Lucene must have a normal log to write its messages.

Thanks in advance...

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
   


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
 




Re: Does lucene makes any compression

2004-11-08 Thread Bernhard Messer
The lucene version from CVS head does now have a option to store and 
compress whole text files (binary fields within a lucene document) thru 
GZip. The index itself is not GZip compressed. Due to the nature of how 
the index is created and stored, it is very effectiv regarding to 
diskspace without the need of additional compression.

I have no idea if the new functionality is already adapted within the c# 
port.

regards
Bernhard
abdulrahman galal schrieb:
i got the c# of lucene thanks god @ 
http://sourceforge.net/projects/nlucene

what about the new version that include the compression facility ?
you did n't replay on my qustion does it compress original text files 
and its indexs like Great MG 

thanks alot
_
FREE pop-up blocking with the new MSN Toolbar - get it now! 
http://toolbar.msn.com/

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Filters for Openoffice File Indexing available (Java)

2004-11-08 Thread Joachim Arrasz
Hello List.
we have written an application which includes OpenOffice Integration 
into an OpenSource CMS (OpenCms).

For this CMS there is a Lucene Integration available under sourceforge.
So now we are looking for search and index Filters for Lucene, that 
were able to integrate out OpenOffice Files also into search result.

Is there any project or code available for doing this, or must we write 
everything by ourself? Do anybody know good beginner Tutorials for doing 
things like this?

Best Regards
Joachim Arrasz
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Need Help

2004-11-08 Thread Chandrashekhar
Hi,
I have query regarding index file portability of lucene 1.4 and clucene 0.8.12.
I have created index file in Java  - lucene 1.4  and now want to search some 
term in the same index file by using clucene.
I am not getting results if i do that. 
So just wanted to make sure, does it support such kind of interportability?


  
With Regards,
Chandrashekhar V Deshmukh
Sr. System Analyst
Cybage Software Pvt. Ltd. (a CMM Level 3 company)
Phone(O) : 91-20-4041700, 91-20-4044700 Ext: 804
Cell : 91-9822749239
Fax : 91-20-4041701 , 4041702
[EMAIL PROTECTED]
www.cybage.com

Lock Issues

2004-11-08 Thread Luke Shannon
Hi;

I am working to integrate lucene into a internal site. The site contains .pdf, 
.doc and .xml files. I need to index all of them.

My indexer runs in the do get method of a servlet for now (I am trying to make 
a demo for my boss).

Once I get this working I will move all the logic into a class that can be 
instantiated by the application.

Things we working ok until I integrated the pdfbox. Now I get:

java.io.IOException: Lock obtain timed out: [EMAIL PROTECTED]:\Program 
Files\Apache Software Foundation\Tomcat 
5.0\temp\lucene-b826e0c22cbd003cd1a6f908eda9894f-write.lock

I think there might be a conflict between the reader I have open already and 
what ever happens in here:

LucenePDFDocument.getDocument(file);

Has anyone done something like this before? Anyone have any ideas what I can do?

Thanks,

Luke

Re: Lucene external field storage contribution

2004-11-08 Thread Miles Barr
 On Sun, 07 Nov 2004 13:51:23, Kevin A. Burton wrote:
 About 3 months ago I developed a external storage engine which ties
 into lucene. 
  
 I'd like to discuss making a contribution so that this is integrated
 into a future version of Lucene.
  
 I'm going to paste my original PROPOSAL in this email. 
  
 There wasn't a ton of feedback first time around but I figure squeaky
 wheel gets the grease...

I'd be interested in this type of functionality. Could you raise an
issue in bugzilla so it's easier to track?


Cheers,
-- 
Miles Barr [EMAIL PROTECTED]
Runtime Collective


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Need Help

2004-11-08 Thread Otis Gospodnetic
Hello,

You should double-check with CLucene community, but from my research
for Lucene in Action CLucene's index is not compatible with that of
Lucene 1.4, so you will not be able to use the same index with both
Lucene and CLucene.

Otis

--- Chandrashekhar [EMAIL PROTECTED] wrote:

 Hi,
 I have query regarding index file portability of lucene 1.4 and
 clucene 0.8.12.
 I have created index file in Java  - lucene 1.4  and now want to
 search some term in the same index file by using clucene.
 I am not getting results if i do that. 
 So just wanted to make sure, does it support such kind of
 interportability?
 
 
   
 With Regards,
 Chandrashekhar V Deshmukh
 Sr. System Analyst
 Cybage Software Pvt. Ltd. (a CMM Level 3 company)
 Phone(O) : 91-20-4041700, 91-20-4044700 Ext: 804
 Cell : 91-9822749239
 Fax : 91-20-4041701 , 4041702
 [EMAIL PROTECTED]
 www.cybage.com


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Thread Safe Lucene

2004-11-08 Thread Erik Hatcher
On Nov 8, 2004, at 12:34 PM, Ramon Aseniero wrote:
Is Lucene thread safe? What happens to searches already in progress 
while
the index is being updated?
Searches work fine while another process is indexing.  An already 
instantiated IndexSearcher will only see the documents in the index at 
the time it was instantiated, not new ones.

Erik
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Windows Bug?

2004-11-08 Thread yahootintin . 1247688
Hi,



My understanding is that I can have an IndexReader open for searching
(as long as it doesn't delete) while an IndexWriter is updating the index.
 



I wrote a simple test app to prove this and it works great on Mac OS
X, Java 1.4.2 and Lucene 1.4.2.  It fails on Windows XP, Java 1.4.2 and Lucene
1.4.2.  I tried other versions of Lucene and it failed in those too.



This
is the app that fails on Windows:



public static void main(String[] args)


throws Exception {

  String indexFolder = /TestIndex;

 

  // add
a document to the index

  IndexWriter indexWriter = new IndexWriter

   
(indexFolder, 

 new StandardAnalyzer(), true);

  Document document =
new Document();

  Field field = new Field(foo, bar, 

  
   true, true, true)

  document.add(field);

  indexWriter.addDocument(document);

  indexWriter.close();



  // open an index reader but don't close it


 IndexReader indexReader =

IndexReader.open(indexFolder);



  // open
an index writer

  indexWriter = new IndexWriter

(indexFolder, 

   
 new StandardAnalyzer(), true);

  indexWriter.close();

}



On Windows XP
this throws an Exception as soon as it tries to open the IndexWriter after
the IndexReader has been opened.



Here's the stack trace:

Exception in
thread main java.io.IOException: Cannot delete _1.cfs

  at org.apache.lucene.store.FSDirectory.create(FSDirectory.java:144)

  at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:105)

  at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:83)

  at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:173)

 
at scratch.TestLuceneLocks.main(TestLuceneLocks.java:17)



Is this a bug?



Thanks.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Windows Bug?

2004-11-08 Thread Justin Swanhart
The reason this is failing is because you are trying to create a new
index in the directory.  It works on *nix file systems because you can
delete an open file on those operating systems, something you can't do
under Windows.

If you change the create parameter to false on your second call
everything should work as you expect it to.


On 8 Nov 2004 18:27:12 -, [EMAIL PROTECTED]
[EMAIL PROTECTED] wrote:
 Hi,
 
 My understanding is that I can have an IndexReader open for searching
 (as long as it doesn't delete) while an IndexWriter is updating the index.
 
 I wrote a simple test app to prove this and it works great on Mac OS
 X, Java 1.4.2 and Lucene 1.4.2.  It fails on Windows XP, Java 1.4.2 and Lucene
 1.4.2.  I tried other versions of Lucene and it failed in those too.
 
 This
 is the app that fails on Windows:
 
 public static void main(String[] args)
 
 throws Exception {
 
   String indexFolder = /TestIndex;
 
   // add
 a document to the index
 
   IndexWriter indexWriter = new IndexWriter
 
 (indexFolder,
 
  new StandardAnalyzer(), true);
 
   Document document =
 new Document();
 
   Field field = new Field(foo, bar,
 
true, true, true)
 
   document.add(field);
 
   indexWriter.addDocument(document);
 
   indexWriter.close();
 
   // open an index reader but don't close it
 
  IndexReader indexReader =
 
 IndexReader.open(indexFolder);
 
   // open
 an index writer
 
   indexWriter = new IndexWriter
 
 (indexFolder,
 
  new StandardAnalyzer(), true);
 
   indexWriter.close();
 
 }
 
 On Windows XP
 this throws an Exception as soon as it tries to open the IndexWriter after
 the IndexReader has been opened.
 
 Here's the stack trace:
 
 Exception in
 thread main java.io.IOException: Cannot delete _1.cfs
 
   at org.apache.lucene.store.FSDirectory.create(FSDirectory.java:144)
 
   at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:105)
 
   at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:83)
 
   at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:173)
 
 at scratch.TestLuceneLocks.main(TestLuceneLocks.java:17)
 
 Is this a bug?
 
 Thanks.
 
 -
 To unsubscribe, e-mail: [EMAIL PROTECTED]
 For additional commands, e-mail: [EMAIL PROTECTED]
 


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Windows Bug?

2004-11-08 Thread yahootintin . 1247688
Thanks!  I just figured that out.  I was passing true by mistake.



--- Lucene
Users List [EMAIL PROTECTED] wrote:

The reason this is failing
is because you are trying to create a new

 index in the directory.  It works
on *nix file systems because you can

 delete an open file on those operating
systems, something you can't do

 under Windows.

 

 If you change the
create parameter to false on your second call

 everything should work as
you expect it to.

 

 

 On 8 Nov 2004 18:27:12 -, [EMAIL PROTECTED]

 [EMAIL PROTECTED] wrote:

  Hi,

  

  My understanding
is that I can have an IndexReader open for searching

  (as long as it doesn't
delete) while an IndexWriter is updating the index.

  

  I wrote a simple
test app to prove this and it works great on Mac OS

  X, Java 1.4.2 and
Lucene 1.4.2.  It fails on Windows XP, Java 1.4.2 and Lucene

  1.4.2. 
I tried other versions of Lucene and it failed in those too.

  

  This

  is the app that fails on Windows:

  

  public static void main(String[]
args)

  

  throws Exception {

  

String indexFolder = /TestIndex;

  

// add

  a document to the index

  

IndexWriter
indexWriter = new IndexWriter

  

  (indexFolder,

  

   new
StandardAnalyzer(), true);

  

Document document =

  new Document();

  

Field field = new Field(foo, bar,

  

 true,
true, true)

  

document.add(field);

  

indexWriter.addDocument(document);

  

indexWriter.close();

  

// open an index reader but
don't close it

  

   IndexReader indexReader =

  

  IndexReader.open(indexFolder);

  

// open

  an index writer

  

indexWriter = new IndexWriter

  

  (indexFolder,

  

   new StandardAnalyzer(), true);


 

indexWriter.close();

  

  }

  

  On Windows XP

 
this throws an Exception as soon as it tries to open the IndexWriter after

  the IndexReader has been opened.

  

  Here's the stack trace:

  

  Exception in

  thread main java.io.IOException: Cannot delete
_1.cfs

  

at org.apache.lucene.store.FSDirectory.create(FSDirectory.java:144)

  

at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:105)

  

at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:83)

  

at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:173)

  

  at scratch.TestLuceneLocks.main(TestLuceneLocks.java:17)

  

  Is this a bug?

  

  Thanks.

  

  -

  To unsubscribe, e-mail: [EMAIL PROTECTED]


 For additional commands, e-mail: [EMAIL PROTECTED]


 

 

 

 -

 To unsubscribe, e-mail: [EMAIL PROTECTED]

 For
additional commands, e-mail: [EMAIL PROTECTED]

 

 

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



IndexSearch

2004-11-08 Thread Ramon Aseniero
Hi All,

 

Can IndexSearcher be persisted? Are there any limitations on index updates
while searches are in progress? Any file locking issues?

 

Thanks,

Ramon



Re: IndexSearch

2004-11-08 Thread Justin Swanhart
You can write to the index and read from it at the same time. You can
only have one IndexWriter open at any one time.

IndexSearchers will only see documents that were created before they
were instantiated, so you need to create new ones periodically to see
new documents.


On Mon, 8 Nov 2004 14:26:40 -0800, Ramon Aseniero
[EMAIL PROTECTED] wrote:
 Hi All,
 
 Can IndexSearcher be persisted? Are there any limitations on index updates
 while searches are in progress? Any file locking issues?
 
 Thanks,
 
 Ramon
 


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]