Re: Transaction in Lucene
The message "No tvx file" can appear if term vectors are enabled during indexing and the documents you add have empty fields. For example, if you index HTML documents and many of them lack a valid HTML title, the message will be printed. Looking at the relevant term vector code, this is nothing you have to worry about; it is just a status message.

Otis is right: it is planned for future releases to remove the System.out.println() statements from Lucene.

Regards,
Bernhard

Otis Gospodnetic wrote:
> I'm not sure about the tvx error, but I think I recall somebody changing some code around it a month or two ago. I also believe System.out.println is on the TODO list for elimination.
>
> Otis
>
> --- commandor [EMAIL PROTECTED] wrote:
>> Hello,
>>
>> I came across the following problem with "No tvx file". How did I manage to get it?
>>
>> I would like to have transactional processing in Lucene. After reading the dev-lucene and user-lucene lists and analysing what people suggested, I made up my own approach. The problem in my case is that I had to make several changes and only then commit. That's why I did the following:
>>
>> 1. Turn off Lucene locking (by setting the corresponding system variable to false)
>> 2. Start the loop (from the first document to the last one to change in the index)
>> 2.1. Open IndexReader
>> 2.2. Get a document by its id
>> 2.3. Store it in a local variable
>> 2.4. IndexReader.delete(document id)
>> 2.5. IndexReader.close()
>> 2.6. Merge the new Terms (changes) with the old ones in the document I retrieved
>> 2.7. Open IndexWriter
>> 2.8. Add the newly made document
>> 3. End of loop
>> 4. After waiting for the other actions in my program to finish, I close the IndexWriter.
>>
>> The result: everything works fine, but I got "No tvx file". I am really worried about it because I read what the tvx file is for... Could anybody explain what I did wrong?
>>
>> Whatever the answer, one thing I did not like is the way messages are logged. This message was printed with System.out.println(). Investigating the Lucene code, I found a lot of places that use System.out. I don't think that is a very good solution, especially in such a beautiful search/indexing API. I think Lucene should have a proper log for its messages.
>>
>> Thanks in advance...
Re: Does lucene makes any compression
The Lucene version from CVS head now has an option to store and compress whole text files (binary fields within a Lucene document) using GZip. The index itself is not GZip-compressed. Because of the way the index is created and stored, it is already very efficient in terms of disk space without any additional compression. I have no idea whether the new functionality has already been adopted in the C# port.

Regards,
Bernhard

abdulrahman galal wrote:
> I got the C# port of Lucene, thank God, at http://sourceforge.net/projects/nlucene
>
> What about the new version that includes the compression facility? You didn't reply to my question: does it compress the original text files and its indexes, like the great MG?
>
> Thanks a lot
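As a rough illustration of what GZip-compressing a whole text document before storing it looks like, here is a minimal sketch that uses only java.util.zip from the JDK. The class GzipFieldSketch and its methods are hypothetical names made up for this example. It does not show the Lucene option mentioned above, only the kind of byte array that a binary stored field could hold.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

/** Hypothetical helper, not part of Lucene: GZip a text payload before storing it. */
class GzipFieldSketch {

    /** Compresses text into a GZip byte array (the sort of value a binary field could hold). */
    public static byte[] compress(String text) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream gzip = new GZIPOutputStream(buffer)) {
            gzip.write(text.getBytes(StandardCharsets.UTF_8));
        }
        return buffer.toByteArray();
    }

    /** Restores the original text from a GZip byte array. */
    public static String decompress(byte[] compressed) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPInputStream gzip = new GZIPInputStream(new ByteArrayInputStream(compressed))) {
            byte[] chunk = new byte[4096];
            int n;
            while ((n = gzip.read(chunk)) != -1) {
                out.write(chunk, 0, n);
            }
        }
        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // Repetitive sample text, so the size difference is easy to see.
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 200; i++) {
            sb.append("the quick brown fox jumps over the lazy dog ");
        }
        String text = sb.toString();
        byte[] packed = compress(text);
        System.out.println(text.length() + " chars -> " + packed.length + " bytes");
        System.out.println("round trip ok: " + text.equals(decompress(packed)));
    }
}
```

The inverted index is left alone in this picture; only the stored copy of the document text is compressed, which matches the distinction made in the reply.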
Filters for Openoffice File Indexing available (Java)
Hello list,

We have written an application that integrates OpenOffice into an open-source CMS (OpenCms). A Lucene integration for this CMS is available on SourceForge. We are now looking for search and index filters for Lucene that can bring our OpenOffice files into the search results as well.

Is there any project or code available for doing this, or must we write everything ourselves? Does anybody know good beginner tutorials for things like this?

Best regards,
Joachim Arrasz
Need Help
Hi,

I have a query regarding index file portability between Lucene 1.4 and CLucene 0.8.12. I created an index in Java with Lucene 1.4 and now want to search for a term in the same index using CLucene. I am not getting any results when I do that. So I just wanted to make sure: does it support this kind of interoperability?

With regards,
Chandrashekhar V Deshmukh
Sr. System Analyst
Cybage Software Pvt. Ltd. (a CMM Level 3 company)
Phone (O): 91-20-4041700, 91-20-4044700 Ext: 804
Cell: 91-9822749239
Fax: 91-20-4041701, 4041702
[EMAIL PROTECTED]
www.cybage.com
Lock Issues
Hi,

I am working to integrate Lucene into an internal site. The site contains .pdf, .doc and .xml files, and I need to index all of them. For now my indexer runs in the doGet method of a servlet (I am trying to make a demo for my boss). Once I get this working I will move all the logic into a class that can be instantiated by the application.

Things were working OK until I integrated PDFBox. Now I get:

    java.io.IOException: Lock obtain timed out: [EMAIL PROTECTED]:\Program Files\Apache Software Foundation\Tomcat 5.0\temp\lucene-b826e0c22cbd003cd1a6f908eda9894f-write.lock

I think there might be a conflict between the reader I already have open and whatever happens in here:

    LucenePDFDocument.getDocument(file);

Has anyone done something like this before? Does anyone have any ideas about what I can do?

Thanks,
Luke
Re: Lucene external field storage contribution
On Sun, 07 Nov 2004 13:51:23, Kevin A. Burton wrote:
> About 3 months ago I developed an external storage engine which ties into Lucene. I'd like to discuss making a contribution so that this is integrated into a future version of Lucene. I'm going to paste my original PROPOSAL in this email. There wasn't a ton of feedback first time around, but I figure the squeaky wheel gets the grease...

I'd be interested in this type of functionality. Could you raise an issue in Bugzilla so it's easier to track?

Cheers,
--
Miles Barr [EMAIL PROTECTED]
Runtime Collective
Re: Need Help
Hello,

You should double-check with the CLucene community, but from my research for Lucene in Action, CLucene's index is not compatible with that of Lucene 1.4, so you will not be able to use the same index with both Lucene and CLucene.

Otis

--- Chandrashekhar [EMAIL PROTECTED] wrote:
> Hi, I have a query regarding index file portability between Lucene 1.4 and CLucene 0.8.12. I created an index in Java with Lucene 1.4 and now want to search for a term in the same index using CLucene. I am not getting any results when I do that. So I just wanted to make sure: does it support this kind of interoperability?
Re: Thread Safe Lucene
On Nov 8, 2004, at 12:34 PM, Ramon Aseniero wrote:
> Is Lucene thread safe? What happens to searches already in progress while the index is being updated?

Searches work fine while another process is indexing. An already instantiated IndexSearcher will only see the documents that were in the index at the time it was instantiated, not new ones.

Erik
Windows Bug?
Hi,

My understanding is that I can have an IndexReader open for searching (as long as it doesn't delete) while an IndexWriter is updating the index. I wrote a simple test app to prove this, and it works great on Mac OS X with Java 1.4.2 and Lucene 1.4.2. It fails on Windows XP with Java 1.4.2 and Lucene 1.4.2. I tried other versions of Lucene and it failed with those too.

This is the app that fails on Windows:

    public static void main(String[] args) throws Exception {
        String indexFolder = "/TestIndex";

        // add a document to the index
        IndexWriter indexWriter = new IndexWriter(indexFolder, new StandardAnalyzer(), true);
        Document document = new Document();
        Field field = new Field("foo", "bar", true, true, true);
        document.add(field);
        indexWriter.addDocument(document);
        indexWriter.close();

        // open an index reader but don't close it
        IndexReader indexReader = IndexReader.open(indexFolder);

        // open an index writer
        indexWriter = new IndexWriter(indexFolder, new StandardAnalyzer(), true);
        indexWriter.close();
    }

On Windows XP this throws an Exception as soon as it tries to open the IndexWriter after the IndexReader has been opened. Here's the stack trace:

    Exception in thread "main" java.io.IOException: Cannot delete _1.cfs
        at org.apache.lucene.store.FSDirectory.create(FSDirectory.java:144)
        at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:105)
        at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:83)
        at org.apache.lucene.index.IndexWriter.<init>(IndexWriter.java:173)
        at scratch.TestLuceneLocks.main(TestLuceneLocks.java:17)

Is this a bug?

Thanks.
Re: Windows Bug?
The reason this fails is that your second IndexWriter is also trying to create a new index in the directory. Creating a new index deletes the existing index files, and the open IndexReader still holds one of them (_1.cfs in your stack trace). It works on *nix file systems because you can delete an open file on those operating systems, something you can't do under Windows. If you change the create parameter to false on your second call, everything should work as you expect.

On 8 Nov 2004 18:27:12, [EMAIL PROTECTED] wrote:
> [...]
> On Windows XP this throws an Exception as soon as it tries to open the IndexWriter after the IndexReader has been opened.
> [...]
> Is this a bug?
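The operating-system difference described in this reply can be observed without Lucene. The following is a minimal sketch using only java.io; DeleteOpenFileSketch and canDeleteWhileOpen are hypothetical names invented for this example. It keeps a stream open on a temporary file and tries to delete the file. On Unix-like systems the delete normally succeeds, while on Windows it is expected to fail for as long as the stream stays open.

```java
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;

/** Hypothetical stand-alone demo, independent of Lucene. */
class DeleteOpenFileSketch {

    /**
     * Creates a temp file, keeps a stream open on it, and tries to delete it.
     * Returns true if the delete succeeded while the file was still open.
     */
    public static boolean canDeleteWhileOpen() throws IOException {
        File file = File.createTempFile("open-file-demo", ".tmp");
        boolean deleted;
        try (FileInputStream in = new FileInputStream(file)) {
            // Attempted while 'in' still holds the file open.
            deleted = file.delete();
        }
        if (!deleted) {
            // Clean up now that the stream has been closed.
            file.delete();
        }
        return deleted;
    }

    public static void main(String[] args) throws IOException {
        System.out.println(System.getProperty("os.name")
                + ": delete while open succeeded = " + canDeleteWhileOpen());
    }
}
```

The open IndexReader in the test app plays the role of the open stream here, which is why only the writer opened with create set to true runs into the problem.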
Re: Windows Bug?
Thanks! I just figured that out. I was passing true by mistake.

--- Lucene Users List [EMAIL PROTECTED] wrote:
> The reason this is failing is because you are trying to create a new index in the directory. [...] If you change the create parameter to false on your second call everything should work as you expect it to.
IndexSearch
Hi All, Can IndexSearcher be persisted? Are there any limitations on index updates while searches are in progress? Any file locking issues? Thanks, Ramon
Re: IndexSearch
You can write to the index and read from it at the same time, but only one IndexWriter can be open at any one time. An IndexSearcher only sees documents that were added before it was instantiated, so you need to create a new one periodically to see new documents.

On Mon, 8 Nov 2004 14:26:40 -0800, Ramon Aseniero [EMAIL PROTECTED] wrote:
> Hi All, Can IndexSearcher be persisted? Are there any limitations on index updates while searches are in progress? Any file locking issues? Thanks, Ramon
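One way to picture the "create new ones periodically" advice is a small holder that hands out the current snapshot and rebuilds it only when the index has changed. The sketch below is plain Java and a model only: RefreshingHolder is a hypothetical class, not a Lucene one. It assumes two things the thread does not specify, namely a cheap way to read some version marker of the index and a factory that opens a fresh searcher.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.atomic.AtomicLong;
import java.util.function.LongSupplier;
import java.util.function.Supplier;

/**
 * Hypothetical helper, not a Lucene class: hands out a snapshot-style resource
 * (for example a searcher) and rebuilds it when a version marker has moved.
 */
class RefreshingHolder<T> {
    private final LongSupplier currentVersion; // assumption: cheap read of the index version
    private final Supplier<T> factory;         // assumption: opens a fresh snapshot/searcher
    private long openedAtVersion;
    private T resource;

    RefreshingHolder(LongSupplier currentVersion, Supplier<T> factory) {
        this.currentVersion = currentVersion;
        this.factory = factory;
        this.openedAtVersion = currentVersion.getAsLong();
        this.resource = factory.get();
    }

    /**
     * Returns the current resource, reopening it first if the version changed.
     * Closing the old resource once no search still uses it is left out here.
     */
    public synchronized T get() {
        long now = currentVersion.getAsLong();
        if (now != openedAtVersion) {
            resource = factory.get();
            openedAtVersion = now;
        }
        return resource;
    }

    public static void main(String[] args) {
        // Stand-ins for an index and its version marker.
        List<String> index = new ArrayList<>();
        AtomicLong version = new AtomicLong();
        index.add("doc1");

        RefreshingHolder<List<String>> holder =
                new RefreshingHolder<>(version::get, () -> new ArrayList<>(index));

        List<String> before = holder.get();
        index.add("doc2");           // a "writer" adds a document
        version.incrementAndGet();   // and the version marker moves

        System.out.println("old snapshot sees " + before.size() + " document(s)");
        System.out.println("new snapshot sees " + holder.get().size() + " document(s)");
    }
}
```

Searches that already hold the old snapshot keep working against it, which mirrors the behaviour described in the reply; only callers that ask the holder again pick up the new documents.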