Index locked exception while updating index

2005-07-17 Thread Harini Raghavan
Hi All, I am quite new to Lucene and I have problem with locking. I have a MessageDrivenBean that sends messages to my Lucene indexer whenever there is a new database update. The indexer updates the index incrementally . Below is the code fragment in the indexer method that gets invoked by the

Re: Index locked exception while updating index

2005-07-17 Thread Harini Raghavan
You are catching IOException in the finally block, but you are not even printing out the exception stack trace. Perhaps you are not able to close your IndexWriter for some reason. Otis --- Harini Raghavan <[EMAIL PROTECTED]> wrote: Hi All, I am quite new to Lucene and I have problem

Re: Index locked exception while updating index

2005-07-18 Thread Harini Raghavan
ses the current writer to persist the additions, and opens a new one it it's palce for future updates. ...if you're using JDK1.5, take a look at java.util.concurrent.atomic.AtomicReference, it should make managing a singleton IndexWriter really easy. : Date: Mon, 18 Jul 2005 10:12

Share Index on NFS

2009-04-21 Thread Harini Raghavan
Hi Everyone, We are planning to distribute searches on the index and have a single indexing node. We want to mount the index on NFS so that it can be shared by the indexer and searcher nodes. To optimize several of our search workflows, we are caching the IndexSearcher and refreshing it every h

Re: Highlighting "really" found terms

2006-10-23 Thread Harini Raghavan
ce). Is there a way to highlight only the terms really found ? Thanks a lot ! Pierre - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL P

Merge Index Filling up Disk Space

2006-12-21 Thread Harini Raghavan
size to be allotted in such scenario. Any suggestions would be appreciated. Thanks, Harini -- Harini Raghavan Software Engineer Office : +91-40-23556255 [EMAIL PROTECTED] we think, you sell www.InsideView.com InsideView - To

Re: Merge Index Filling up Disk Space

2006-12-22 Thread Harini Raghavan
uld that also be related to having searchers open while running optimize? -Harini Michael McCandless wrote: Harini Raghavan wrote: I am using lucene 1.9.1 for search functionality in my j2ee application using JBoss as app server. The lucene index directory size is almost 20G right now. Th

Re: Merge Index Filling up Disk Space

2006-12-26 Thread Harini Raghavan
Yes I think I got hit IOException. I assumed that the.tmp files are not required and deleted them manually from the indes directory as they were more than 10G. Is that ok? Michael McCandless wrote: Harini Raghavan wrote: Thank you for the response. I don't have readers open on the

Re: Merge Index Filling up Disk Space

2006-12-26 Thread Harini Raghavan
the index for searching. - Mark Harini Raghavan wrote: Hi Mike, Thank you for the response. I don't have readers open on the index, but while the optimize/merge was running I was searching on the index. Would that make any difference? Also after the optimizing the index I had some .tmp

Modelling Relational Lucene Index

2006-12-27 Thread Harini Raghavan
s a text-search engine, not a relational database. This kind of relation may be perfectly valid to implement in Lucene, but you want to be careful if you find yourself trying to do any more RDBMS-like things. Best Erick On 12/26/06, Harini Raghavan <[EMAIL PROTECTED]> wrote: Hi, I have

RE: Counting term frequency without using Explanation

2007-02-19 Thread Harini Raghavan
Hi Erick, I have a similar requirement to know the frequency of occurrence of a keyword in a given content to find out the relevancy of the article to a set of keywords. If the keyword is mentioned more than once in the article, then I want to treat it as more relevant. Can you please point me t

Delete document with keyword field

2007-03-10 Thread Harini Raghavan
Hi All, I have a lucene index with many fields, one of which is a Keyword field IS. The values stored in this field are the document ids like _839930494, _839930492. But I am unable to delete the documents using this id. Is this something to do with the underscore? Can someone suggest how I shou

Re: Delete document with keyword field

2007-03-11 Thread Harini Raghavan
what you really have in your index is, as you suspect, 839930494, it all depends upon the analyzer you used. Erick On 3/11/07, Harini Raghavan <[EMAIL PROTECTED]> wrote: > > Hi All, > > I have a lucene index with many fields, one of which is a Keyword field > IS. > The v

Scalability Issues with Indexing

2007-03-12 Thread Harini Raghavan
Hi Everyone, We have been using Lucene integrated with our application for over a year now. The indexing and searching has been pretty fast until recently. But now we are having some scalability issues. We have a job that indexes around 2 documents in to index every day. There are 2 processes

Re: ERROR opening the Index - contact sysadmin!

2007-06-12 Thread Harini Raghavan
Hi All, I am trying to create a lucene query to search for companies based on areacode. The phone no. is stored in the lucene index in the form of '415-567-2323'. I need to create a query like +areaCode:"415-". But the QueryParser is stripping off the hyphen(-). Here is the code to create the qu

QueryParser stripping special char

2007-06-12 Thread Harini Raghavan
Hi All, I am trying to create a lucene query to search for companies based on areacode. The phone no. is stored in the lucene index in the form of '415-567-2323'. I need to create a query like +areaCode:"415-". But the QueryParser is stripping off the hyphen(-). Here is the code to create the qu

unused tmp fdt files in index

2007-07-09 Thread Harini Raghavan
Hi All, I have a large lucene index of size 60G. We have had Out Of Memory issues a few times in the past due to which the indexing had got interrupted. This has resulted in a lot of .fnm, .fdt,.tmp files which don't get removed even through optimizing the index. We have data for last 90 days in

Re: unused tmp fdt files in index

2007-07-10 Thread Harini Raghavan
etable, it is should be safe to delete these files. Please backup your data before testing. Andy -Original Message- From: Harini Raghavan [mailto:[EMAIL PROTECTED] Sent: Tuesday, July 10, 2007 2:29 PM To: java-user@lucene.apache.org Subject: unused tmp fdt files in index Hi All, I have a

Lucene 2.2 + Not Merging Segments

2007-07-23 Thread Harini Raghavan
Hi All, I just upgraded my application to use lucene 2.2. Prior to this, I was using lucene 1.9.1 and the addDocument method in the IndexWriter used to keep merging smaller segments while adding new documents depending on the mergeFactor. I was using the default mergeFactor settings. But after th

Re: Lucene 2.2 + Not Merging Segments

2007-07-24 Thread Harini Raghavan
McCandless <[EMAIL PROTECTED]> wrote: This is spooky. Could you provide more details about how you are using IndexWriter? Are you changing "max buffered docs" from the default? Also, can you call writer.setInfoStream(System.out), capture that log, and post it? Mike "Har

Searching with too many clauses + Out of Memory

2007-08-01 Thread Harini Raghavan
Hi Everyone, I am using Compass 1.1 M2 which supports Lucene 2.2 to store & search huge amount of company, executive and employment data. There are some usecases where I need to search for executives/employments on the result set of company search. But when I try to create a compass query to sear

Query Analyzer Issue

2007-08-31 Thread Harini Raghavan
Hi Everyone, I am facing some strange behaviour with Analyzers. I am using SimpleAnalyzer for some fields in my Compass entity, but I also wrote a custom Analyzer that is slightly different from the SimpleAnalyzer as I wanted to allow even letters and digits in company name column. So custom analy

Re: Index locked exception while updating index

2005-07-24 Thread Harini Raghavan
Hi All, I am trying to add paging functionality while using lucene search. I have created a PageFilter what takes in the current page num and the number of records as input and invoking the IndexSearcher passing the a Boolean Query object and the PageFilter. The search returns around 1000 reco

Implementing paging functionality in lucene

2005-07-24 Thread Harini Raghavan
Hi All, I am trying to add paging functionality while using lucene search. I have created a PageFilter what takes in the current page num and the number of records as input and invoking the IndexSearcher passing the a Boolean Query object and the PageFilter. The search returns around 1000 records

OutOfMemory errors while indexing large documents

2005-07-25 Thread Harini Raghavan
Hi All, I am using lucene to index large documents(HTML pages). The application is running on JBoss and MySQL on UNIX. The indexing is throwing OutOfMemory errors beyond a certain point. I am not sure why this is happening. I am using the default IndexWriter properties, but the lucene documenta

Re: OutOfMemory errors while indexing large documents

2005-07-25 Thread Harini Raghavan
ire document and not just the first 10,000 or so terms - I set maxFieldLength to Integer.MAX_VALUE. Erik On Jul 25, 2005, at 7:30 AM, Harini Raghavan wrote: Hi All, I am using lucene to index large documents(HTML pages). The application is running on JBoss and MySQL on UNIX. The index

IOException : Lock obtain timed out

2005-07-30 Thread Harini Raghavan
lise a new IndexWriter. So I am wondering why this problem is occuring. Can someone please help? Thanks, Harini : Date: Mon, 18 Jul 2005 10:12:39 +0530 : From: Harini Raghavan <[EMAIL PROTECTED]> : Reply-To: java-user@lucene.apache.org : To: [EMAIL PROTECTED] : Cc: java-user@lucene.apache.o

Re: IOException : Lock obtain timed out

2005-07-31 Thread Harini Raghavan
The default value of IndexWriter.WRITE_LOCK_TIMEOUT property is 1000ms. Can this value be increased to some optimum value? - Original Message - From: "Harini Raghavan" <[EMAIL PROTECTED]> To: Sent: Saturday, July 30, 2005 11:23 PM Subject: IOException : Lock obtain

Stale NFS file handle Exception

2005-09-12 Thread Harini Raghavan
Hi All, I have 2 servers in the production environment, one running some Quartz jobs and the other one running the application. There is a common NFS mount which has the lucene index directory. The jobs fetch the latest data and update the lucene index. And the user can search on the index to

Lucene search is very slow

2005-10-10 Thread Harini Raghavan
Hi, I am using lucene for search functionality in my j2ee application using JBoss as app server. The lucene index directory size is almsot 10G. The performance has been quite good until now. But after the last deploy, when the server was restarted , the lucene search has become very slow. It t

Re: Lucene search is very slow

2005-10-20 Thread Harini Raghavan
http://www.dbsight.net On 10/10/05, Koji Sekiguchi <[EMAIL PROTECTED]> wrote: Is it really the part of Lucene slow? Please take thread dumps every 15 secs, 3 to 4 times. What can you look at them? Koji -Original Message- From: Harini Raghavan [mailto:[EMAIL

Re: Lucene search is very slow

2005-10-20 Thread Harini Raghavan
caching data in each index searcher. For large index, it's definitely a waste to re-create index searcher every time. Chris -- Full-Text Search on Any Databases http://www.dbsight.net On 10/20/05, Harini Raghavan <[EMAIL PROTECTED]> wrote: Hi

how to control terms to be highlighted?

2005-11-30 Thread Harini Raghavan
Hi, I have a requirement to highlight search keywords in the results and display the matching fragment of the text with the results. I am using the Hits highlighting mentioned in Lucene in Action. Here is the search query(BooleanQuery) I am passing to the IndexSearcher and QueryScorer: +Doc

Re: how to control terms to be highlighted?

2005-11-30 Thread Harini Raghavan
o have the effect of filtering out any query terms that weren't for this field. Would this change be a problem for anyone? Cheers, Mark --- Harini Raghavan <[EMAIL PROTECTED]> wrote: Hi, I have a requirement to highlight search keywords in the results and display the matching frag

Re: how to control terms to be highlighted?

2005-12-02 Thread Harini Raghavan
: >>Content:restructuring Content:restructure) ...just give the highlighter... (Content:"cost saving" Content:"cost savings" Content:outsource Content:outsources Content:downsize Content:downsizes Content:restructuring Content:restructure) : Date: Th

Re: how to control terms to be highlighted?

2005-12-05 Thread Harini Raghavan
the keywords appear(similar to google) and display below the search result. But, the above text extracted is not really the best fragment, it seems to be the first fragment which has the keywords. Has someone implemented this kind of functionality? -Harini Harini Raghavan wrote: Hi Chris

Query Scoring

2005-12-30 Thread Harini Raghavan
Hi, I have a requirement to highlight search keywords in the results and display the matching fragment of the text with the results. I am using the Hits highlighting mentioned in Lucene in Action. Here is the search query(BooleanQuery) I am passing to the IndexSearcher and QueryScorer: +Document

Re: Query Scoring

2006-01-01 Thread Harini Raghavan
e returned, but again this is unrelated to highlighting. Could you elaborate on what you're after? Erik On Dec 30, 2005, at 12:02 PM, Harini Raghavan wrote: Hi, I have a requirement to highlight search keywords in the results and display the matching fragment of the text with the re

Re: Query Scoring

2006-01-02 Thread Harini Raghavan
Thank you Chris. That seems like a good suggestion. I will try to pass a different Query object to the Highlighter api that the one used for searching. I plan to break down the HTML document and store the title/sub title/content in different fields of the index. So if I create a new query com

Deleting a Document

2006-01-08 Thread Harini Raghavan
Hi, I want to update a document in the lucene index. As mentioned in the documentation, I tried to delete the document using IndexReader.delete method. But even after I delete the document, I am able to see the document when I perform a search. I thought this could be because, I am caching the

Re: Deleting a Document

2006-01-08 Thread Harini Raghavan
Hi Koji, I am closing the reader after deleting. You can see the close being called in the finally block in the code. -Harini Koji Sekiguchi wrote: Hi Harini, Close the reader after delete docs to take effect. Hope this helps, Koji -Original Message- From: Harini Raghavan

Re: Deleting a Document

2006-01-09 Thread Harini Raghavan
Hi Koji, Thanks for the suggestion. It worked when I closed the reader before refreshing the IndexSearcher instance. Harini Koji Sekiguchi wrote: Hi Harini, I meant you close the reader first, then get a new searcher. regards, Koji -Original Message- From: Harini Raghavan

highlighting phrases

2006-01-09 Thread Harini Raghavan
Hi All, I am using the highlighter package to highlight my search results. The query I am passing to the Highlighter is: +(Content:"Apple Computer" Content:"Apple Comp") +(Title:"Apple Computer" Title:"Apple Comp") But the Highlighter is highlighting even occurances of terms 'Computer'/'Comp'.

Lock obtain timed out + IndexSearcher

2006-01-09 Thread Harini Raghavan
Hi All, All of a sudden I have started getting LockTimeOut exception while searching the index. There is no write.lock file in the index directory, so why should this issue come while searching? I tried to delete the index directory and restarted the server, but still no luck. What could be w

Re: highlighting phrases

2006-01-11 Thread Harini Raghavan
, Harini Erik Hatcher wrote: On Jan 9, 2006, at 1:16 PM, Harini Raghavan wrote: I am using the highlighter package to highlight my search results. The query I am passing to the Highlighter is: +(Content:"Apple Computer" Content:"Apple Comp") +(Title:"Apple Computer&quo

adding new fields to index

2006-05-17 Thread Harini Raghavan
Hi All, I have an existing lucene index whose size is around 10G. I need to add/delete some fields for all the documents in the index. Is there any way to do this without reindexing all the documents again? Thanks, Harini - To

indexing in lucene 1.9.1

2006-05-21 Thread Harini Raghavan
Hi All, We have recently upgraded from lucene 1.4.3 to lucene 1.9.1 version. After the upgrade, we are facing some issues: 1. Indexing seems to be behaving differently. There were more than 300 segment files(.cfs) in the index and the IndexSearcher is taking forever to refresh the index. Have t

Re: indexing in lucene 1.9.1

2006-05-22 Thread Harini Raghavan
Richmond wrote: Hello Harini, When you are finished indexing the documents are you running the optimize() method on the IndexWriter before closing it? This should reduce the number of segments and make searching faster. Just a thought. --Mike On 5/22/06, Harini Raghavan <[EMAIL PROTEC