Re: Solr performance for Instance updates

2008-09-26 Thread Otis Gospodnetic
You can add in real-time. You are thinking of "commit" as an RDBMS commit, I assume. That happens "automatically". Solr has a notion of "commit", too, but it's different from the DB one. I have a feeling you haven't really looked at the Solr tutorial yet. Want to give that a try first? Oti

Re: Solr performance for Instance updates

2008-09-26 Thread mahendra mahendra
Hi, I want to update each doc instantly (based on DB changes) and commit. I expect every commit to take more time. I don't want to post docs in bulk and then commit. How would the performance be for this scenario... Also every time if I am going to update the docs the index size is going to incre

Re: Anyproblem in running two solr instances on the same machine using the same directory ?

2008-09-26 Thread Yonik Seeley
On Fri, Sep 26, 2008 at 2:18 AM, Jagadish Rath <[EMAIL PROTECTED]> wrote:
> What are the other solutions to the problem of "maxWarmingSearchers
> limit exceeded" error?

Don't commit so rapidly? What is the reason for your high commit rate?

-Yonik
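For readers landing on this thread: one way to read "don't commit so rapidly" is to decouple updates from commits on the client side. The sketch below is only an illustration under that assumption. `CommitThrottle` and its methods are hypothetical names, not part of Solr, and the code makes no Solr calls; the caller would issue the real commit whenever `shouldCommit` returns true. Solr's `autoCommit` setting in solrconfig.xml may serve the same purpose on the server side.

```java
// Hypothetical client-side helper (not part of Solr): limits how often the
// caller issues a commit, no matter how often documents are updated.
final class CommitThrottle {
    private final long minIntervalMillis; // minimum gap between two commits
    private long lastCommitMillis;        // time of the most recent commit
    private int pendingUpdates = 0;       // updates sent since that commit

    CommitThrottle(long minIntervalMillis, long startMillis) {
        this.minIntervalMillis = minIntervalMillis;
        this.lastCommitMillis = startMillis;
    }

    // Call after each add/update request has been sent.
    void recordUpdate() {
        pendingUpdates++;
    }

    int pendingUpdates() {
        return pendingUpdates;
    }

    // Returns true when the caller should commit now; the caller then issues
    // the real commit itself. The counter is reset only when the answer is yes.
    boolean shouldCommit(long nowMillis) {
        boolean due = nowMillis - lastCommitMillis >= minIntervalMillis;
        if (pendingUpdates == 0 || !due) {
            return false;
        }
        pendingUpdates = 0;
        lastCommitMillis = nowMillis;
        return true;
    }

    public static void main(String[] args) {
        CommitThrottle throttle = new CommitThrottle(1000L, 0L);
        throttle.recordUpdate();
        System.out.println(throttle.shouldCommit(200L));  // false: too soon
        throttle.recordUpdate();
        System.out.println(throttle.shouldCommit(1000L)); // true: interval elapsed
    }
}
```

The clock value is passed in rather than read inside the class, so the behaviour can be checked without waiting for real time to pass.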

Re: Integrating external stemmer in Solr and pre-processing text

2008-09-26 Thread Jaco
Thanks for these suggestions, will try it in the coming days and post my findings in this thread. Bye, Jaco. 2008/9/26 Grant Ingersoll <[EMAIL PROTECTED]> > > On Sep 26, 2008, at 12:05 PM, Jaco wrote: > > Hi Grant, >> >> In reply to your questions: >> >> 1. Are you having to restart/initialize

Re: Solr performance for Instance updates

2008-09-26 Thread Otis Gospodnetic
Hi, - Original Message > From: mahendra mahendra <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Friday, September 26, 2008 3:52:57 PM > Subject: Solr performance for Instance updates > > Hi, > > We want to update the index based on TIB listener, whenever database chang

Re: Integrating external stemmer in Solr and pre-processing text

2008-09-26 Thread Grant Ingersoll
On Sep 26, 2008, at 12:05 PM, Jaco wrote: Hi Grant, In reply to your questions: 1. Are you having to restart/initialize the stemmer every time for your "slow" approach? Does that really need to happen? It is invoking a COM object in Windows. The object is instantiated once for a token

Solr performance for Instance updates

2008-09-26 Thread mahendra mahendra
Hi, We want to update the index based on a TIB listener: whenever database changes happen, we want to update the index instantly. This may happen very frequently for a number of records. Could anyone please tell me what the performance would be for these scenarios? Question related linguistic supp

ApacheCon US promo

2008-09-26 Thread Grant Ingersoll
Cross-posting... Just wanted to let everyone know that there will be a number of Lucene/Solr/Mahout/Tika related talks, training sessions, and Birds of a Feather (BOF) gatherings at ApacheCon New Orleans this fall. Details: When: November 3-7 Where: Sheraton, New Orleans, USA URL: http://u

Re: Integrating external stemmer in Solr and pre-processing text

2008-09-26 Thread Jaco
The overhead is not in the instantiation, but in the actual call to the COM object. The approach with one-time instantiation in the TokenFilterFactory, and the use of that object in the TokenFilter, is exactly what I tried. There is a factor of 10 performance gain when being able to do a single call
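The message above locates the cost in each call to the external object rather than in creating it. A minimal sketch of that trade-off follows, under loud assumptions: `CountingStemmer` is a hypothetical stand-in for the COM stemmer (it just strips a trailing "s" and counts invocations), and no Solr, Lucene, or com4j API is used. It only shows that joining tokens and stemming them in one call gives the same result with far fewer calls.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class BatchStemDemo {
    // Per-token approach: one external call for every token.
    static List<String> stemPerToken(CountingStemmer stemmer, List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(stemmer.stemText(token));
        }
        return out;
    }

    // Batched approach: join the tokens, make a single external call, split the result.
    // Assumes tokens contain no whitespace themselves.
    static List<String> stemBatched(CountingStemmer stemmer, List<String> tokens) {
        if (tokens.isEmpty()) {
            return new ArrayList<>();
        }
        String joined = String.join(" ", tokens);
        return new ArrayList<>(Arrays.asList(stemmer.stemText(joined).split(" ")));
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("cats", "dog", "houses");
        CountingStemmer perToken = new CountingStemmer();
        CountingStemmer batched = new CountingStemmer();
        System.out.println(stemPerToken(perToken, tokens) + " calls=" + perToken.calls);
        System.out.println(stemBatched(batched, tokens) + " calls=" + batched.calls);
    }
}

// Hypothetical stand-in for the external stemmer: every invocation is assumed
// to carry a fixed overhead, so the class simply counts invocations.
final class CountingStemmer {
    int calls = 0;

    // Stems a whitespace-separated string in ONE invocation.
    String stemText(String text) {
        calls++;
        StringBuilder out = new StringBuilder();
        for (String token : text.trim().split("\\s+")) {
            if (out.length() > 0) {
                out.append(' ');
            }
            out.append(toyStem(token));
        }
        return out.toString();
    }

    // Toy rule for illustration only: strip a single trailing "s".
    private static String toyStem(String token) {
        return (token.length() > 1 && token.endsWith("s"))
                ? token.substring(0, token.length() - 1)
                : token;
    }
}
```

Whether batching fits inside a TokenFilter depends on how the real stemmer treats context and token boundaries, which this sketch does not model.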

Re: Searching Question

2008-09-26 Thread Otis Gospodnetic
It might be easiest to store the thread ID and the number of replies in the thread in each post Document in Solr. Otherwise it sounds like you'll have to combine some search results or data post-search. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message
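As a rough illustration of the suggestion above, each post document would carry its thread's ID and reply count alongside its own fields. The field names below (`id`, `thread_id`, `thread_reply_count`, `body`) and the `PostDocSketch` class are hypothetical, and the map is a plain Java structure, not a Solr API object.

```java
import java.util.LinkedHashMap;
import java.util.Map;

final class PostDocSketch {
    // Builds the field map for one post. All field names are made up for
    // illustration; a real schema would define its own.
    static Map<String, Object> postDocument(String postId, String threadId,
                                            int threadReplyCount, String body) {
        Map<String, Object> doc = new LinkedHashMap<>();
        doc.put("id", postId);
        doc.put("thread_id", threadId);                   // same value on every post in the thread
        doc.put("thread_reply_count", threadReplyCount);  // denormalized thread-level count
        doc.put("body", body);
        return doc;
    }

    public static void main(String[] args) {
        System.out.println(postDocument("p42", "t7", 12, "example post text"));
    }
}
```

The cost of this layout is that a new reply changes the count for the whole thread, so every post document in that thread would need to be re-indexed to keep the stored number current.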

Re: Searching Question

2008-09-26 Thread Jake Conk
Grant, Each post is its own document but I can merge them all into a single document under one thread if that will allow me to do what I want. The number of replies is stored both in Solr and the DB. Thanks, - JC On Fri, Sep 26, 2008 at 5:24 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Is

Re: Dismax , "query phrases"

2008-09-26 Thread Chris Hostetter
I'm not fully following everything you've got here, but one thing jumped out at me... :

Re: Integrating external stemmer in Solr and pre-processing text

2008-09-26 Thread Chris Hostetter
: It is invoking a COM object in Windows. The object is instantiated once for : a token stream, and then invoked once for each token. The invoke always has : an overhead, not much to do about that (sigh...) I also know nothing about COM, but based on your comments it sounds like instantiating yo

ANNOUNCE: Application Period Opens for Travel Assistance to ApacheCon US 2008

2008-09-26 Thread Chris Hostetter
NOTE: This is a cross posted announcement to all Lucene sub-projects, please confine any replies to [EMAIL PROTECTED] - The Travel Assistance Committee is taking in applications for those wanting to attend ApacheCon US 2008 between the 3rd and 7th November 2008 in New Orleans. Th

Re: Is it possible to specify a pattern of Ranking while querying the indexes?

2008-09-26 Thread Grant Ingersoll
Can you give an example of what you mean? On Sep 26, 2008, at 11:28 AM, tushar kapoor wrote: I want to specify a particular pattern in which results are retrieved for a query. Can a pattern of ranks be specified in the query? -- View this message in context: http://www.nabble.com/Is-it-po

Re: Integrating external stemmer in Solr and pre-processing text

2008-09-26 Thread Jaco
Hi Grant, In reply to your questions: 1. Are you having to restart/initialize the stemmer every time for your "slow" approach? Does that really need to happen? It is invoking a COM object in Windows. The object is instantiated once for a token stream, and then invoked once for each token. The i

Re: Bunch of questions regarding enterprise configuration

2008-09-26 Thread Dev Team
Hi Otis, Ah, okay those are all great pointers, thanks. I will certainly have to do more research, and then I'll certainly have more questions later. I have thought of using some kind of non-lucene/solr distributed cache to narrow down the online search... but the problem comes when ther

Is it possible to specify a pattern of Ranking while querying the indexes?

2008-09-26 Thread tushar kapoor
I want to specify a particular pattern in which results are retrieved for a query. Can a pattern of ranks be specified in the query? -- View this message in context: http://www.nabble.com/Is-it-possible-to-specify-a-pattern-of-Ranking-while-querying-the-indexes--tp19690731p19690731.html Sent fr

Re: Bunch of questions regarding enterprise configuration

2008-09-26 Thread Otis Gospodnetic
Hi Daryl, Re RAM amount - depends on your particular index (DB size doesn't help - who knows how you'll analyze/tokenize/index data, what term distribution is like, etc.) Re master-slave - look for Collection Replication page on the Wiki Re real-time IM-like presence - perhaps you can do it al

Re: Integrating external stemmer in Solr and pre-processing text

2008-09-26 Thread Grant Ingersoll
On Sep 26, 2008, at 9:40 AM, Jaco wrote: Hi, Here's some of the code of my Tokenizer: public class MyTokenizerFactory extends BaseTokenizerFactory { public WhitespaceTokenizer create(Reader input) { String text, normalizedText; try { text = IOUtils.toString(in

Re: Bunch of questions regarding enterprise configuration

2008-09-26 Thread Dev Team
Hi Otis, First off, thanks for your complete reply! It certainly has a lot of good info in it. To address some of the questions you asked, please see below: On Fri, Sep 26, 2008 at 1:36 AM, Otis Gospodnetic < [EMAIL PROTECTED]> wrote: > Hi, > > Your questions don't have simple answers,

Re: Integrating external stemmer in Solr and pre-processing text

2008-09-26 Thread Jaco
Hi, Here's some of the code of my Tokenizer: public class MyTokenizerFactory extends BaseTokenizerFactory { public WhitespaceTokenizer create(Reader input) { String text, normalizedText; try { text = IOUtils.toString(input); normalizedText= *i

Re: spellcheck: buildOnOptimize?

2008-09-26 Thread Shalin Shekhar Mangar
On Fri, Sep 26, 2008 at 9:27 AM, Chris Hostetter <[EMAIL PROTECTED]>wrote: > > couldn't the Listener's newSearcher() method just do something like > this... > > if (rebuildOnlyAfterOptimize && >! (newSearcher.getReader().isOptimized() && > ! oldSearcher.getReader().isOptimized()) { > re

RE: Create Indexes

2008-09-26 Thread Dinesh Gupta
Hi, Please tell me where to upload the files. Regards, Dinesh Gupta > Date: Fri, 26 Sep 2008 21:23:58 +1000 > From: [EMAIL PROTECTED] > To: solr-user@lucene.apache.org > Subject: Re: Create Indexes > > On Fri, 26 Sep 2008 16:32:05 +0530 > Dinesh Gupta <[EMAIL PROTECTED]> wrote: > > > Is it OK

Re: CLOSING SolrCore! >> ???

2008-09-26 Thread Grant Ingersoll
Can you provide more information? What happened right before seeing this msg? What version of Solr are you on? -Grant On Sep 26, 2008, at 3:26 AM, sunnyfr wrote: Hello everybody, I've big issue with the website, I don't know how but I can't start it again: this is my Catalina.log [EMAIL

Re: Searching Question

2008-09-26 Thread Grant Ingersoll
Is a thread and all of its posts a single document? In other words, how are you modeling your posts as Solr documents? Also, where are you keeping track of the number of replies? Is that in Solr or in a DB? -Grant On Sep 25, 2008, at 8:51 PM, Jake Conk wrote: Hello, We are using Solr

Re: spellcheck: buildOnOptimize?

2008-09-26 Thread Grant Ingersoll
That seems reasonable. Another thing to think about: it may be useful to attach some metadata to the events, containing information about what triggered them. Something like a SolrEvent class such that postCommit looks like postCommit(SolrEvent evt) and public void newSearch

Re: Integrating external stemmer in Solr and pre-processing text

2008-09-26 Thread Grant Ingersoll
How are you creating the tokens? What are you setting for the offsets and the positions? One thing that is helpful is Solr's built-in Analysis tool via the Admin interface (http://localhost:8983/solr/admin/). From there, you can plug in verbose mode, and see what the position and offsets a

Re: How to select one entity at a time?

2008-09-26 Thread Norberto Meijome
On Fri, 26 Sep 2008 02:35:18 -0700 (PDT) con <[EMAIL PROTECTED]> wrote: > What you meant is correct only. Please excuse for that I am new to solr. :-( Con, have a read here : http://www.ibm.com/developerworks/java/library/j-solr1/ it helped me pick up the basics a while back. it refers to 1.2,

Re: How to select one entity at a time?

2008-09-26 Thread Norberto Meijome
On Fri, 26 Sep 2008 02:35:18 -0700 (PDT) con <[EMAIL PROTECTED]> wrote: > What you meant is correct only. Please excuse for that I am new to solr. :-( Hi Con, nothing to be excused for... but you may want to read the wiki, as it provides quite a lot of information that should answer your questions

Re: Create Indexes

2008-09-26 Thread Norberto Meijome
On Fri, 26 Sep 2008 16:32:05 +0530 Dinesh Gupta <[EMAIL PROTECTED]> wrote: > Is it OK to create whole index by Solr web-app? > If not than ,How can I create index? > > I have attached some file that create index now. > Dinesh, you sent the same email 2 1/2 hours ago. sending it again will not g

Create Indexes

2008-09-26 Thread Dinesh Gupta
Hi All, Please give me some links so that I can start from the basics. I have a large database of products. 1) A product can be associated with multiple categories. 2) A category can be associated with multiple catalogs. 3) The category & catalog association is stored in a table called category-catalog

RE: NullPointerException

2008-09-26 Thread Dinesh Gupta
Hi, Yes, cdt & mdt are date fields in the MySQL DB > Date: Fri, 26 Sep 2008 13:58:24 +0530 > From: [EMAIL PROTECTED] > To: solr-user@lucene.apache.org > Subject: Re: NullPointerException > > I dunno if the problem is w/ date. are cdt and mdt date fields in the DB? > > On Fri, Sep 26, 2008 at 12:58 AM

Re: How to select one entity at a time?

2008-09-26 Thread con
What you meant is correct. Please excuse me, I am new to Solr. :-( I want to index all the query results. (I think this will be done by the data-config.xml.) Now, while accessing this indexed data, I need this filtering, i.e. either user or manager. I tried your suggestion: http://localhos

Re: How to select one entity at a time?

2008-09-26 Thread Norberto Meijome
On Fri, 26 Sep 2008 00:46:07 -0700 (PDT) con <[EMAIL PROTECTED]> wrote: > To be more specific: > I have the data-config.xml just like: > > > > > > > > > >

Re: Create Indexes

2008-09-26 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Fri, Sep 26, 2008 at 1:02 PM, Dinesh Gupta <[EMAIL PROTECTED]> wrote: > > > Hi All, > > Please give me some links so that I can start from basics. > > I have a large database of product. > > 1) Product can be associated with multiple category > 2) a category can be associated with multiple catal

Re: NullPointerException

2008-09-26 Thread Noble Paul നോബിള്‍ नोब्ळ्
I dunno if the problem is w/ date. are cdt and mdt date fields in the DB? On Fri, Sep 26, 2008 at 12:58 AM, Shalin Shekhar Mangar <[EMAIL PROTECTED]> wrote: > I'm not sure about why the NullPointerException is coming. Is that the whole > stack trace? > > The mdt and cdt are date in schema.xml but

Re: How to select one entity at a time?

2008-09-26 Thread con
To be more specific: I have the data-config.xml just like: I have 3 search

Create Indexes

2008-09-26 Thread Dinesh Gupta
Hi All, Please give me some links so that I can start from the basics. I have a large database of products. 1) A product can be associated with multiple categories. 2) A category can be associated with multiple catalogs. 3) The category & catalog association is stored in a table called category-catalog

CLOSING SolrCore! >> ???

2008-09-26 Thread sunnyfr
Hello everybody, I've big issue with the website, I don't know how but I can't start it again: this is my Catalina.log [EMAIL PROTECTED]:/# tail -f /var/log/tomcat5.5/catalina.2008-09-25.log INFO: [book] CLOSING SolrCore! Sep 25, 2008 5:56:16 PM org.apache.solr.core.SolrCore closeSearcher INFO:

Integrating external stemmer in Solr and pre-processing text

2008-09-26 Thread Jaco
Hello, I need to work with an external stemmer in Solr. This stemmer is accessible as a COM object (running Solr in Tomcat on a Windows platform). I managed to integrate this using the com4j library. I tested two scenarios: 1. Create a custom FilterFactory and Filter class for this. The external st