Re: Difficulty with Multi-Word Synonyms

2009-09-17 Thread Robert Muir
thank you again for the bug report with test case! Is there a recommended workaround that avoids combining the new and old > APIs? if you aren't able to patch lucene, maybe apply this workaround patch to your solr. this will dodge the problem for your case, by forcing it to only use next(Token)

Re: Solr via ruby

2009-09-17 Thread rajan chandi
We are planning to use the external Solr on tomcat for scalability reasons. We thought that EmbeddedSolrServer uses HTTP too to talk with Ruby and vise-versa as in acts_as_solr ruby plugin. If Ruby is not using the HTTP to talk EmbeddedSolrServer, what is it using? Thanks and Regards Rajan Chand

Re: Retrieving a field from all result docuemnts & couple of more queries

2009-09-17 Thread Shashikant Kore
Hoss, As I mentioned previously, I prefer to do this with as little java code as possible. That's the motivation for me to take a look at solr. Here is the code snippet. OpenBitSet resultBitset = new OpenBitSet(this.searcher.maxDoc()); this.searcher.search(query, new HitCollector() {

Re: Difficulty with Multi-Word Synonyms

2009-09-17 Thread Gregg Donovan
Thanks. And thanks for the help -- we're hoping to switch from query-time to index-time synonym expansion for all of the reasons listed on the wiki, so this will be great to resolve. I create

Re: Difficulty with Multi-Word Synonyms

2009-09-17 Thread Yonik Seeley
On Thu, Sep 17, 2009 at 6:29 PM, Lance Norskog wrote: > Please add a Jira issue for this. It will get more attention there. > > BTW, thanks for creating such a precise bug report. +1 Thanks, I had missed this. This is serious, and looks due to a Lucene back compat break. I've added the testcase

Re: multicore shards and relevancy score

2009-09-17 Thread Lance Norskog
(I responded in the other thread.) This looks like a Ruby client bug. If you do the same query with the HTTP url, it should work. On Tue, Sep 15, 2009 at 7:41 AM, Paul Rosen wrote: > Shalin Shekhar Mangar wrote: >> >> On Tue, Sep 15, 2009 at 2:39 AM, Paul Rosen >> wrote: >> >>> I've done a few ex

Re: Solr exception with missing required field (meta_guid_s)

2009-09-17 Thread Lance Norskog
The "Global Unique ID" is a field that must be set, and must have a different value in every document. The schema design is unusual - meta_guid_s is a wildcard name. Since the field must always be set, it will be more clear if the schema gives this a name that is not a wildcard. For example, "meta

Re: Best strategy to commit often under load.

2009-09-17 Thread Lance Norskog
Yes, the Solr autoCommit feature is the right tool for this task. It lets the indexing app be simple. In some earlier Solr releases, the "number of documents" feature did not work, so you had to use the "number of milliseconds" feature. Note that each commit causes Solr to forget all cached data a

Re: Difficulty with Multi-Word Synonyms

2009-09-17 Thread Lance Norskog
Please add a Jira issue for this. It will get more attention there. BTW, thanks for creating such a precise bug report. On Mon, Sep 14, 2009 at 1:52 PM, Gregg Donovan wrote: > I'm running into an odd issue with multi-word synonyms in Solr (using > the latest [9/14/09] nightly ). Things generally

Re: "Only one usage of each socket address" error

2009-09-17 Thread Lance Norskog
How is your data uploaded to Solr? Is it your own Java program, or shell scripts, or the DataImportHandler? Does this error appear in the Solr server log, or your program's log? How do you run Solr? In a web server like Jetty or Tomcat, or as the Jetty embedded Solr? Please post the entire excep

Re: shards and facet_count

2009-09-17 Thread Lance Norskog
Yes. &facet=false means "don't do any faceting". This is why you don't get any facet data back. This is probably a bug in the solr-ruby code. Version number 0.0.x is probably a hint about its production-ready status :) A distributed search system is easier to administer if you add a new core that

Re: Retrieving a field from all result docuemnts & couple of more queries

2009-09-17 Thread Chris Hostetter
: You will need to get SolrIndexSearcher.java and modify following:- : : public static final int GET_SCORES = 0x01; No. Do not do that. There is no reason for anyone, to EVER modify that line of code. Absolutely NONE If you've made that change to your version of Solr,

Re: Default Query Type For Facet Queries

2009-09-17 Thread Lance Norskog
There are also filter queries. Also, in the future we will also add other query types for other features. Do we want them all to change? Whatever we do, it should be consistent across all of the query types. On Fri, Sep 11, 2009 at 9:33 PM, Shalin Shekhar Mangar wrote: > On Sat, Sep 12, 2009 at 1

Re: Load synonyms dynamically

2009-09-17 Thread Chris Hostetter
: Is there a way to load the synonyms dynamically. : : I mean if the synonym.txt file changes then during query time the newly : added synonym should be active. : : Currently it required a reindex it only requires reindexing if you used the synonyms as part of an "index" analyzer ... if you use

Re: Misleading log messages while deploying solr

2009-09-17 Thread Chris Hostetter
: As per my understandings and investigations, if we disable STDERR from the : jboss configs, we will not be able to see any STDERR coming from any of the : APIs - which can be real error messages. : So if we know the exact reason why this message from solr is showing up, we : can block this at s

Re: How to leverage the LogMergePolicy "calibrateSizeByDeletes" patch in Solr ?

2009-09-17 Thread Jibo John
On Sep 17, 2009, at 1:30 PM, Shalin Shekhar Mangar wrote: On Fri, Sep 18, 2009 at 1:06 AM, Jibo John wrote: Hello, Came across a lucene patch ( http://issues.apache.org/jira/browse/LUCENE-1634) that would consider the number of deleted documents as the criteria when deciding which segme

Re: How to leverage the LogMergePolicy "calibrateSizeByDeletes" patch in Solr ?

2009-09-17 Thread Shalin Shekhar Mangar
On Fri, Sep 18, 2009 at 1:06 AM, Jibo John wrote: > Hello, > > Came across a lucene patch ( > http://issues.apache.org/jira/browse/LUCENE-1634) that would consider the > number of deleted documents as the criteria when deciding which segments to > merge. > > Since we expect to have very frequent

Re: anyway to get Document update time stamp

2009-09-17 Thread Shalin Shekhar Mangar
On Fri, Sep 18, 2009 at 12:30 AM, siping liu wrote: > > I understand there's no "update" in Solr/lucene, it's really delete+insert. > Is there anyway to get a Document's insert time stamp, w/o explicitely > creating such a data field in the document? If so, how can I query it, for > instance "get

Re: Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-17 Thread Chris Harris
It looks like this works as a fix for me as well. (I'm not currently using JMX for anything anyway.) Curiously, the single-core example solrconfig.xml also has "", but it doesn't seem to be a problem there. 2009/9/17 Dadasheva, Olga : > Hi, > > FWIW: disabling fixed this problem for me. > > Than

How to leverage the LogMergePolicy "calibrateSizeByDeletes" patch in Solr ?

2009-09-17 Thread Jibo John
Hello, Came across a lucene patch (http://issues.apache.org/jira/browse/LUCENE-1634 ) that would consider the number of deleted documents as the criteria when deciding which segments to merge. Since we expect to have very frequent deletes, we hope this would help reclaim the space consumed

RE: Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-17 Thread Dadasheva, Olga
Hi, FWIW: disabling fixed this problem for me. Thanks you! -Olga -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Thursday, September 17, 2009 1:09 PM To: solr-user@lucene.apache.org Subject: Re: Latest trunk locks execution thread i

Re: Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-17 Thread Chris Harris
Ok, let me revise my previous report: It turns out if I take the single-core or multicore example stuff from running "ant example" on r815830 and plop them into Tomcat, this doesn't hang. However, if I plop Solr as I configure it for production use into Tomcat, it does hang. The main ways I can t

anyway to get Document update time stamp

2009-09-17 Thread siping liu
I understand there's no "update" in Solr/lucene, it's really delete+insert. Is there anyway to get a Document's insert time stamp, w/o explicitely creating such a data field in the document? If so, how can I query it, for instance "get all documents that are older than 24 hours"? Thanks. __

Re: Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-17 Thread Grant Ingersoll
On Sep 16, 2009, at 6:54 PM, Yonik Seeley wrote: On a quick look, it looks like this was caused (or at least triggered by) https://issues.apache.org/jira/browse/SOLR-1427 Registering the bean in the SolrCore constructor causes it to immediately turn around and ask for the stats which asks fo

Re: Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-17 Thread Chris Hostetter
: Maybe moving the resourceLoader.inform(infoRegistry) call PRIOR to : resourceLoader.inform( this ) *or* AFTER latch.countDown() would solve : this problem? If this really is the problem, then a more general purpose solution to future proof us against similar problems down the road would prob

Re: Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-17 Thread Chris Hostetter
: Interesting... I still haven't been able to reproduce a hang with : either jetty or tomcat. : I enabled replication and JMX... still nothing. I haven't tried to reproduce the problem, and i don't even have a concrete theory as to what the problem is ... but i did want to point out something t

Re: Searching with or without diacritics

2009-09-17 Thread AHMET ARSLAN
> The sequence of the TokenizerChain is > not correct... Filters must be after tokenizer: Correct for TokenFilter(s), wrong for charFilter(s). MappingCharFilterFactory comes before tokenizer.

Re: Spell checking not working

2009-09-17 Thread Avlesh Singh
A typo, I think is the reason ... positionIncrementGap="100"> multiValued="true"/> > In the fieldType definition the field name is "testSpell" but the field called "spell" uses a type called "textSpell". Please revert back if this is not the reason. Cheers Avlesh On Thu, Sep 17, 2009 at 7:10

Re: Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-17 Thread Yonik Seeley
Interesting... I still haven't been able to reproduce a hang with either jetty or tomcat. I enabled replication and JMX... still nothing. -Yonik http://www.lucidimagination.com On Thu, Sep 17, 2009 at 12:35 PM, Chris Harris wrote: > I found what looks like the same issue when I tried to install

Re: Latest trunk locks execution thread in SolrCore.getSearcher()

2009-09-17 Thread Chris Harris
I found what looks like the same issue when I tried to install r815830 under Tomcat. (It works ok with the normal Jetty example/start.jar.) I haven't checked the stack trace, but Tomcat would hang right after the message INFO: Adding debug component:org.apache.solr.handler.component.debugcompon..

copyfield at search time?

2009-09-17 Thread DHast
is it possible to do somehting like this: Now im wondering how to do something like this: if so, i dont seem to be making progress thanks -- View this message in context: http://www.nabble.com/copyfield-at-search-time--tp25491979p25491979.html Sent from the Solr - User mailing l

Re: Solr via ruby

2009-09-17 Thread Erik Hatcher
On Sep 17, 2009, at 11:40 AM, Ian Connor wrote: Is there any support for connection pooling or a more optimized data exchange format? The solr-ruby library (as do other Solr + Ruby libraries) use the ruby response format and eval it. solr-ruby supports keeping the HTTP connection alive to

Re: Searching with or without diacritics

2009-09-17 Thread Koji Sekiguchi
The sequence of the TokenizerChain is not correct... Filters must be after tokenizer: Koji György Frivolt wrote: I tried to use ISOLatin1AccentFilterFactory under solr 1.3 . It partly works, but does not recognize most of the characters I

Re: Facet vs searching, field types,

2009-09-17 Thread DHast
DHast wrote: > > > > DHast wrote: >> >> hello, >> my documents have a field called member_of, >> this field holds a list, so it will look like this: >> member_of: fred bob mike journals wsjournals jen steve >> if 'member_of' is a 'string' and i search: >> goats AND member_of(fred bob journa

Re: Facet vs searching, field types,

2009-09-17 Thread DHast
DHast wrote: > > hello, > my documents have a field called member_of, > this field holds a list, so it will look like this: > member_of: fred bob mike journals wsjournals jen steve > if 'member_of' is a 'string' and i search: > goats AND member_of(fred bob journals) > i get the right results,

Solr via ruby

2009-09-17 Thread Ian Connor
Hi, Is there any support for connection pooling or a more optimized data exchange format? We are looking at any further ways to optimize the solr queries so we can possibly make more of them in the one request. The JSON like format seems pretty tight but I understand when the distributed search t

Facet vs searching, field types,

2009-09-17 Thread DHast
hello, my documents have a field called member_of, this field holds a list, so it will look like this: member_of: fred bob mike journals wsjournals jen steve if 'member_of' is a 'string' and i search: goats AND member_of(fred bob journals) i get the right results, however, the faceting doesnt wo

Re: Searching with or without diacritics

2009-09-17 Thread György Frivolt
I tried to use ISOLatin1AccentFilterFactory under solr 1.3 . It partly works, but does not recognize most of the characters I need to map. So I tried to use MappingCharFilterFactory based on the documentation it needs a different tokenizer, I set it, and also a mapping file, this is a simple txt wi

Spell checking not working

2009-09-17 Thread Villemos, Gert
I'm trying to setup a spell checker but failing misserably. I would like to have a spell check based on actual values injected into the index from other fields. The configuration is shown below. After indexing and running a query with 'spellcheck.build=true' I can see that the spellcheck index fi

Re: [DIH] URLDataSource and fetching a link

2009-09-17 Thread Grant Ingersoll
On Sep 16, 2009, at 9:13 PM, Walter Underwood wrote: I would use the RSS feed (hopefully in Atom format) as a source of links, then use a regular web spider to fetch the content. I seriously doubt that DIH is up to the task of general fetching from the Wild Wild Web. That is a dirty and di

Re: [DIH] URLDataSource and fetching a link

2009-09-17 Thread Grant Ingersoll
On Sep 16, 2009, at 9:13 PM, Walter Underwood wrote: I would use the RSS feed (hopefully in Atom format) as a source of links, then use a regular web spider to fetch the content. I seriously doubt that DIH is up to the task of general fetching from the Wild Wild Web. That is a dirty and di

Re: Questions on copyField

2009-09-17 Thread Rahul R
Thank you Shalin. Regards Rahul On Thu, Sep 17, 2009 at 11:49 AM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote: > On Thu, Sep 17, 2009 at 11:19 AM, Rahul R wrote: > > > Shalin, > > Can you please elaborate a little more on the third response > > *You can send the location's value direc

Re: DeltaImport problem

2009-09-17 Thread KirstyS
It seems my deltaImportQuery was incorrect. I am now getting the documents indexed. thanks so much for helping! KirstyS wrote: > > same response .. > delta-import > idle > > − > > 3 > 2 > 0 > 2009-09-17 11:58:13 > 2009-09-17 11:58:13 > 2009-09-17 11:58:13 > 2009-09-17 11:58:13 > 2 > 0:0:0.156

Re: DeltaImport problem

2009-09-17 Thread KirstyS
same response .. delta-import idle − 3 2 0 2009-09-17 11:58:13 2009-09-17 11:58:13 2009-09-17 11:58:13 2009-09-17 11:58:13 2 0:0:0.156 used http://localhost:8080/solr/dataimport?command=delta-import&commit=true Noble Paul നോബിള്‍ नोब्ळ्-2 wrote: > > can you try the same command a commit=t

Re: DeltaImport problem

2009-09-17 Thread Noble Paul നോബിള്‍ नोब्ळ्
check the logs and see if you have any messages like Exception while adding ... or Error creating document : 2009/9/17 Noble Paul നോബിള്‍ नोब्ळ् : > the status command is command=status > > 2009/9/17 Noble Paul നോബിള്‍  नोब्ळ् : >> can you try the same command a commit=true ? >> >> On Thu, Sep 1

Re: DeltaImport problem

2009-09-17 Thread Noble Paul നോബിള്‍ नोब्ळ्
the status command is command=status 2009/9/17 Noble Paul നോബിള്‍ नोब्ळ् : > can you try the same command a commit=true ? > > On Thu, Sep 17, 2009 at 3:15 PM, KirstyS wrote: >> >> No, I don't see any commit... >> >> I ran with this command >> http://localhost:8080/solr/dataimport?command=delta-i

Re: DeltaImport problem

2009-09-17 Thread Noble Paul നോബിള്‍ नोब्ळ्
can you try the same command a commit=true ? On Thu, Sep 17, 2009 at 3:15 PM, KirstyS wrote: > > No, I don't see any commit... > > I ran with this command > http://localhost:8080/solr/dataimport?command=delta-import > and these are the results > > delta-import > idle > > − > > 3 > 2 > 0 > 2009-

Re: DeltaImport problem

2009-09-17 Thread KirstyS
No, I don't see any commit... I ran with this command http://localhost:8080/solr/dataimport?command=delta-import and these are the results delta-import idle − 3 2 0 2009-09-17 11:42:14 2009-09-17 11:42:14 2009-09-17 11:42:14 2009-09-17 11:42:14 2 0:0:0.172 − This response format is experimen

Re: DeltaImport problem

2009-09-17 Thread Noble Paul നോബിള്‍ नोब्ळ्
What does the status command say after delta-import? does it say committed? On Thu, Sep 17, 2009 at 2:50 PM, KirstyS wrote: > > Hi! thanks so much, I have the latest release. > Now, it shows that the delta import is successful but when I query for the > updated article I am expecting, it has not

Re: DeltaImport problem

2009-09-17 Thread KirstyS
I got the build from 2009-09-12... :o( Any ideas what is wrong or how to fix? foamdino wrote: > > Hi, > >> Hi! thanks so much, I have the latest release. >> Now, it shows that the delta import is successful but when I query for >> the >> updated article I am expecting, it has not updated. It's

Re: DeltaImport problem

2009-09-17 Thread Kevin Jackson
Hi, > Hi! thanks so much, I have the latest release. > Now, it shows that the delta import is successful but when I query for the > updated article I am expecting, it has not updated. It's like the index is > not being updated with the changes...Any ideas? I have experienced the same issue with a

Re: DeltaImport problem

2009-09-17 Thread KirstyS
Hi! thanks so much, I have the latest release. Now, it shows that the delta import is successful but when I query for the updated article I am expecting, it has not updated. It's like the index is not being updated with the changes...Any ideas? thanks Kirsty Noble Paul നോബിള്‍ नोब्ळ्-2 wrote: >

Re: Extract info from parent node during data import (redirect:)

2009-09-17 Thread Fergus McMenemie
JIRA SOLR-1437 created "DIH: Enhance XPathRecordReader to deal with //tagname and other improvements." >Fergus, > >Implementing wildcard (//tagname) is definitely possible. I would love >to see it working. But if you wish to take a dig at it I shall do >whatever I can to help. > >>What is th

Re: autosuggest - terms componet - document details

2009-09-17 Thread Shalin Shekhar Mangar
On Wed, Sep 16, 2009 at 10:47 PM, Yerraguntla wrote: > > I just started using terms/autosuggest service.Application need the > document > details along with result items. What params I need to use to fetch the > document details. > There is no way to get document details using the TermsComponent