Re: fl rename of unique key in solrcloud

2014-11-14 Thread Jeon Woosung
Could you let me know version of the solr? On Sat, Nov 15, 2014 at 5:05 AM, Suchi Amalapurapu wrote: > Hi > Getting the following exception when using fl renaming with unique key in > the schema. > http:///solr//select?q=dress&fl=a1:p1 > > where p1 is the unique key for > For collections with s

Re: solr IRC

2014-11-14 Thread Anurag Sharma
Also like know, is this the only IRC or there are other's as well like solr dev, lucene dev etc On Sat, Nov 15, 2014 at 10:59 AM, Anurag Sharma wrote: > I tried couple of weeks earlier as well. As suggested, will try again > after mid next week. > > On Sat, Nov 15, 2014 at 10:53 AM, Alexandre R

Re: solr IRC

2014-11-14 Thread Anurag Sharma
I tried couple of weeks earlier as well. As suggested, will try again after mid next week. On Sat, Nov 15, 2014 at 10:53 AM, Alexandre Rafalovitch wrote: > If you tried this week it is because everybody was at the conference. Try > again mid next week. > > Regards, > Alex > On 14/11/2014 11

Re: solr IRC

2014-11-14 Thread Alexandre Rafalovitch
If you tried this week it is because everybody was at the conference. Try again mid next week. Regards, Alex On 14/11/2014 11:35 pm, "Anurag Sharma" wrote: > Is this correct link to Solr IRC - > http://webchat.freenode.net/?channels=#solr > I tried couple of times using the IRC, the list of

Re: Hierarchical faceting

2014-11-14 Thread Evan Pease
Hi Rashmi, Here is some more details on how to use PathHierarchyTokenizer that Oleg provided the link to. If this is your document: > *Sample document* > > name=Pbook1 > category=NonFic/Sci/Phy/Quantum > author=ABC > price=20.00 > Then, in your schema.xml:

solr IRC

2014-11-14 Thread Anurag Sharma
Is this correct link to Solr IRC - http://webchat.freenode.net/?channels=#solr I tried couple of times using the IRC, the list of online users are always good but never get any response on the query and also don't see any communication/discussion.

Re: Hierarchical faceting

2014-11-14 Thread Oleg Savrasov
Hi Rashmi, I believe you are looking for PathHierarchyTokenizer, see https://lucene.apache.org/core/4_0_0/analyzers-common/org/apache/lucene/analysis/path/PathHierarchyTokenizer.html Oleg 2014-11-14 17:53 GMT-05:00 rashmy1 : > Hello, > I'm trying to setup Solr for fetching hierarchical facets.

Re: Restrict search to subset (a list of aprrox 40,000 ids from an external service) of corpus

2014-11-14 Thread Shawn Heisey
On 11/14/2014 9:51 AM, henry cleland wrote: > How do I search only a subset of my corpus based on a large list of non > consecutive unique key ids (cannot do a range query). > Is there a way around doing this q=id:(id1 OR id2 OR id3 OR id4 ... OR > id4 ) AND name:* > > Also what is the limit o

Hierarchical faceting

2014-11-14 Thread rashmy1
Hello, I'm trying to setup Solr for fetching hierarchical facets. Please advice which of the below approaches should be followed for my scenario. *Scenario: * NonFic Hist HistBook1 HistBook2 Sci Phy Quantum

Duplicate scoring situation in DelegatingCollector

2014-11-14 Thread Andy Crossen
Hi folks, I have a DelegatingCollector installed via a PostFilter (kind of like an AnalyticsQuery) that needs the document score to a) add to a collection of score-based stats, and b) decide whether to keep the document based on the score. If I keep the document, I call super.collect() (where sup

Re: Suggest dictionaries not rebuilding after restart

2014-11-14 Thread Michael Sokolov
Yeah - I would want it fixed as a default setting of some sort, maybe in-built in the Suggester class, so you wouldn't be required to have something in config to make it work in a reasonable way. Glad my insomnia went to some purpose. -MIke On 11/14/2014 02:12 PM, Walter Underwood wrote: Tha

fl rename of unique key in solrcloud

2014-11-14 Thread Suchi Amalapurapu
Hi Getting the following exception when using fl renaming with unique key in the schema. http:///solr//select?q=dress&fl=a1:p1 where p1 is the unique key for For collections with single shard, this works flawlessly but results in the following exception in case of multiple shards. How do we fix

Re: DIH Blob data

2014-11-14 Thread Anurag Sharma
Thanks Michael & Eric for the succinct response. On Sat, Nov 15, 2014 at 12:13 AM, Michael Sokolov < msoko...@safaribooksonline.com> wrote: > There is a binary type > > -Mike > > On 11/14/2014 12:21 PM, Anurag Sharma wrote: > >> bq: We routinely store images and pdfs in Solr. There *is* a benefit

Re: Suggest dictionaries not rebuilding after restart

2014-11-14 Thread Walter Underwood
That fixed it. I bet that would fix the problem with the very long startup that another user had. That’s a bug in the default solrconfig.xml, it should persist the dictionaries. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ On Nov 14, 2014, at 12:42 AM, Michael

Re: DIH Blob data

2014-11-14 Thread Erick Erickson
Right, a more nuanced comment involves what _type_ of docs you're storing, and what the ratio of searchable-to-overall size is. Consider an image. The searchable data may be 0.01% of the file size. Or even worse, a movie. As always, "it depends". I guess that personally I'm not a fan of using Solr

Re: DIH Blob data

2014-11-14 Thread Michael Sokolov
On 11/14/2014 01:43 PM, Erick Erickson wrote: Just skimming, so maybe I misinterpreted. ExternalFileField and ExternalFileFieldReloader refer to storing values for each doc in an external file, they have nothing to do with storing _files_. The usual pattern is to have Solr store just enough da

Re: DIH Blob data

2014-11-14 Thread Michael Sokolov
There is a binary type -Mike On 11/14/2014 12:21 PM, Anurag Sharma wrote: bq: We routinely store images and pdfs in Solr. There *is* a benefit, since you don't need to manage another storage system, you don't have to worry about Solr getting out of sync with the other system, you can use Solr r

Re: DIH Blob data

2014-11-14 Thread Erick Erickson
Just skimming, so maybe I misinterpreted. ExternalFileField and ExternalFileFieldReloader refer to storing values for each doc in an external file, they have nothing to do with storing _files_. The usual pattern is to have Solr store just enough data to have the system-of-record return the actual

Re: Restrict search to subset (a list of aprrox 40,000 ids from an external service) of corpus

2014-11-14 Thread Jürgen Wagner (DVT)
Hi guy, there's not much of a search operation here. Why not store the documents in a key/value store and simply fetch them by matching ids? Another approach: as there is no query, you could easily partition the set of ids and fetch the results in multiple batches. The maximum number of clause

Re: Restrict search to subset (a list of aprrox 40,000 ids from an external service) of corpus

2014-11-14 Thread Anurag Sharma
Is it possible to add another integer dyanmicField to the selected doc ids? If yes, further can add update incremental/same values to these docs now search can be done to this subset using range/filter query. On Fri, Nov 14, 2014 at 10:21 PM, henry cleland wrote: > Hi guys, > How do I search onl

Re: DIH Blob data

2014-11-14 Thread Anurag Sharma
bq: We routinely store images and pdfs in Solr. There *is* a benefit, since you don't need to manage another storage system, you don't have to worry about Solr getting out of sync with the other system, you can use Solr replication for all your assets, etc. Do the same holds good for large Blobs l

Restrict search to subset (a list of aprrox 40,000 ids from an external service) of corpus

2014-11-14 Thread henry cleland
Hi guys, How do I search only a subset of my corpus based on a large list of non consecutive unique key ids (cannot do a range query). Is there a way around doing this q=id:(id1 OR id2 OR id3 OR id4 ... OR id4 ) AND name:* Also what is the limit of "OR"s i can apply on the query if that is th

Re: One ZooKeeper and many Solr clouds

2014-11-14 Thread Enrico Trucco
Thank you very much, Jürgen. 2014-11-14 13:51 GMT+01:00 "Jürgen Wagner (DVT)" < juergen.wag...@devoteam.com>: > Hello Enrico, > you may use the chroot feature of Zookeeper to root the different > SolrCloud instances differently. Instead of zoohost1:2181, you can use > zoohost1:2181/cluster1 a

Re: One ZooKeeper and many Solr clouds

2014-11-14 Thread Jürgen Wagner (DVT)
Hello Enrico, you may use the chroot feature of Zookeeper to root the different SolrCloud instances differently. Instead of zoohost1:2181, you can use zoohost1:2181/cluster1 as the Zookeeper location. Unless there is a load issue with high rates of updates and other data traffic, a single Zookeep

One ZooKeeper and many Solr clouds

2014-11-14 Thread Enrico Trucco
Hello I am considering to start using Solr Cloud and to share a single ZooKeeper between different Solr clouds and eventually other software. In all the examples I see online, the configuration of a Solr cloud is stored in the root node of ZooKeeper. I was wandering if it is possible to specify t

RE: Handling growth

2014-11-14 Thread Toke Eskildsen
Patrick Henry [patricktheawesomeg...@gmail.com] wrote: >I am working with a Solr collection that is several terabytes in size over > several hundred millions of documents. Each document is very rich, and > over the past few years we have consistently quadrupled the size our > collection annually.

Re: Suggest dictionaries not rebuilding after restart

2014-11-14 Thread Michael Sokolov
It looks like you have to define "storeDir", and if you don't then the rebuild no longer happens, as you said. I think that goes in the config block you showed, but I haven't tested this (we use a different suggester with its own persistence strategy). -Mike On 11/14/14 2:01 AM, Walter Under