Re: MongoDb vs Solr

2017-08-04 Thread Walter Underwood
MarkLogic can do many-to-many. I worked there six years ago. They use search engine index structure with generational updates, including segment level caches. With locking. Pretty good stuff. A many to many relationship is an intersection across posting lists, with transactions.
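To picture the last point, a many-to-many lookup can be treated as an intersection of sorted posting lists. The sketch below is a generic illustration and not MarkLogic's implementation; the index contents, the term names and the `intersect` helper are all made up for the example, and it leaves out transactions and locking entirely.

```python
def intersect(a, b):
    """Intersect two sorted lists of document ids with a linear merge."""
    i = j = 0
    out = []
    while i < len(a) and j < len(b):
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1
        else:
            j += 1
    return out


# Hypothetical inverted index: term -> sorted ids of documents containing it.
postings = {
    "author:alice": [1, 4, 7, 9],
    "tag:search": [2, 4, 9, 12],
}

# Documents that are related to both terms at once.
matches = intersect(postings["author:alice"], postings["tag:search"])
print(matches)
```

Because both lists are sorted, the merge touches each entry at most once, which is why this kind of join stays cheap inside an index.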

Re: MongoDb vs Solr

2017-08-04 Thread David Hastings
Also, I'd love to see an example of a many-to-many relationship in a NoSQL DB as you described, since that's an RDBMS concept. If it exists in a NoSQL environment I would like to learn how... > On Aug 4, 2017, at 10:56 PM, Dave wrote: > > Uhm. Dude are you

Re: MongoDb vs Solr

2017-08-04 Thread Dave
Uhm. Dude are you drinking? 1. Lucidworks would never say that. 2. Maria is not JSON + MySQL. Maria is a fork of the last open source version of MySQL before Oracle bought them. 3. Walter is 100% correct. Solr is search. The only complex data structure it has is an array. Something like mongo

Re: MongoDb vs Solr

2017-08-04 Thread GW
The people @ Lucidworks would beg to disagree but I know exactly what you are saying Walter. A simple flat file like a cardx is fine and dandy as a SolrCloud NoSQL DB. I like to express it as knowing when to fish and when to cut bait. As soon as you are in the one-to-many or many-to-many world a

Re: Metrics in 6.5.1 names and stuff

2017-08-04 Thread Shalin Shekhar Mangar
Hi Walter, You need the metrics with the prefix "QUERY./srp" -- those will give you request counts and percentiles for the /srp handler for a node. Comments inline: On Fri, Aug 4, 2017 at 3:23 AM, Walter Underwood wrote: > I’m trying to get what I want out of the metrics

Re: Limiting the number of queries/updates to Solr

2017-08-04 Thread S G
The timeAllowed parameter is not a good choice for rate limiting and could crash the whole Solr cluster. In fact, the timeAllowed parameter should increase the chances of crashing the whole cluster: When the timeAllowed for a query is over, its client will get a failure but the server handling the

Re: MongoDb vs Solr

2017-08-04 Thread Walter Underwood
Solr is NOT a database. If you need a database, don’t choose Solr. If you need both a database and search, choose MarkLogic. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Aug 4, 2017, at 4:16 PM, Francesco Viscomi wrote: > >

Re: MongoDb vs Solr

2017-08-04 Thread Dave
One's a search engine and the other is a NoSQL DB. They're nothing alike and are completely different tools for completely different jobs. > On Aug 4, 2017, at 7:16 PM, Francesco Viscomi wrote: > > Hi all, > why i have to choose solr if mongoDb is easier to learn and to

MongoDb vs Solr

2017-08-04 Thread Francesco Viscomi
Hi all, why should I choose Solr if MongoDB is easier to learn and to use? Both are NoSQL databases; is there a good reason to choose Solr and not MongoDB? Thanks very much -- Ing. Viscomi Francesco

Re: Returning unique values for suggestion

2017-08-04 Thread Zheng Lin Edwin Yeo
Hi Walter, Can we consider this as a bug/improvement for the /suggest handler which uses highlighter? Regards, Edwin On 20 July 2017 at 10:14, Zheng Lin Edwin Yeo wrote: > I am getting something similar to yours too, but I'm using Solr 6.5.1. > > > "highlighting":{ >

Re: Sum of double fields in JSON Facet

2017-08-04 Thread Zheng Lin Edwin Yeo
Hi Amrit, Thanks for your reply. So does this mean that we can consider this as a bug in JSON Facets? Regards, Edwin On 26 July 2017 at 01:57, Amrit Sarkar wrote: > Zheng, > > You may want to check https://issues.apache.org/jira/browse/SOLR-7452. I > don't know

Re: Cookies with SOLRJ?

2017-08-04 Thread Shawn Heisey
On 8/4/2017 8:59 AM, Sanders, Marshall (CAI - Atlanta) wrote: > Is there a way to easily set/get a cookie from a solrj request/response or an > example of this someone can share? > > The types of queries our application uses could greatly benefit from > stickying our application to a specific

Re: Replication Question

2017-08-04 Thread Shawn Heisey
On 8/2/2017 8:56 AM, Michael B. Klein wrote: > SCALE DOWN > 1) Call admin/collections?action=BACKUP for each collection to a > shared NFS volume > 2) Shut down all the nodes > > SCALE UP > 1) Spin up 2 Zookeeper nodes and wait for them to stabilize > 2) Spin up 3 Solr nodes and wait for them to

Re: Limiting the number of queries/updates to Solr

2017-08-04 Thread Varun Thacker
Hi Hrishikesh, I think SOLR-7344 is probably an important addition to Solr. It could help users isolate analytical queries ( streaming ) , search queries and indexing requests and throttle requests Let's continue the discussion on the Jira On Thu, Aug 3, 2017 at 2:03 AM, Rick Leir

streaming expression search on top of merged collections

2017-08-04 Thread Minh Truong
Hi all, I have a question related to Streaming expressions. Is it possible to perform a search on top of a merged collection? If it is possible, how do you write it? Thanks! Regards, Minh

Re: Move index directory to another partition

2017-08-04 Thread Shawn Heisey
On 8/2/2017 9:17 AM, Erick Erickson wrote: > Not entirely sure about AWS intricacies, but getting a new replica to > use a particular index directory in the general case is just > specifying dataDir=some_directory on the ADDREPLICA command. The index > just needs an HTTP connection (uses the old

Re: AW: plus sign in request / looking for + in title

2017-08-04 Thread d.ku...@technisat.de
Hey, that is a good point. What is the best way to filter? As for the plus in the request, we URL-encode the whole request. Thanks David > On 04.08.2017 at 17:34, Erick Erickson wrote: > > Glad to hear it. Two things: > > 1> you might have
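The URL-encoding point in this message can be checked with a small round trip. This is only a sketch using Python's standard library; the parameter name `q` and the value `hd+` are taken as examples, and it assumes the receiving server decodes the query string form-style, where a raw `+` means a space.

```python
from urllib.parse import parse_qs, urlencode

# A raw "+" in a query string is decoded as a space by form-style parsers,
# so the plus sign never reaches the query parser.
raw = parse_qs("q=hd+")
print(raw)

# Encoding the value first turns "+" into "%2B", which survives decoding.
encoded = urlencode({"q": "hd+"})
print(encoded)

round_trip = parse_qs(encoded)
print(round_trip)
```

Encoding only fixes the transport side. Whether the plus sign is still there after analysis depends on the tokenizer configured for the field.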

Re: Get results in multiple orders (multiple boosts)

2017-08-04 Thread Rick Leir
Luca I hope you have a web app in front of Solr. It could accept parameters from the browser, then construct the query as necessary to do your sorting. Cheers -- Rick On August 4, 2017 5:32:31 AM EDT, Luca Dall'Osto wrote: >Hello, >sorry for the late, I was out

Re: Different order of docs between SOLR-4.10.4 to SOLR-6.5.1

2017-08-04 Thread Erick Erickson
In addition to Shawn's comments, deleted but not merged documents alter the statistics used for scoring, so the only hope that the scores are comparable would be on an optimized index. And note that I would recommend optimizing _only_ for testing, don't use it in a production system unless the

Re: AW: plus sign in request / looking for + in title

2017-08-04 Thread Erick Erickson
Glad to hear it. Two things: 1> you might have to do some additional filtering when using WhitespaceTokenizer. It, well, splits on whitespace so things like punctuation will come through as part of the token. So "My dog has fleas." (note the period after fleas) would have the period included in
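The whitespace behaviour described in this message can be imitated outside Solr. The sketch below uses Python's `str.split()` as a stand-in for WhitespaceTokenizer, which is an approximation and not the Lucene code; the punctuation-trimming step is just one hypothetical form the "additional filtering" could take, chosen so that a trailing `+` is kept.

```python
import string

text = "My dog has fleas."

# Plain whitespace splitting, standing in for WhitespaceTokenizer:
# the period stays attached to the last token.
tokens = text.split()
print(tokens)

# One possible extra filter: trim punctuation at token edges but keep "+".
trim_chars = string.punctuation.replace("+", "")
cleaned = [token.strip(trim_chars) for token in tokens + ["hd+"]]
print(cleaned)
```

In Solr itself this kind of cleanup would be done by filters in the field type's analysis chain, not in client code.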

Cookies with SOLRJ?

2017-08-04 Thread Sanders, Marshall (CAI - Atlanta)
Is there a way to easily set/get a cookie from a solrj request/response or an example of this someone can share? The types of queries our application uses could greatly benefit from stickying our application to a specific solr instance and I'd like to read the load balancer cookie and then

Per Text Field Similarity Measures for Learning to Rank

2017-08-04 Thread Michael Alcorn
Hi all, I recently prototyped a learning to rank system in Python that produced promising results, so I'm now looking into how to replicate that process in our Solr setup. For my Python implementation, I was using a number of features that were per field text comparisons, e.g.: 1.

AW: AW: plus sign in request / looking for + in title

2017-08-04 Thread d.ku...@technisat.de
Hey, thanks. Yeah, I found a way. I used my own fieldtype for these fields. In it I'm using the WhitespaceTokenizerFactory for query and index, and now everything is as it should be. :-) Thanks David -Original Message- From: Shawn Heisey [mailto:apa...@elyograg.org]

Re: Different order of docs between SOLR-4.10.4 to SOLR-6.5.1

2017-08-04 Thread Shawn Heisey
On 8/4/2017 1:02 AM, SOLR4189 wrote: > I need to upgrade from SOLR-4.10.3 to SOLR-6.5.1 in production environment. > When I checked it in the test environment, I noticed the order of returned > docs for each query is different. The score has changed as well. I use same > similarity algorithm -

Re: AW: plus sign in request / looking for + in title

2017-08-04 Thread Shawn Heisey
On 8/4/2017 2:15 AM, d.ku...@technisat.de wrote: > So how can I prevent e.g. the ST (StandardTokenizer) from removing the plus sign? > Any suggestions? You can't. The standard tokenizer really isn't configurable at all. You'd need to change your analysis chain (tokenizer and filters) to produce the

Re: Get results in multiple orders (multiple boosts)

2017-08-04 Thread Luca Dall'Osto
Hello, sorry for the late reply, I was away from home. In response to Rick: I can't do that because: 1) each user should have multiple sorts (for example user "A" can sort by date and then by category and then by name ...). 2) the sort is not a natural sort: the user has a custom order for a field (for

AW: plus sign in request / looking for + in title

2017-08-04 Thread d.ku...@technisat.de
Hey Erick, thanks for the reply. Analysis is a good point. I tried "hd+" as the Field Value and you were right: ST text hd raw_bytes [68 64] start 0 end 2 positionLength 1 type position 1 So how can I prevent e.g. the ST (StandardTokenizer) from removing the plus sign? Any suggestions? thanks

Different order of docs between SOLR-4.10.4 to SOLR-6.5.1

2017-08-04 Thread SOLR4189
Hey all, I need to upgrade from SOLR-4.10.3 to SOLR-6.5.1 in a production environment. When I checked it in the test environment, I noticed the order of returned docs for each query is different. The score has changed as well. I use the same similarity algorithm, Okapi BM25, as in the previous version.