bitcask hash algo

2011-06-07 Thread Aaron Blohowiak
as far as i can tell, bitcask c_src is using the murmurhash2 algo, which has a known flaw ( https://sites.google.com/site/murmurhash/murmurhash2flaw ).. while this is not *likely* to cause an issue, I was wondering if there was a reason that it does not use murmurhash3 ? if this is not the appropr

Re: Correct way to use pbc/mapreduce to do multiget where keys and bucket names are binary values?

2011-06-07 Thread Jacques
I've been working on this. I have it working with an anonymous javascript function. I was hoping to move it to using the "map_object_value" built-in erlang function. However, when I attempt to use this function, I get failures if any of my keys don't exist. Is there a way to construct my map ph

Re: Has there been any talk of dropping the PB interface?

2011-06-07 Thread Mike Oxford
Protobufs is "tighter on the wire" than BERT, due to predefined schema and better packing of things like numbers. The same goes for Thrift. If you need more languages for protobuf, have a look at http://code.google.com/p/protobuf/wiki/ThirdPartyAddOns Rock on. -mox On Tue, Jun 7, 2011 at 3:39

Has there been any talk of dropping the PB interface?

2011-06-07 Thread Andrew Berman
I'm curious if there has been any talk to drop the protocol buffers interface in favor of one of the more user-friendly serialization libraries which support more languages, like Bert (http://bert-rpc.org/) or MessagePack (http://msgpack.org/). I would think Bert is a perfect fit for Riak since it

Re: riaksearch memory growth issues

2011-06-07 Thread Gordon Tillman
Thanks David, If there is anything I can do from this end to help please don't hesitate to ask. --gordon On Jun 7, 2011, at 15:34 , David Smith wrote: > Gordon, > > Thanks for the test case. I've queued it up for review by a dev, as > time permits. > > D. > > On Tue, Jun 7, 2011 at 1:33 PM,

speeding up riaksearch precommit indexing

2011-06-07 Thread Steve Webb
Hey there. I'm inserting twitter spritzer tweets into a bucket that doesn't have a precommit index hook, and a few fields from the tweet into a second bucket that does have the precommit hook. Speeds on the inserts into the indexed bucket are an order or magnitude slower than the non-indexed

Re: riaksearch memory growth issues

2011-06-07 Thread David Smith
Gordon, Thanks for the test case. I've queued it up for review by a dev, as time permits. D. On Tue, Jun 7, 2011 at 1:33 PM, Gordon Tillman wrote: > Guys I have put together a simple test to reproduce the error that we are > seeing. > It is on github here: > https://github.com/gordyt/riaksearch

Pruning (merging) after storage reaches a certain size?

2011-06-07 Thread Steve Webb
Hello there. I'm loading a 2-node (1GB mem, 20GB storage, vmware VMs) riaksearch cluster with the spritzer twitter feed. I used the bitcask 'expiry_secs' to expire data after 3 days. I'm curious - I'm up to about 10GB of storage and I'm guessing that I'll be full in 3-4 more days of ingesti

Re: riaksearch memory growth issues

2011-06-07 Thread Gordon Tillman
Guys I have put together a simple test to reproduce the error that we are seeing. It is on github here: https://github.com/gordyt/riaksearch-test This is a multi-threaded test that connects to Riak using the protocol buffers interface. Each iteration of the run loop issues one simple search a

Re: riak_search moving forward

2011-06-07 Thread David Smith
On Tue, Jun 7, 2011 at 9:02 AM, Jasinek, Jason wrote: > > Are there plans to add in additional Erlang Analyzers that would provide > additional terms?  Analyzers that I would like to see added would including > stemming and ngrams. Hi Jason, We're not currently planning on implementing those s

RE: riak_search moving forward

2011-06-07 Thread Jasinek, Jason
David, Are there plans to add in additional Erlang Analyzers that would provide additional terms? Analyzers that I would like to see added would including stemming and ngrams. Jason -Original Message- From: riak-users-boun...@lists.basho.com [mailto:riak-users-boun...@lists.basho.com

Re: Correct way to use pbc/mapreduce to do multiget where keys and bucket names are binary values?

2011-06-07 Thread Russell Brown
On 7 Jun 2011, at 15:29, Jacques wrote: > Have you had any success with reading the server response? Yes. Sorry I didn't post a reply. It was trivial (but fraught, see below), I just used the OtpInputStream to deserialize the byte array returned from pbc.MapReduceResponse. Like: By

Re: Correct way to use pbc/mapreduce to do multiget where keys and bucket names are binary values?

2011-06-07 Thread Jacques
Have you had any success with reading the server response? Thanks, Jacques On Sat, Jun 4, 2011 at 1:19 PM, Russell Brown wrote: > > On 4 Jun 2011, at 18:22, Jacques wrote: > > I like the sound of option 3 also. I'll have a look at it this weekend and >> get back to you. >> > > Awesome! Thanks.

Re: Finding Value greater then anyvalue

2011-06-07 Thread Rusty Klophaus
Hi Muhammad, Riak Search uses Lucene Query syntax. Because Lucene syntax is meant for text searching, doing a 'greater than' query is somewhat convoluted, but still possible: *riakc_pb_socket:search(Client, "player", "score: {500 TO 99]")* * * That tells Riak Search to query the "players"

Finding Value greater then anyvalue

2011-06-07 Thread Muhammad Yousaf
Hi, I am using Riaksearch with erlang PB client by following "https://github.com/basho/riak-erlang-client/"; my schema is { schema, [ {version, "1.1"}, {default_field, "playername"}, {default_op, "or"}, {n_val, 3}, {analyzer_factory, {erlang, tex

Re: Logical operators in Ripple

2011-06-07 Thread Sean Cribbs
If this is done incorrectly in Ripple, please file an issue[1] on Github -- or even better, send a pull-request. We should get this fixed on the wiki[2] ASAP as well. Sean Cribbs Developer Advocate Basho Technologies, Inc. http://basho.com/ [1] https://github.com/seancribbs/ripple/issues [2]