Riak Search Bulk Delete (1.4.8)

2014-08-14 Thread Mark Richard Thomas
Hello How do I bulk delete from an index? I tried the following but nothing was deleted: search-cmd solr my_bucket /delete.xml Where delete.xml contains: deletequery*:*/query/delete Interestingly, this query does not return any documents: search-cmd search my_bucket *:* Mark Thomas |

Riak Search - Fast Bulk Insert

2014-08-14 Thread Mark Richard Thomas
Hello What's the fastest way (best practice) to insert 20 million documents into a Riak Search index? search-cmd solr my_bucket /insert.xml For a proof-of-concept I've create a file (index.html) containing 100,000 documents: add doc/doc doc/doc : /add Thanks Mark Thomas | Software Engineer

Re: Riak Search - Fast Bulk Insert

2014-08-14 Thread Eric Redmond
Note that the search-cmd is for search pre 2.0, which does not use solr. If you're planning on using the new Solr based search, you'll need to run Riak 2.0, and write an import script as Dmitri pointed out. Eric Redmond, Engineer @ Basho On Thu, Aug 14, 2014 at 7:38 AM, Dmitri Zagidulin

Re: Riak Search - Fast Bulk Insert

2014-08-14 Thread Alexander Sicular
And, afaik, a single index.xml file with multiple docs should probably be broken up into one file per doc to make better use of the parallelism already mentioned. Regards, Alexander @siculars http://siculars.posthaven.com Sent from my iRotaryPhone On Aug 14, 2014, at 10:43, Eric Redmond

Slow bucket create/delete in Riak CS

2014-08-14 Thread David Whittington
In testing Riak CS I've noticed that buckets that have had a lot of object updates take a while to delete even if they're empty at the time of deletion. Also, after deletion, creating a new bucket with the same name takes a similar amount of time. This effect seems to persist indefinitely. Here's

Re: Slow bucket create/delete in Riak CS

2014-08-14 Thread Kota Uenishi
David, I'm sure your version of Riak CS is 1.5.0. At 1.5.0 we fixed bug #475 by traversing whole bucket in creation and deletion [1]. This traverse reads all objects in the bucket, even if it is already deleted. Deleted objects remains physically for a while - (called leeway seconds) - with it's