Would something like the following work?
row=time_client id_client counter
Where the client id is a unique id per client instance, it would be
allocated once using Zookeeper or an Accumulo Conditional writer when the
client starts. The client counter would be an AtomicLong in the client.
On
This sounds super close to a type 1 UUID -
https://en.wikipedia.org/wiki/Universally_unique_identifier#Version_1_.28MAC_address_.26_date-time.29
On Tue, Jun 23, 2015 at 8:14 AM, Keith Turner ke...@deenlo.com wrote:
Would something like the following work?
row=time_client id_client counter
That solution might be prone to duplicates if the same document is
encountered by multiple ingest clients.
Another option might be:
row=time_hash(document)
--
Christopher L Tubbs II
http://gravatar.com/ctubbsii
On Tue, Jun 23, 2015 at 9:14 AM, Keith Turner ke...@deenlo.com wrote:
Would
Hi All,
I have an application which can index data at very high rate from
multiple clients. I need to generate a unique id to store documents.
It Should
(1) use the current system time in millies.
(2) it should be designed to sort lexicographically on the bases of time.
(3) if I just store the
Thanks Josh. It really worked for me.
On Wednesday 17 June 2015 08:43 PM, Josh Elser wrote:
Madhvi,
Understood. A few more questions..
How are you passing these IDs to the batch scanner? Are you providing
individual Ranges for each ID (e.g. `new Range(new Key(row1, ,
id1), true, new