Re: Performance Question

2016-07-18 Thread Todd Lipcon
K, sounds good. The upgrade shouldn't invalidate any performance testing or anything -- just fixes this important bug. -Todd > On Jul 18, 2016, at 10:29 AM, Todd Lipcon <t...@cloudera.com> wrote: > > Hi Ben, > > Any chance that you are running Kudu 0.9.0 instead of 0.9.1? There'

Re: Performance Question

2016-07-18 Thread Todd Lipcon
nknown) > > Does anyone know what this means? > > Thanks, > Ben > > > On Jul 11, 2016, at 10:47 AM, Todd Lipcon <t...@cloudera.com> wrote: > > On Mon, Jul 11, 2016 at 10:40 AM, Benjamin Kim <bbuil...@gmail.com> wrote: > >> Todd, >> >&g

Re: Performance Question

2016-07-11 Thread Todd Lipcon
currently the only easy way to recover is to recreate the table. If this sounds really painful, though, maybe we can work up some kind of tool you could use to just recreate the missing tablets (with those rows lost). -Todd > > On Jul 11, 2016, at 10:37 AM, Todd Lipcon <t...@cloudera.com&

Re: Performance Question

2016-07-11 Thread Todd Lipcon
gt; analysis and thorough benchmarking, but it is certainly a goal of Kudu to > be a great platform for ingesting and analyzing data through Spark. Up > till this point most of the Spark work has been community driven, but more > thorough integration testing of the Spark connector is go

Re: Performance Question

2016-07-01 Thread Todd Lipcon
gate queries look longer from 6s to 9s. > > I’m still impressed. > We aim to please ;-) If you have any interest in writing up these experiments as a blog post, would be cool to post them for others to learn from. -Todd > On Jun 15, 2016, at 12:47 AM, Todd Lipcon <t...@cloude

[ANNOUNCE] Apache Kudu (incubating) 0.9.1 released

2016-07-01 Thread Todd Lipcon
The Apache Kudu (incubating) team is happy to announce the release of Kudu 0.9.1! Kudu is an open source storage engine for structured data which supports low-latency random access together with efficient analytical access patterns. It is designed within the context of the Apache Hadoop ecosystem

Re: Performance Question

2016-06-29 Thread Todd Lipcon
> As you can see, we are using UUID java library to create the key. > OK. You will have better insert performance if instead your key is something that is increasing with time (eg System.currentTimeMillis() + UUID). -Todd > On Jun 29, 2016, at 1:32 PM, Todd Lipcon <t...@cloudera.com>

Re: Performance Question

2016-06-29 Thread Todd Lipcon
n't fit comfortably in cache. This is one area that HBase is fundamentally going to be faster based on its design. -Todd > On Jun 28, 2016, at 4:26 PM, Todd Lipcon <t...@cloudera.com> wrote: > > Cool, thanks for the report, Ben. For what it's worth, I think there's > still some low

Re: Performance Question

2016-06-28 Thread Todd Lipcon
in our setup. We have the compute cluster >> with Spark, YARN, etc. on its own, and we have the storage cluster with >> HBase, Kudu, etc. on another. We beefed up the hardware specs on the >> compute cluster and beefed up storage capacity on the storage cluster. We >> g

Re: Performance Question

2016-06-15 Thread Todd Lipcon
rier", "pop", "hostname", "profile_expires", "timestamp_iso", "reference_id", "identity_organization", "identity_method" Most queries are like counts of how many users use what browser, how many are unique users, etc. The

Re: Performance Question

2016-06-15 Thread Todd Lipcon
eally do > some conclusive tests? I want to see if I can match your results on my 50 > node cluster. > > Thanks, > Ben > > On May 30, 2016, at 10:33 AM, Todd Lipcon <t...@cloudera.com> wrote: > > On Sat, May 28, 2016 at 7:12 AM, Benjamin Kim <bbuil...@gmail.com&

Re: Imroving the insert peformance with INSERT INTO SELECT - gFlagfile

2016-05-30 Thread Todd Lipcon
Hi Amit Answers below On Sun, May 29, 2016 at 11:37 AM, Amit Adhau wrote: > Hi, > > What is the significance of using below gflags, which can help in imroving > the insert peformance with INSERT INTO SELECT clause. > > --num_tablets_to_open_simultaneously=8 > This only

Re: Performance Question

2016-05-27 Thread Todd Lipcon
efer to use Kudu since we are already a Cloudera shop, >> and Kudu is easy to deploy and configure using Cloudera Manager. I also >> hope that some of Aerospike’s speed optimization techniques can make it >> into Kudu in the future, if they have not been already thought of or >> included. >> >> Just some thoughts… >> >> Cheers, >> Ben > > > > -- > -- > Mike Percy > Software Engineer, Cloudera > > > > -- Todd Lipcon Software Engineer, Cloudera

Re: Kudu Data Storage Size Mistmatch On dashboard / data folder

2016-04-28 Thread Todd Lipcon
> > La información contenida en este mensaje puede ser confidencial. Ha sido > enviada para el uso exclusivo del destinatario(s) previsto. Si el lector de > este mensaje no fuera el destinatario previsto, por el presente queda Ud. > notificado que cualquier lectura, uso, publicación, diseminación, > distribución o copiado de esta comunicación o su contenido está > estrictamente prohibido. En caso de que Ud. hubiera recibido este mensaje > por error le agradeceremos notificarnos por e-mail inmediatamente y > eliminarlo de su sistema. Muchas gracias. > > -- Todd Lipcon Software Engineer, Cloudera

Re: Exception at inserting big amount of data

2016-04-27 Thread Todd Lipcon
e puede ser confidencial. Ha sido > enviada para el uso exclusivo del destinatario(s) previsto. Si el lector de > este mensaje no fuera el destinatario previsto, por el presente queda Ud. > notificado que cualquier lectura, uso, publicación, diseminación, > distribución o copiado de esta comunicación o su contenido está > estrictamente prohibido. En caso de que Ud. hubiera recibido este mensaje > por error le agradeceremos notificarnos por e-mail inmediatamente y > eliminarlo de su sistema. Muchas gracias. > > -- Todd Lipcon Software Engineer, Cloudera

Blog post on some YCSB optimization

2016-04-27 Thread Todd Lipcon
Hi all, In case you don't use RSS to follow the blog, I figured I'd ping the mailing list. I just published a post about some exploration I've been doing using YCSB lately. Users might find it interesting: http://getkudu.io/2016/04/26/ycsb.html -Todd -- Todd Lipcon Software Engineer, Cloudera

Re: Weekly update 4/25

2016-04-26 Thread Todd Lipcon
is doing to make sure it's at least as good. -Todd > > > *From:* Todd Lipcon [mailto:t...@cloudera.com] > *Sent:* Tuesday, April 26, 2016 12:59 PM > > *To:* user@kudu.incubator.apache.org > *Subject:* Re: Weekly update 4/25 > > > > On Tue, Apr 26, 2016 at 8:28 AM,

Re: where is kudu's dump core located?

2016-04-06 Thread Todd Lipcon
, Todd Lipcon <t...@cloudera.com> wrote: > BTW, I filed https://issues.apache.org/jira/browse/KUDU-1396 for this > bug. Thanks for helping us track it down! > > On Tue, Apr 5, 2016 at 11:05 PM, Todd Lipcon <t...@cloudera.com> wrote: > >> Hi Darren, >> >> Than

Re: Unsubscribe

2016-02-24 Thread Todd Lipcon
Please email user-unsubscribe@ -Todd On Wed, Feb 24, 2016 at 10:48 AM, Andrea Ferretti <ferrettiand...@gmail.com> wrote: > -- Todd Lipcon Software Engineer, Cloudera

Re: [KUDU Tablet]unrecoverable crash

2016-02-19 Thread Todd Lipcon
BTW, you may not want to 'rm' but rather move them aside so that you don't lose data. -Todd On Fri, Feb 19, 2016 at 5:28 PM, Todd Lipcon <t...@cloudera.com> wrote: > If you have a replicated cluster, it's likely that the master already > re-replicated non-corrupt versions of t

Re: [KUDU Tablet]unrecoverable crash

2016-02-19 Thread Todd Lipcon
in > {5963266013874118554})}] > > It throws the same fatal error and crashes immediately no matter how many > times i try to restart the service. > > Any ideas to get out of this situation? I don't want to lose the data. > > > --Nick > -- Todd Lipcon Software Engineer, Cloudera