Re: [ANNOUNCE] HBaseConAsia 2018 CFP now open!

2018-05-16 Thread Yu Li
Thanks for adding the link Sean!

bq. Any plans to similarly include a Chinese language version of the
conference page?
Yes, the local PC is preparing a Chinese website (instead of a nested
translation) and will add a link as soon as it is complete. The to-be-updated
main page would look pretty much like:

Sounds good? Thanks.


Best Regards,
Yu

On 17 May 2018 at 03:07, Sean Busbey  wrote:

> This is great! I updated the hbase landing page to point at the conference
> page.
>
> I see the CFP is in English and Chinese. Any plans to similarly
> include a Chinese language version of the conference page?
>
> On Wed, May 16, 2018 at 9:15 AM, Yu Li  wrote:
> > The website (https://hbase.apache.org/hbaseconasia-2018/) is available now
> > and will be updated continually, FYI.
> >
> > Best Regards,
> > Yu
> >
> > On 15 May 2018 at 14:48, Yu Li  wrote:
> >
> >> Yes, it's not available yet but will be soon (as stated in the
> >> announcement but probably ignored at a glance) :-)
> >>
> >> Best Regards,
> >> Yu
> >>
> >> On 15 May 2018 at 14:29, OpenInx  wrote:
> >>
> >>> Hi
> >>>
> >>> The https://hbase.apache.org/hbaseconasia-2018/ is returning 404 Not Found
> >>> now ...
> >>>
> >>> On Tue, May 15, 2018 at 1:40 PM, Yu Li  wrote:
> >>>
> >>> > All,
> >>> >
> >>> > I'm pleased to announce HBaseConAsia 2018 which is to be held in Beijing,
> >>> > China on Aug. 17th.
> >>> >
> >>> > A call for proposals is available now[1], and we encourage all HBase users
> >>> > and developers to contribute a talk[2] and plan to attend the event
> >>> > (however, event registration is not yet available).
> >>> >
> >>> > We will update more details for the event at [3] (not available yet but
> >>> > will be soon), please watch it and feel free to ask the
> >>> > d...@hbase.apache.org
> >>> >  mailing list or myself if any questions.
> >>> >
> >>> > Thanks and please start planning those talks!
> >>> >
> >>> > - Yu (on behalf of the HBase PMC)
> >>> >
> >>> > [1] https://easychair.org/cfp/hbaseconasia-2018
> >>> > [2] https://easychair.org/conferences/?conf=hbaseconasia2018
> >>> > [3] https://hbase.apache.org/hbaseconasia-2018/
> >>> >
> >>>
> >>>
> >>>
> >>> --
> >>> ==
> >>> Openinx  blog : http://openinx.github.io
> >>>
> >>> TO BE A GREAT HACKER !
> >>> ==
> >>>
> >>
> >>
>


Re: How to improve HBase read performance.

2018-05-16 Thread Kang Minwoo
Here is the information:

We store about 5 billion logs a day.
The size of each log is about 4k.
Compression is Snappy.

Best regards,
Minwoo Kang
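
A back-of-the-envelope check of what those figures imply, as a minimal sketch. It assumes "4k" means 4 KiB per log and that writes are spread evenly over the day; neither is stated in the thread, and the sizes are before Snappy compression.

```python
# Rough sizing from the figures in this message.
# Assumptions (not from the thread): "4k" = 4 KiB, writes evenly spread.
LOGS_PER_DAY = 5_000_000_000
BYTES_PER_LOG = 4 * 1024
SECONDS_PER_DAY = 24 * 60 * 60

# Uncompressed volume written per day, in bytes and in TiB.
daily_bytes = LOGS_PER_DAY * BYTES_PER_LOG
daily_tib = daily_bytes / 1024**4

# Average ingest rate if the load were perfectly even.
logs_per_second = LOGS_PER_DAY / SECONDS_PER_DAY
mib_per_second = daily_bytes / SECONDS_PER_DAY / 1024**2

print(f"raw volume per day : {daily_tib:.1f} TiB")
print(f"average logs/second: {logs_per_second:,.0f}")
print(f"average MiB/second : {mib_per_second:.0f}")
```

Under these assumptions the table takes in roughly 20 TB of raw log data per day, so a read that scans even a single day of logs has to move a very large volume of data.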


From: Ted Yu
Sent: Thursday, May 17, 2018 00:01
To: user@hbase.apache.org
Subject: Re: How to improve HBase read performance.

bq. store a lot of logs in HBase

Kang:
Can you tell us a bit more about how you store (and access) the logs:
the size of each log, and whether the log is encoded in HBase?

ORC is a columnar format, while HBase uses a different format.

Thanks

On Wed, May 16, 2018 at 6:41 AM, Marcell Ortutay 
wrote:

> This ticket: https://issues.apache.org/jira/browse/HBASE-20459 was fixed in
> the latest version of HBase, upgrading to latest may help with performance
>
> On Wed, May 16, 2018 at 3:55 AM, Kang Minwoo 
> wrote:
>
> > Hi, Users.
> >
> > I store a lot of logs in HBase.
> > However, the reading speed of the log is too slow compared to the Hive ORC
> > file.
> > I know that HBase is slow compared to the Hive ORC file.
> > The problem is that it is too slow.
> > HBase is about 6 times slower.
> >
> > Is there a good way to speed up HBase's reading speed?
> > Should I put a lot of servers?
> >
> > I am using HBase 1.2.6.
> >
> > Best regards,
> > Minwoo Kang
> >
>


Re: [ANNOUNCE] HBaseConAsia 2018 CFP now open!

2018-05-16 Thread Sean Busbey
This is great! I updated the hbase landing page to point at the conference page.

I see the CFP is in English and Chinese. Any plans to similarly
include a Chinese language version of the conference page?

On Wed, May 16, 2018 at 9:15 AM, Yu Li  wrote:
> The website (https://hbase.apache.org/hbaseconasia-2018/) is available now
> and will be updated continually, FYI.
>
> Best Regards,
> Yu
>
> On 15 May 2018 at 14:48, Yu Li  wrote:
>
>> Yes, it's not available yet but will be soon (as stated in the
>> announcement but probably ignored at a glance) :-)
>>
>> Best Regards,
>> Yu
>>
>> On 15 May 2018 at 14:29, OpenInx  wrote:
>>
>>> Hi
>>>
>>> The https://hbase.apache.org/hbaseconasia-2018/ is returning 404 Not Found
>>> now ...
>>>
>>> On Tue, May 15, 2018 at 1:40 PM, Yu Li  wrote:
>>>
>>> > All,
>>> >
>>> > I'm pleased to announce HBaseConAsia 2018 which is to be held in Beijing,
>>> > China on Aug. 17th.
>>> >
>>> > A call for proposals is available now[1], and we encourage all HBase users
>>> > and developers to contribute a talk[2] and plan to attend the event
>>> > (however, event registration is not yet available).
>>> >
>>> > We will update more details for the event at [3] (not available yet but
>>> > will be soon), please watch it and feel free to ask the
>>> > d...@hbase.apache.org
>>> >  mailing list or myself if any questions.
>>> >
>>> > Thanks and please start planning those talks!
>>> >
>>> > - Yu (on behalf of the HBase PMC)
>>> >
>>> > [1] https://easychair.org/cfp/hbaseconasia-2018
>>> > [2] https://easychair.org/conferences/?conf=hbaseconasia2018
>>> > [3] https://hbase.apache.org/hbaseconasia-2018/
>>> >
>>>
>>>
>>>
>>> --
>>> ==
>>> Openinx  blog : http://openinx.github.io
>>>
>>> TO BE A GREAT HACKER !
>>> ==
>>>
>>
>>


Re: How to improve HBase read performance.

2018-05-16 Thread Ted Yu
bq. store a lot of logs in HBase

Kang:
Can you tell us a bit more about how you store (and access) the logs:
the size of each log, and whether the log is encoded in HBase?

ORC is a columnar format, while HBase uses a different format.

Thanks

On Wed, May 16, 2018 at 6:41 AM, Marcell Ortutay 
wrote:

> This ticket: https://issues.apache.org/jira/browse/HBASE-20459 was fixed in
> the latest version of HBase, upgrading to latest may help with performance
>
> On Wed, May 16, 2018 at 3:55 AM, Kang Minwoo 
> wrote:
>
> > Hi, Users.
> >
> > I store a lot of logs in HBase.
> > However, the reading speed of the log is too slow compared to the Hive ORC
> > file.
> > I know that HBase is slow compared to the Hive ORC file.
> > The problem is that it is too slow.
> > HBase is about 6 times slower.
> >
> > Is there a good way to speed up HBase's reading speed?
> > Should I put a lot of servers?
> >
> > I am using HBase 1.2.6.
> >
> > Best regards,
> > Minwoo Kang
> >
>


Re: [ANNOUNCE] HBaseConAsia 2018 CFP now open!

2018-05-16 Thread Yu Li
The website (https://hbase.apache.org/hbaseconasia-2018/) is available now
and will be updated continually, FYI.

Best Regards,
Yu

On 15 May 2018 at 14:48, Yu Li  wrote:

> Yes, it's not available yet but will be soon (as stated in the
> announcement but probably ignored at a glance) :-)
>
> Best Regards,
> Yu
>
> On 15 May 2018 at 14:29, OpenInx  wrote:
>
>> Hi
>>
>> The https://hbase.apache.org/hbaseconasia-2018/ is returning 404 Not Found
>> now ...
>>
>> On Tue, May 15, 2018 at 1:40 PM, Yu Li  wrote:
>>
>> > All,
>> >
>> > I'm pleased to announce HBaseConAsia 2018 which is to be held in Beijing,
>> > China on Aug. 17th.
>> >
>> > A call for proposals is available now[1], and we encourage all HBase users
>> > and developers to contribute a talk[2] and plan to attend the event
>> > (however, event registration is not yet available).
>> >
>> > We will update more details for the event at [3] (not available yet but
>> > will be soon), please watch it and feel free to ask the
>> > d...@hbase.apache.org
>> >  mailing list or myself if any questions.
>> >
>> > Thanks and please start planning those talks!
>> >
>> > - Yu (on behalf of the HBase PMC)
>> >
>> > [1] https://easychair.org/cfp/hbaseconasia-2018
>> > [2] https://easychair.org/conferences/?conf=hbaseconasia2018
>> > [3] https://hbase.apache.org/hbaseconasia-2018/
>> >
>>
>>
>>
>> --
>> ==
>> Openinx  blog : http://openinx.github.io
>>
>> TO BE A GREAT HACKER !
>> ==
>>
>
>


Re: How to improve HBase read performance.

2018-05-16 Thread Marcell Ortutay
This ticket: https://issues.apache.org/jira/browse/HBASE-20459 was fixed in
the latest version of HBase. Upgrading to the latest version may help with
performance.

On Wed, May 16, 2018 at 3:55 AM, Kang Minwoo 
wrote:

> Hi, Users.
>
> I store a lot of logs in HBase.
> However, the reading speed of the log is too slow compared to the Hive ORC
> file.
> I know that HBase is slow compared to the Hive ORC file.
> The problem is that it is too slow.
> HBase is about 6 times slower.
>
> Is there a good way to speed up HBase's reading speed?
> Should I put a lot of servers?
>
> I am using HBase 1.2.6.
>
> Best regards,
> Minwoo Kang
>


How to parse data stored in child node of ZK /hbase/region-in-transition

2018-05-16 Thread Nand kishor Bansal
Hi Users,

I'm trying to find out whether any region is going through a split by looking
at the data in ZooKeeper.

I triggered a region split manually and captured the data from child ZK
node /hbase/region-in-transition which corresponded to the region.

"ÿ^@^@^@^Tmaster:6^R!¯^R<83><9d>^OJPBUF^H3^R4bar,,1526471941526.ac8b42de46021dcbf3d597326eb60de1.^X«<8f>Óƶ,"^V
localhost^PôÔ^C^X÷<84>ÊŶ,*^@"

But when I tried to parse the above data using the protobuf class
ClusterStatusProtos.RegionInTransition, it failed to parse.

Is there a way to parse this data and determine whether it corresponds
to a region in the SPLIT or SPLITTING state?

I am using HBase 1.2.0 and ZooKeeper 3.4.8.


Thanks,

Nand
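
One likely reason the parse fails, offered as an assumption rather than a confirmed diagnosis: in HBase 1.x the znode value does not appear to be a bare protobuf message. As far as I know, the ZooKeeper wrapper (RecoverableZooKeeper) prepends a magic byte 0xFF, a 4-byte big-endian length, and that many bytes of writer id, and the protobuf payload itself is then preceded by the 4-byte marker "PBUF". The dump above is consistent with this: it starts with ÿ (0xFF) followed by ^@^@^@^T (length 20), and "PBUF" shows up shortly after "master:". The sketch below strips both prefixes; the function name and the sample bytes are hypothetical, made up for illustration.

```python
import struct

ZK_MAGIC = 0xFF      # assumed: first byte of values written via HBase's ZK wrapper
PB_MAGIC = b"PBUF"   # assumed: marker HBase puts in front of protobuf payloads


def strip_hbase_znode_prefix(data: bytes) -> bytes:
    """Return the bare protobuf bytes of an HBase znode value (hypothetical helper)."""
    if not data or data[0] != ZK_MAGIC:
        return data  # no metadata prefix, hand the value back unchanged
    if len(data) < 5:
        raise ValueError("value too short to hold the id length")
    # 4-byte big-endian length of the writer id that follows the magic byte.
    (id_length,) = struct.unpack(">I", data[1:5])
    offset = 1 + 4 + id_length
    if offset > len(data):
        raise ValueError("id length points past the end of the value")
    payload = data[offset:]
    # Drop the protobuf marker if it is present.
    if payload.startswith(PB_MAGIC):
        payload = payload[len(PB_MAGIC):]
    return payload


# Synthetic example value (not the real dump): 20-byte id, marker, 2 payload bytes.
sample = b"\xff" + struct.pack(">I", 20) + b"x" * 20 + PB_MAGIC + b"\x08\x0a"
print(strip_hbase_znode_prefix(sample))
```

The remaining bytes would then be fed to a protobuf parser. To my knowledge the message stored under /hbase/region-in-transition in HBase 1.x is ZooKeeperProtos.RegionTransition, not ClusterStatusProtos.RegionInTransition, so that class is the one to try; please verify this against the source of the version in use.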


How to improve HBase read performance.

2018-05-16 Thread Kang Minwoo
Hi, Users.

I store a lot of logs in HBase.
However, reading the logs is too slow compared to reading a Hive ORC file.
I know that HBase is slower than a Hive ORC file, but the gap is too large:
HBase is about 6 times slower.

Is there a good way to speed up HBase reads?
Should I add a lot more servers?

I am using HBase 1.2.6.

Best regards,
Minwoo Kang