Re: HBase Meetup April 2023

2023-04-19 Thread Tak Lon (Stephen) Wu
Hi everyone,

Thank you for attending our first meetup of 2023 on 04/19/2023. Below you
can find the resources from the event:

* Slides [1]
* Recorded video (including chat history) [2]
* Notes [3][4]

During the meetup, we covered several topics:

1. HBase-2.6
 - We discussed HBase-2.6 and will soon begin release tasks.
2. HBase-3.0 and further roadmap
 - We talked about HBase-3.0 and its further roadmap. Some of the major
   blockers for the HBase-3.0 release are stabilizing the runtime,
   conducting significant tests and validations, and having early adopters
   who have already run 3.0 in production.
 - We also need a discussion thread on what tests should be covered for
   major releases.
 - It would be useful to have testing infrastructure in Kubernetes to
   quickly double the number of tests performed.
3. Community Over Code (formerly ApacheCon) 2023
 - We need to start a discussion thread on collecting ideas and structuring
   our highlights/visions. It would be best to have use cases or support
   cases from development teams or business partners.
 - Members of the HBase community who are likely to give the presentations
   should structure a generalized topic to discuss HBase.
 - The deadline for the call for presentations is 00:01 UTC on July 13th,
   2023.
4. HBase on K8s
 - Good work has been done in different $dayjob places with various
   implementations, such as
   - WAL on BookKeeper or HDFS, and HFiles on S3, Azure Blob Storage or HDFS
   - We should reconsider the implementation of the Write-Ahead Log (WAL)
     and explore the possibility of implementing it with either a
     stream-based or a file-based approach. This may require us to modify
     the WAL interface accordingly, and we will need another round of
     discussions among the developers to determine the best approach.
   - Write-heavy and read-heavy use cases and data caching were also
     mentioned; see more in [4].
 - We discussed the need for dedicated threads to identify the common areas
   that the HBase community should take ownership of, e.g.
   - common interfaces or functional requirements in HDFS / HDFS clients,
     ZooKeeper
   - modernizing the common infrastructure, e.g. what options are available
     for WAL and for HFiles. This item will be discussed in separate
     development threads.
   - a basic example of running HBase on K8s, e.g. for testing or
     standalone mode
   - a thread to request that Apache infra or other sponsors provide the
     infra and resources
5. HBase with Ozone
 - Wei-chiu presented the current state of Ozone, its architecture, and
   the development work on supporting HBase. Thanks a lot.
 - There was a request that Ozone be cloud native, as an alternative
   solution for self-managed storage.


We suggested several future activities, such as
1. setting up a follow-up meetup in two weeks to discuss the remaining
   topics,
2. establishing regular quarterly meetups,
3. gathering presentation ideas for the community for the next conferences,
   such as Community Over Code and ApacheCon Asia,
4. leaving any other discussions to be started by the respective feature
   developers.

Remaining topics for next meetup
* HBase Backup
* Deploy HBase on Cloud without any self-deployed services
* Security/TLS/mTLS
* RIT issues
* High Availability story with HBase 3.0+?

[1]
https://docs.google.com/presentation/d/1xf7DWbEN8-tdiEvPLD6AE2xRB_dBJLFuGMff1QoQKNE/edit#slide=id.g22f8eeb42ae_0_15
[2]
https://cloudera.zoom.us/rec/share/UuGS5IgT3mIXi-ceyhHIMRhqfXW5sJ9cVdF7P9TxwZWLyHBzQ4qrOg3wWxnwXYNH.2iuEHkknZaJR_YZ-
(password will be sent to the HBase Slack chat room)
[3] chat history in Zoom,
https://drive.google.com/file/d/14qn3-2Ug7rW_1ILFMIq7TGMWA_XCpXQw/view?usp=share_link
[4] notes taken by our team,
https://drive.google.com/file/d/12G4gI0ECDTAXz5Ci5aodcBCr8gP8DPLx/view?usp=share_link

Thanks,
Stephen

On Thu, Apr 13, 2023 at 11:26 AM Tak Lon (Stephen) Wu 
wrote:

> Hi everyone,
>
> For those who didn't put an email on the signup, I searched and sent the
> invite to your apache.org email. If you don't have the meeting invite
> or would like to have it on your calendar, please send me your preferred
> Gmail / email address, and I will resend the invite to you directly.
> Sorry for that.
>
> See you soon on 4/19, and please feel free to ping me if you have any
> questions.
>
> Thanks,
> Stephen
>
> On Sat, Apr 8, 2023 at 10:28 AM Tak Lon (Stephen) Wu 
> wrote:
>
>> Hi guys,
>>
>> The final date will be Wednesday 4/19 at 7:30 AM PDT because we have more
>> folks joining (and I will cancel the invites on 4/12).
>>
>> The physical location is 5470 Great America Pkwy, Santa Clara, CA 95054
>> (Cloudera building, Room Ontario); the virtual meeting link is
>> https://cloudera.zoom.us/j/91365181291 (I'm reusing my $dayjob room).
>>
>> If you would like to join and didn't receive the invite (sorry if I missed
>> you on the signup sheet), please let me know directly.
>>
>> See you soon!
>>
>> Thanks,
>> Stephen
>>
>>
>>
>> On Wed, Apr 5, 2023 at 4:06 PM Tak 
