Re: [VOTE] Apache Accumulo 2.1.4-rc2

2025-08-20 Thread Keith Turner
Looked through the changes since rc0 and updated the 2.1.4 release notes. Pushed two commits to the website repo. https://github.com/apache/accumulo/compare/2.1.4-rc0...2.1.4-rc2 On Tue, Aug 12, 2025 at 9:37 PM Christopher wrote: > Accumulo Developers, > > Please consider the following candidat

Re: [VOTE] Apache Accumulo 2.1.4-rc2

2025-08-19 Thread Keith Turner
+1 Verified tar ball hashes and signatures. Built src tar and compared src tar to branch. Looked at changes since rc0. On Tue, Aug 12, 2025 at 9:37 PM Christopher wrote: > Accumulo Developers, > > Please consider the following candidate for Apache Accumulo 2.1.4. > (note: rc1 was abandoned befo

Re: [TEST][VOTE] Apache Accumulo 2.1.4-rc0

2025-07-02 Thread Keith Turner
Spun up a single node instance using the bin tgz from the staging repo and built accumulo testing against the jars in the staging repo. Then did a short successful run of continuous ingest and bulk random walk which were both successful. Tried running a bulk random walk test with external compact

Re: [TEST][VOTE] Apache Accumulo 2.1.4-rc0

2025-06-30 Thread Keith Turner
Looked over the commits since 2.1.3 and made a few updates to the release notes. Pushed those to the website. On Sat, Jun 28, 2025 at 12:02 AM Christopher wrote: > Accumulo Developers, > > This is a test release candidate only. There is no need to vote. > However, please consider using this for

Re: New committer: John Kucera

2025-06-09 Thread Keith Turner
Congrats and welcome John. On Wed, Jun 4, 2025 at 8:54 PM Christopher wrote: > Hi Accumulo Devs, > > The Project Management Committee (PMC) for Apache Accumulo > has invited John Kucera (jkucera / @meatballspaghetti on GitHub) > to become a committer and we are pleased to announce that they > ha

Re: Hello

2025-06-03 Thread Keith Turner
On Tue, Jun 3, 2025 at 11:37 AM Robert Kurc wrote: > Hi all! > > I am a high-school student who is interested in contributing to the > Accumulo project, but I'm not entirely sure where to start. Would anyone be > willing to help? > I can try to help. A good first step towards contributing to any

Re: [DISCUSS] Reconsidering upcoming release versions

2025-03-07 Thread Keith Turner
That proposal sounds good to me. The 3.1 branch could be deleted sooner rather than later. Creating the 3.1-deprecations branch does not seem urgent and could be done at any time. When the 3.1-deprecations branch does exists will need to agree on how to do merges from 2.1. Seems like merges fro

Re: Scan States Explained

2024-08-27 Thread Keith Turner
not find one. Wrote a test and found the batch scanner was not closing sessions when the batch scanner object was closed. Opened https://github.com/apache/accumulo/pull/4841 to fix the batch scanner and add new tests. Keith On Fri, Aug 23, 2024 at 5:47 PM Logan Jones wrote: > Hi Ke

Re: Scan States Explained

2024-08-23 Thread Keith Turner
Logan, I documented the meaning of those values in the following PR https://github.com/apache/accumulo/pull/4827 Does that help? Keith On Fri, Aug 23, 2024 at 7:26 AM Logan Jones wrote: > Hello > > I was hoping to understand the different scan states that can be > returned by t

Re: [VOTE] Apache Accumulo 2.1.3-rc1

2024-07-31 Thread Keith Turner
+1 binding * Ran mvn verify -Psunny on source tar contents * Verified source tar contents matched 2.13-rc1 tag * Verified hash and signatures for tar files * Attempted to build Fluo against the staging repo and ran into this issue[1] with the accumulo2-maven-plugin and 2.1.3 RC1. Was able to

Re: Welcome new committer, Kevin!

2024-07-26 Thread Keith Turner
Welcome and congratulations On Thu, Jul 25, 2024 at 6:34 PM Christopher wrote: > Hi Accumulo Devs, > > I just want to offer my welcome to our newest committer and PMC > member, Kevin Rathbun! > > ASF Id: krathbun > GitHub: kevinrr888 > > Best regards, > Christopher >

Re: [DISCUSS] Classic GitHub Projects sunsetting

2024-07-03 Thread Keith Turner
Using milestones and selecting the earliest release where an issue will land sounds good to me. For planning purposes we generally want to know what is the oldest branch a change should land in and milestones work perfectly for this. For retrospective purposes milestones are not perfect, however

Re: [DRAFT][ANNOUNCE] Apache Accumulo Access 1.0.0-beta

2024-02-20 Thread Keith Turner
I see a wrapping issue w/ the following sentence : This initial release of Accumulo Access is designated as beta because the API may change as we work to integrate it into Accumulo. On Tue, Feb 20, 2024 at 8:06 AM Dave Marion wrote: > All, > > Please review and provide feedback on the draft

Re: [VOTE] Apache Accumulo-Access 1.0.0-beta-rc4

2024-02-16 Thread Keith Turner
+1 Verified checksums and signatures Compared source zip to branch Updated #3746 to use the version 1.0.0-beta of Accumulo Access, configured maven to point to the staging repository, ran all accumulo unit test (which tests Accumulo classes that use accumulo-access), and ran ComprehensiveIT (while

Re: [VOTE] Apache Accumulo-Access 1.0.0-rc3

2024-02-12 Thread Keith Turner
+1 Verified checksums Compared source zip to branch Looked over recent changes in branch Built and ran the getting started example against the staging repo : https://github.com/apache/accumulo-access/pull/45 On Mon, Feb 12, 2024 at 9:57 AM Dave Marion wrote: > Accumulo Developers, > > Please c

Re: [VOTE] Apache Accumulo 1.10.4-rc1

2023-11-15 Thread Keith Turner
+1 Looked over the diffs between 1.10.3 and 1.10.4-rc1 On Mon, Nov 13, 2023 at 9:42 AM Christopher wrote: > Accumulo Developers, > > Please consider the following candidate for Apache Accumulo 1.10.4 (a > closeout final release for 1.10) > > Git Commit: > 47ac68d1a220a90bc80618f9684252b243d

Re: [GitHub] [accumulo-access] keith-turner opened a new pull request, #2: adds asf yaml file

2023-09-07 Thread Keith Turner
I overlooked an option for setting the email when creating the new git repository. Hopefully this PR will fix it. On Thu, Sep 7, 2023 at 4:32 PM keith-turner (via GitHub) wrote: > > keith-turner opened a new pull request, #2: > URL: https://github.com/apache/accumulo-access/pull/2 &

Re: New Accumulo git repository

2023-09-07 Thread Keith Turner
w repo > > exists. > > > I believe we have to use non-snapshot dependencies when creating an > > > Accumulo release. Right? I have no issue with creating the repo now and > > no > > > issue with the name, since we can rename it. > > > > > > On W

Re: New Accumulo git repository

2023-09-07 Thread Keith Turner
Accumulo release. Right? I have no issue with creating the repo now and > no > > issue with the name, since we can rename it. > > > > On Wed, Sep 6, 2023 at 3:18 PM Keith Turner wrote: > > > > > I would like to create a new git repository named accumulo-access in >

New Accumulo git repository

2023-09-06 Thread Keith Turner
I would like to create a new git repository named accumulo-access in the ASF and want to know if anyone has thoughts about creating this repo. This new repo would be a home for the Accumulo access module being created in 3715[1]. Once this repository is created, then the following could happen.

Re: New committer - Dan Roberts

2023-08-11 Thread Keith Turner
Welcome and congratulations Dan. On Wed, Aug 9, 2023 at 6:27 PM dev1 wrote: > The Project Management Committee (PMC) for Apache Accumulo has invited Dan > Roberts to become a committer / PMC member and we are pleased to announce > that they have accepted. > > Being a committer enables easier con

Re: Maintaining State in an Iterator

2023-07-12 Thread Keith Turner
Made an inline comment below in an attempt to improve one of my comments. On Wed, Jul 12, 2023 at 12:31 PM Keith Turner wrote: > Responses inline below. > > On Thu, Jul 6, 2023 at 7:02 AM Logan Jones wrote: > >> Hi Keith: >> >> Thanks so much for the respons

Re: Maintaining State in an Iterator

2023-07-12 Thread Keith Turner
Responses inline below. On Thu, Jul 6, 2023 at 7:02 AM Logan Jones wrote: > Hi Keith: > > Thanks so much for the response. We will base things off the > RowEncodingIterator then. > > A few follow up questions out of curiosity: > > >1. It is likely that my iterato

Re: Maintaining State in an Iterator

2023-07-05 Thread Keith Turner
There are two options for this. One is to buffer the row in memory and encode it in your iterator like the whole row iterator does. The other is to use the isolated scanner[1][2], but this does not work for batch scans. Accumulo should not tear iterators down until after they return something, t

Re: [VOTE] Apache Accumulo 2.1.1-rc2

2023-06-16 Thread Keith Turner
+1 Ran CI test w/ agitation on a 10 node cluster for a few hours. Ingested and verified ~8B entries. Used the default accumuo-testing agitation settings. Ran 10x random walkers running the bulk test on a 10 node cluster using a modification[1] to the test graph. Ran this for a few hours. Saw 1

Re: [VOTE] Apache Accumulo 2.1.1-rc1

2023-06-13 Thread Keith Turner
-1 also because of #3491 I think #3491 is a good reason to do a release. Given the timing it does not make sense to me to spend time validating two releases back to back. On Mon, Jun 12, 2023 at 8:06 PM Christopher wrote: > Accumulo Developers, > > Please consider the following candidate for A

Re: [DISCUSS] Should we have Jekyll build directly to asf-site branch?

2023-04-17 Thread Keith Turner
On Wed, Apr 12, 2023 at 12:34 PM Christopher wrote: > > I don't think Dan's suggested flow is much different than what we have > now. In both cases, the steps are: > > 1. Initial PR to a branch (currently against main; Dan's approach > would be against a separate main-like staging branch) > 2. Aut

Re: Dynamic Scaling of Accumulo

2023-04-04 Thread Keith Turner
hosted range, so need to decide which goal state the tablet should end up with. > > On Mon, Apr 3, 2023 at 2:13 PM Keith Turner wrote: > > > On Mon, Apr 3, 2023 at 10:45 AM Dave Marion wrote: > > > > > > Looking through the code to see what would have to change to

Re: Dynamic Scaling of Accumulo

2023-04-03 Thread Keith Turner
gt; > > for bulk import / ScanServers? Or does "on-demand availability" > > > > somehow apply to all interactions, including bulk import and > > > > ScanServer reads? > > > > > > > > I think the "ondemand" state is confusing, becau

Re: Dynamic Scaling of Accumulo

2023-03-28 Thread Keith Turner
n-demand availability for live-ingest > and immediate consistency as a new table state. I think that should be > left as either some kind of automatic internal behavior, or as a > secondary fine-grained control over an online table (like pinned > tablets, either permanently pinned or t

Re: Dynamic Scaling of Accumulo

2023-03-27 Thread Keith Turner
bly sufficient to let the users know the > > risks of bulk importing and never bringing it online for compactions. It > > seems like that's a risk some users might be okay with for their use case. > > > > On Thu, Mar 23, 2023, 19:38 Dave Marion wrote: > > > > >

Re: [DISCUSS] Enable Github wiki in asf.yaml?

2023-02-25 Thread Keith Turner
I would like to try a wiki for design documents, I think it would be less cumbersome than the website and we can always link from the website and issues to the wiki. I think its ok to give it a try and abandon it in the future, if abandoned would just need to properly communicate that. The conten

Re: [DRAFT][ANNOUNCE] Apache Accumulo 2.1.0

2022-11-02 Thread Keith Turner
nges. If the user guide documentation is all in place then the release notes could link to it. > > On Wed, Nov 2, 2022 at 6:53 AM Keith Turner wrote: > > > In the release notes I do not see mention of the new cluster yaml file > > that replaced the tservers, monitor, gc, mas

Re: [DRAFT][ANNOUNCE] Apache Accumulo 2.1.0

2022-11-02 Thread Keith Turner
In the release notes I do not see mention of the new cluster yaml file that replaced the tservers, monitor, gc, master files. Am I just missing it? Thats a really nice user facing change that should be mentioned. On Wed, Nov 2, 2022 at 7:55 AM Christopher wrote: > > The following is a draft anno

Re: [VOTE] Apache Accumulo 2.1.0-rc4

2022-10-31 Thread Keith Turner
+1 after reading about the testing Dave did. On Thu, Oct 27, 2022 at 7:02 AM Christopher wrote: > > Accumulo Developers, > > Please consider the following candidate for Apache Accumulo 2.1.0. > The only change since 2.1.0-rc3 was > https://github.com/apache/accumulo/pull/3051 > > Git Commit: >

Re: [VOTE] Apache Accumulo 2.1.0-rc4

2022-10-31 Thread Keith Turner
I ran a bulk random walk test for 4.5 days using commit 6099e81. The RW bulk test was modified to continually loop, rerunning itself after verify. Commit 6099e81 is almost the same as the commit for this RC, the only diff being the version number (2.1.0-SNAPSHOT vs 2.1.0). The test are running on

Re: [VOTE] Apache Accumulo 2.1.0-rc2

2022-10-26 Thread Keith Turner
-1 We probably want the changes in https://github.com/apache/accumulo/pull/3049 The changes in #3044 broke an important part of Ample. The problem does not manifest until there are multiple tables, which is why the bulk ITs were happy. On Wed, Oct 26, 2022 at 4:21 AM Christopher wrote: > > Acc

Re: [VOTE] Apache Accumulo 2.1.0-rc1

2022-10-25 Thread Keith Turner
I found a new bug while running the bulk random walk test. When the first row of a file being imported is equal to the last row of the first tablet, the bulk import will get stuck indefinitely. It took a while to track down the cause, I dug around in a heap dump of the stuck RW test process and

Re: [VOTE] 2.1.0-beta-1 or 2.1.0-alpha-1 Release

2022-05-31 Thread Keith Turner
Should we also create a new 2.1 branch along with doing an alpha or beta release? On Tue, May 31, 2022 at 7:24 AM Christopher wrote: > > I'm not sure it makes sense to vote on whether to do a vote. I'm okay > with creating a beta release later this week. When we're ready to do > that, we can just

Re: 2.1 Release TODO

2022-04-04 Thread Keith Turner
sed. > > > > On Mon, Apr 4, 2022 at 9:11 AM Mike Miller wrote: > > > > > Yes I agree, that was the goal of this email thread. I found a few more > > > tickets that should be addressed for the next release. > > > > > > Ivan - There was some wor

Re: [VOTE] Apache Accumulo 1.10.2-rc1

2022-02-11 Thread Keith Turner
+1 Looked through the diffs from 1.10.1 to 1.10.2 using the following. https://github.com/apache/accumulo/compare/rel/1.10.1...1.10.2-rc1 On Tue, Feb 8, 2022 at 10:08 AM Christopher wrote: > > Accumulo Developers, > > Please consider the following candidate for Apache Accumulo 1.10.2. > > Git C

Re: [DISCUSS] Version number of next release?

2021-10-21 Thread Keith Turner
If we were to move to 3.0, it would be nice to reach consensus on the specific reasons why we are doing it. If this were to include dropping deprecated APIs it would be nice to identify which APIs would be dropped and why before deciding. This way people can make an informed decision about suppor

Re: Metrics Replacement

2021-09-21 Thread Keith Turner
On Tue, Sep 21, 2021 at 3:45 PM Dave Marion wrote: > > There is a WIP pull request against 2.1.0-SNAPSHOT for replacing the Hadoop > Metrics2 framework with Micrometer[1]. Micrometer suggests using a naming > pattern[2] for the metrics internally where words are all lowercase > separated by a peri

Re: [accumulo] branch 1.10 updated: removes extraneous code from TabletIteratorTest

2021-08-06 Thread Keith Turner
that my two commits were the only unmerged commits before doing the sours merge. On Thu, Aug 5, 2021 at 7:30 PM Christopher wrote: > > Hey Keith, > > Just curious because of all the activity around this change in 1.10 > (the subsequent partial revert and the merge commits to main,

Re: new committer: Dominic Garguilo

2021-08-03 Thread Keith Turner
Welcome and congratulations! On Thu, Jul 29, 2021 at 1:41 PM Christopher wrote: > > The Project Management Committee (PMC) for Apache Accumulo has invited > Dominic Garguilo to become a committer and PMC member and we are > pleased to announce that they have accepted. > > Dominic has been contrib

Re: Code/Concept Review for Draft PR Accumulo #2084

2021-05-12 Thread Keith Turner
I can take a look at it tomorrow. On Wed, May 12, 2021 at 12:11 PM Jeffrey Manno wrote: > > Hello everyone, > > If anyone is free/willing to share their thoughts on my draft PR to fix an > issue with log recovery . The > general issue that I am trying

Re: External Compactions

2021-05-11 Thread Keith Turner
configured to use external compactions and the other not. Seems like #1 and #2 must be tested, thinking the results of #3 would be positive based on past experience and it would be nice to test but maybe not required. On Tue, May 11, 2021 at 10:34 AM Dave Marion wrote: > > Keith and I hav

Re: Which String deduplication option?

2021-02-08 Thread Keith Turner
Recently while running some large map reduce jobs I learned that Hadoop uses String.intern() in its RPC code (below is a link to an example on one place where Hadoop does this). I learned this because when I ran jstack on NN, RM, and/or AM that were under distress sometimes I kept seeing RPC serve

Re: New committer/PMC member: Karthick Narendran

2021-01-22 Thread Keith Turner
Welcome and congratulations! On Fri, Jan 22, 2021 at 3:13 AM Christopher wrote: > > The Project Management Committee (PMC) for Apache Accumulo > has invited Karthick Narendran to become a committer and PMC > member and we are pleased to announce that they have accepted. > > Karthick has contribut

Re: [ANNOUNCE] New PMC Chair and VP Apache Accumulo

2020-11-19 Thread Keith Turner
Congrats Ed. On Wed, Nov 18, 2020 at 3:52 PM Michael Wall wrote: > > I am happy to announce the PMC recently voted Ed Coleman as the new > PMC Chair and Vice President of Apache Accumulo. Ed was unanimously > approved at today's board meeting. > > I want to thank the community for your support o

Re: New committer/PMC member: Jeffrey Manno

2020-11-03 Thread Keith Turner
Congratulations and welcome Jeff! On Mon, Nov 2, 2020 at 11:50 AM Mike Miller wrote: > > The Project Management Committee (PMC) for Apache Accumulo has invited > Jeffrey Manno > to become a committer/PMC member and we are pleased to announce that he > accepted. > In Accumulo, Jeff has fixed multi

Re: [VOTE] Apache Accumulo 1.10.0-rc2

2020-09-01 Thread Keith Turner
On Mon, Aug 31, 2020 at 12:29 PM Keith Turner wrote: > > I started investigating this today. I quickly noticed two things. > First when I scanned the table for some of the data reported as > missing by the MR job it was there. Second the range of the supposed > missing data cov

Re: [VOTE] Apache Accumulo 1.10.0-rc2

2020-08-31 Thread Keith Turner
the vote is currently tied, and can't pass > > > as-is and this issue needed to be checked first). Mike is unavailable > > > today due to personal matters, so would be unlikely to call the vote > > > himself. However, based on this new information, I will wait

Re: [VOTE] Apache Accumulo 1.10.0-rc2

2020-08-31 Thread Keith Turner
cluster, but I think there is enough space to run the job though. The cluster has 9TB of 25TB used. I'll report back when the job finishes. On Sun, Aug 30, 2020 at 12:07 PM karthick rn wrote: > > -1 > > Keith and I tested the continuous ingest with agitation on 11 nodes (9 >

Re: [VOTE] Apache Accumulo 1.10.0-rc2

2020-08-30 Thread Keith Turner
-1 Because of the issue Karthick described w/ continuous ingest verification. We also ran random walk w/o agitation on 11 Azure VMs (10 workers) for ~40 hours. This used the same cluster configuration that Karthick mentioned. That test went well. Two of the walkers failed with the following er

Re: [VOTE] Apache Accumulo 1.10.0-rc2

2020-08-29 Thread Keith Turner
Karthick and I are working together to run random walk and continuous ingest on two clusters using this RC. After continuous ingest ran for 24 hrs we tried to start verification and ran into the following issue. I plan to vote after the test completes, but wanted to let anyone else running contin

Re: [LAZY][VOTE] change default branch to 'main'

2020-08-04 Thread Keith Turner
+1 On Mon, Aug 3, 2020 at 7:58 AM Christopher wrote: > > As a follow-up from our previous conversation on this issue, I have > already started a new branch named 'main' for my own future > contributions (that name because it appears to be the trending > alternative to 'master'), and for others wh

Re: [VOTE] "Manager" as new name for "master" service

2020-08-04 Thread Keith Turner
+1 On Mon, Aug 3, 2020 at 9:54 AM Christopher wrote: > > Based on the feedback on > https://github.com/apache/accumulo/issues/1638 , the following two > names have taken a clear lead in popularity for the new name for the > service currently known as "master": Manager and Coordinator. Of the > tw

Re: [DISCUSS] Rename Accumulo master

2020-06-23 Thread Keith Turner
On Thu, Jun 18, 2020 at 2:59 PM Kepner, Jeremy - LLSC - MITLL wrote: > > Perhaps the following approach might make sense: > > (1) Identify the changes that would need to be made. I created the following project on Github to try to organize all the different aspects of this. I am going to work on

Re: [DISCUSS] Rename Accumulo master

2020-06-18 Thread Keith Turner
On Thu, Jun 18, 2020 at 8:40 AM Ed Coleman wrote: > > For processes, would Root be too confusing? We would then have rservers and > tservers which may be more descriptive of functionality. > > This discussion is also going on the NiFi lists (and I assume elsewhere) One > thing that popped out i

Re: [DISCUSS] Rename Accumulo master

2020-06-18 Thread Keith Turner
a delay. > > -joey > On Jun 18, 2020, 9:44 AM -0500, Keith Turner , wrote: > > On Wed, Jun 17, 2020 at 3:47 PM Kepner, Jeremy - LLSC - MITLL > > wrote: > > > > > > Will it break user code? > > > > I don't think the change has to br

Re: [DISCUSS] Rename Accumulo master

2020-06-18 Thread Keith Turner
On Wed, Jun 17, 2020 at 3:47 PM Kepner, Jeremy - LLSC - MITLL wrote: > > Will it break user code? I don't think the change has to break existing code, could use deprecation for APIs. The properties could be automatically translated with a warning logged or servers could refuse to start if old pr

Re: [DISCUSS] Rename Accumulo master

2020-06-17 Thread Keith Turner
On Wed, Jun 17, 2020 at 5:23 PM Michael Wall wrote: > > On Wed, Jun 17, 2020 at 3:44 PM Brian Loss wrote: > > > I agree—things have changed in the world since this was last discussed, > > and I think it’s time to make the change even though it will be disruptive. > > I support changing both the m

Re: [DISCUSS] Rename Accumulo master

2020-06-17 Thread Keith Turner
I am in favor of making a change and would be happy to help. On Wed, Jun 17, 2020 at 3:07 PM Billie Rinaldi wrote: > > Hi Accumulo folks! I would like to start a discussion about renaming the > Accumulo master. Previous discussions were held a few years ago [1]. Some > things have changed since w

Re: Slack chat Wednesday to discuss compaction PR

2020-05-13 Thread Keith Turner
ack or notes following the > discussion. Can someone please take notes and share? > Thank you! > > On Mon, May 11, 2020, 10:35 AM Keith Turner wrote: > > > Christopher suggested a chat on slack to discuss the following. For > > anyone who is interested in joining, would Wed

Re: Slack chat Wednesday to discuss compaction PR

2020-05-13 Thread Keith Turner
I created the following very quickly for this discussion. I should have created it sooner, but I just realized it would be useful an hour ago. Its rough, but it has almost everything I wanted to mention. https://gist.github.com/keith-turner/f6f7ca661d88b935b74f57dbdeeb07d3 Keith On Tue, May

Slack chat Wednesday to discuss compaction PR

2020-05-11 Thread Keith Turner
Christopher suggested a chat on slack to discuss the following. For anyone who is interested in joining, would Wed 5/13 at 1300 EST work? If not, we can find another time once we know who is interested. https://github.com/apache/accumulo/pull/1605 Keith

New committer/PMC member: Arvind Shyamsundar

2020-04-16 Thread Keith Turner
The Apache Accumulo Project Management Committee (PMC) invited Arvind Shyamsundar to become a committer/PMC member and we are pleased to announce that he accepted. Arvind has fixed multiple bugs in Accumulo recently and added support for running multiple tservers per node to the scripts. Welcome

Re: Video chat this week

2020-03-24 Thread Keith Turner
Slack has a call feature, but I have never used it. I can do 11 Wed or Thur this week. On Mon, Mar 23, 2020 at 10:14 PM Christopher wrote: > > Accumulo Devs, > > I happen to have some extra time on my hands this week at home, and > thought it might be nice to set up a video meeting to chat with

Re: [REPORT][DRAFT] Apache Accumulo January 2020

2020-01-08 Thread Keith Turner
This looks great. Your project activity section is packed with useful information and really short. On Tue, Jan 7, 2020 at 5:51 PM Michael Wall wrote: > > The Apache Accumulo PMC decided to draft its quarterly board > reports on the dev list. Here is a draft of our report which is due > tomorrow

Re: Issues building 1.9-snapshot and Hadoop 3.1.3

2019-11-21 Thread Keith Turner
Can you check that your accumulo-site.xml only adds $HADOOP_PREFIX/share/hadoop/client/[^.].*.jar for hadoop deps for the setting general.classpaths? Not completely sure, but I think this will use the hadoop shaded jars. Do not want the non-shaded hadoop jars like $HADOOP_PREFIX/share/hadoop/comm

Re: Issues building 1.9-snapshot and Hadoop 3.1.3

2019-11-21 Thread Keith Turner
Another possible path to solve this is with a different classpath and dependency for hadoop 3. In Accumulo 2.0 we depend on the hadoop client shaded jar, which has its own shaded and relocated version of Guava internally. Using the Hadoop shaded jar would solve this problem. Not sure what that c

Re: Issues building 1.9-snapshot and Hadoop 3.1.3

2019-11-21 Thread Keith Turner
I looked at the history[1] of the hadoop project pom and found that HADOOP-16213[2] seems to be the cause of this change. So it seems like we need to bump the guava version if we want to work with newer versions of Hadoop 3. One of the goals of 1.9 (and I think 1.10) is to be a bridge version betw

Re: Multiple instance volumes

2019-11-20 Thread Keith Turner
On Tue, Nov 19, 2019 at 6:45 PM karthick rn wrote: > > Hi, > > When provisioning multiple volumes, for ex. HDFS & Azure Data Lake storage, > it would be good to choose which volume we want the system tables like > metadata, root, replication tables to be created. Currently, Accumulo > randomly cre

Re: Reminder: Accumulo Hack Day next Tuesday, November 12, 2019

2019-11-08 Thread Keith Turner
That is a good idea. I took a crack at creating a few. https://github.com/apache/accumulo/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22 On Fri, Nov 8, 2019 at 4:08 AM Nikhil Manchanda wrote: > > > Hi folks: > > A few of us who are not in the area intend to mirror this on the > we

Re: [VOTE] Proposal to release version 1.10

2019-11-04 Thread Keith Turner
On Fri, Nov 1, 2019 at 12:46 PM Sean Busbey wrote: > > Correct, it is up to every user of SemVer to define the public API and > AFAIK we have chosen not to include things like the Java version > needed to run Accumulo in ours[1]. > > That doesn't mean it's not crappy to our downstream users to do

Re: [VOTE] Proposal to release version 1.10

2019-11-04 Thread Keith Turner
On Fri, Nov 1, 2019 at 4:04 PM Christopher wrote: > > My understanding is that this proposal doesn't include any other > changes other than bumping the required java version in the POM, but > it does lay the groundwork for: > > * Can more easily backport patches to 1.x that were originally written

Re: [VOTE] Proposal to release version 1.10

2019-11-04 Thread Keith Turner
. This seems like it could take a lot of time to do well. I would personally rather spend that time on 2.1.0 features. Keith On Thu, Oct 31, 2019 at 8:37 PM Ed Coleman wrote: > > As suggested in the LTS discussion ([LAZY][VOTE] A basic, but concrete, LTS > proposal), I'm breaking th

Re: [LAZY][VOTE] A basic, but concrete, LTS proposal

2019-11-04 Thread Keith Turner
ke to see the change made with a prominent mention in the release notes. > > Thanks, > Nikhil > > > On Thu, Oct 31 2019, Keith Turner wrote: > > +1 > > > > I am in favor of the LTS idea because I think it makes it more > > efficient for everyone to easily come to

Re: [LAZY][VOTE] A basic, but concrete, LTS proposal

2019-10-31 Thread Keith Turner
On Thu, Oct 31, 2019 at 10:28 AM Josh Elser wrote: > > Seems fine to me. > > Any expectations on how upgrades work within an LTS release? How about > across LTS releases? The behavior I would like to see is that for any release you can always upgrade from the previous release. For LTS releases y

Re: [LAZY][VOTE] A basic, but concrete, LTS proposal

2019-10-31 Thread Keith Turner
+1 I am in favor of the LTS idea because I think it makes it more efficient for everyone to easily come together and focus their efforts in the same direction for the benefit of everyone. I think this is a really good starting plan for LTS. Overtime we will probably find issues with the plan and

Re: Introduction: Arvind Shyamsundar

2019-10-31 Thread Keith Turner
Welcome Arvind. I would be interested in reading about your experiences with Accumulo on Azure. On Mon, Oct 28, 2019 at 7:21 PM Arvind Shyamsundar wrote: > > Hello! > I'm Arvind Shyamsundar and I work at Microsoft. I'm located in Redmond, WA. > Along with a few colleagues, we have been working o

Re: Help with PR 1333

2019-10-16 Thread Keith Turner
I can take a look at it. On Wed, Oct 16, 2019 at 9:37 AM David Mollitor wrote: > > Hello Gang, > > I work with a customer that uses Accumulo. My full-time position is not in > development, so while I'm curious to look into Accumulo a bit, I can't make > substantial contributions at this time. H

Re: adding HDFS erasure coding awareness to Accumulo

2019-09-19 Thread Keith Turner
On Thu, Sep 5, 2019 at 7:25 PM Seidl, Ed wrote: > > Hi All. > > Following up on a conversation with Christopher on the slack channel, what > follows is a modest proposal to make hosting Accumulo tables on erasure coded > HDFS directories easier. This post turned out to be pretty long…if you > a

Re: Accumulo 2.0 Proxy

2019-09-11 Thread Keith Turner
Let me know if you need assistance with anything. On Mon, Sep 9, 2019 at 9:38 AM Roberts, Geoffry [USA] wrote: > > All, > > As suggested, I am offering to help with the proxy project. Full disclosure: > What I need ultimately is to get Julia 1.2 talking to Accumulo 2.0 > efficiently. It is my

Re: Algorithm or Data Structure Used for Selecting Next Record when Merging rfiles

2019-08-12 Thread Keith Turner
MultiIterator[1] which is a HeapIterator[2] is used to merge rfiles. [1]: https://github.com/apache/accumulo/blob/c99a22387bfe3f2de93c6a27727d7b38054a699b/core/src/main/java/org/apache/accumulo/core/iterators/system/MultiIterator.java [2]: https://github.com/apache/accumulo/blob/c99a22387bfe3f2d

Re: New committer/PMC member: Holly Keebler

2019-08-09 Thread Keith Turner
Welcome Holly! On Fri, Aug 9, 2019 at 3:21 PM Christopher wrote: > > Devs, > > The Project Management Committee (PMC) for Apache Accumulo > has invited Holly Keebler to become a committer and we are pleased > to announce that she has accepted. > > Holly started with contributions fixing some smal

Re: New committer/PMC member: Adam Lerman

2019-08-08 Thread Keith Turner
ral pull requests in the last year to enhance > features in the accumulo-shell and the VolumeChooser in 1.9 and 2.0, > as well as other user experience enhancements. Most recently, he > worked with Keith Turner to incorporate some additional logging to > help administrators find busy tabl

Re: [DRAFT][ANNOUNCE] Apache Accumulo 2.0.0

2019-08-06 Thread Keith Turner
LGTM On Mon, Aug 5, 2019 at 8:01 PM Mike Miller wrote: > > The Apache Accumulo project is pleased to announce the release of > Apache Accumulo 2.0.0! Apache Accumulo 2.0.0 contains many new > features, bug fixes, performance improvements and redesigned > components. Hadoop 3 and Java 8 are requ

Re: [DISCUSS] Release schedule

2019-08-05 Thread Keith Turner
e when a scheduled release > > is approaching? > > What happens if a scheduled release is approaching, but the community > > is focused on critical patches for a previous release? > > > > On Fri, Aug 2, 2019 at 12:43 PM Keith Turner wrote: > > > > > &g

Re: [VOTE] Accumulo2 Maven Plugin 1.0.0-rc1

2019-08-05 Thread Keith Turner
+1 Verified signatures Was able to build Fluo against this RC On Sun, Aug 4, 2019 at 6:50 PM Christopher wrote: > > Accumulo Developers, > > Please consider the following candidate for Accumulo2 Maven Plugin 1.0.0. > > Git Commit: > c9ec8487898819edf3d70838abfe416077d27360 > Branch: > 1.

Re: [DISCUSS] Release schedule

2019-08-02 Thread Keith Turner
I am in favor of this. One possible way to move this forward would be to write the following two proposals. * Transition proposal : lays out how we will transition to this plan. For example it could outline that 1.9 will become the first LTS. * Release schedule proposal : lays out how the A

Re: [VOTE] Apache Accumulo 2.0.0-rc2

2019-08-01 Thread Keith Turner
+1 Verified sigs of tar balls. Built Fluo against the RC Ran some quick bulk import CI test and garbage collection simultation test. On Mon, Jul 29, 2019 at 4:10 PM Christopher wrote: > > Accumulo Developers, > > Please consider the following candidate for Apache Accumulo 2.0.0. > Changes since

Re: Encouraging introductions for new contributors

2019-07-31 Thread Keith Turner
On Wed, Jul 31, 2019 at 11:39 AM Christopher wrote: > > Accumulo Devs, > > It occurs to me that with the increased use of GitHub PRs, we have a > number of contributors which we do not have their contact email > addresses, and who have never participated on the mailing lists. > Perhaps they are no

Re: [RESULT][VOTE] Apache Accumulo 2.0.0-rc1

2019-07-10 Thread Keith Turner
On Sat, Jun 22, 2019 at 10:09 AM Jeffrey Manno < > > jeffreymann...@gmail.com> > > > > wrote: > > > > > > > > > Just giving an update on where my testing is at in case I forget. > > > > > > > > > > So far haven&#x

Re: [VOTE] Apache Accumulo 2.0.0-rc1

2019-06-21 Thread Keith Turner
-1 mainly because of #1231. Also would really like to be able to validate external projects can build against RC, but was unable to. I have been doing a lot testing this past week on EC2 using 2.0.0-SNAPSHOT built from 2ac7c7e. I successfully ran continuous ingest with agitation and multiple in

Re: [TEST][VOTE] Apache Accumulo 2.0.0-rc0

2019-06-17 Thread Keith Turner
I have been running a new test[1] against this rc and things have been going well so far. I have only been running on a single node. I plan to run different test on a EC2 cluster this week using this RC I tried using Java 11 to run the test, but yarn would not start (using Apache hadoop 3.2.0).

Re: native snappy library not available: SnappyCompressor has not been loaded.

2019-05-31 Thread Keith Turner
Maybe best to just not set it. Then it will default to gz which is always available. On Fri, May 31, 2019 at 4:51 PM Jeffrey Manno wrote: > Out of available ones, which do you think is best? There is zStandard and a > few others I can't recall. > > On Fri, May 31, 2019, 4:4

Re: native snappy library not available: SnappyCompressor has not been loaded.

2019-05-31 Thread Keith Turner
On Thu, May 30, 2019 at 5:33 PM Jeffrey Manno wrote: > That makes sense. Do you think we should update accumulo-testing? Currently > I think that makes sense to update the defaults in accumulo-testing to not use snappy. > 'cingest createtable' function > utilizes that compression method which

Re: [RESULT][VOTE] Apache Accumulo 1.9.3-rc3

2019-04-11 Thread Keith Turner
On Wed, Apr 10, 2019 at 8:05 PM Christopher wrote: > > Completed release tasks: > > In git: > * Merge 1.9.3-rc3-next into 1.9, and 1.9 into master to preserve history > * Create GPG-signed release tag rel/1.9.3 > * Clean up RC branches > > In dist-SVN: > * Upload release artifacts, signatures, and

  1   2   3   4   5   6   7   8   9   10   >