I found a new bug while running the bulk random walk test.   When the
first row of a file being imported is equal to the last row of the
first tablet, the bulk import will get stuck indefinitely.  It took a
while to track down the cause, I dug around in a heap dump of the
stuck RW test process and found what row was causing it to get stuck.
After finding that I was able to create an IT that reproduces the
scenario and it gets stuck too.  I can also see the problem in the
code now.  This is the new bulk import code, have not looked into this
situation with the old bulk import code.  I was running these tests
against an older commit than this RC, but the bug exists in this RC.
I am about to start working on a fix for this issue.  Is this worth a
new RC?

There are probably other situations that can cause this bug, but the
bulk import code does read ahead of metadata tablets so that covers up
some but not all other cases.  The code could possibly get stuck on
any file where the first row in the file is equal to some tablets last
row.

On Mon, Oct 24, 2022 at 10:13 PM Christopher <ctubb...@apache.org> wrote:
>
> Accumulo Developers,
>
> Please consider the following candidate for Apache Accumulo 2.1.0.
>
> Git Commit:
>     92b07213f5e3e7f77be56f0866316b2f0eebe191
> Branch:
>     2.1.0-rc1
>
> If this vote passes, a gpg-signed tag will be created using:
>     git tag -f -s -m 'Apache Accumulo 2.1.0' rel/2.1.0 \
>     92b07213f5e3e7f77be56f0866316b2f0eebe191
>
> Staging repo:
> https://repository.apache.org/content/repositories/orgapacheaccumulo-1094
> Source (official release artifact):
> https://repository.apache.org/content/repositories/orgapacheaccumulo-1094/org/apache/accumulo/accumulo/2.1.0/accumulo-2.1.0-src.tar.gz
> Binary:
> https://repository.apache.org/content/repositories/orgapacheaccumulo-1094/org/apache/accumulo/accumulo/2.1.0/accumulo-2.1.0-bin.tar.gz
>
> Append ".asc" to download the cryptographic signature for a given artifact.
> (You can also append ".sha1" or ".md5" instead in order to verify the
> checksums
> generated by Maven to verify the integrity of the Nexus repository staging
> area.)
>
> Signing keys are available at https://www.apache.org/dist/accumulo/KEYS
> (Expected fingerprint: 8CC4F8A2B29C2B040F2B835D6F0CDAE700B6899D)
>
> In addition to the tarballs and their signatures, the following checksum
> files will be added to the dist/release SVN area after release:
> accumulo-2.1.0-src.tar.gz.sha512 will contain:
> SHA512 (accumulo-2.1.0-src.tar.gz) =
> 4ce9f2cccd1f126eaa46c1c504b56255c18add04ab655821e1dc64ea74a1954f2124a88e75d3223792184eaa0b49c13ac6f00f563bf94069c79b45fc6a0fd5c6
> accumulo-2.1.0-bin.tar.gz.sha512 will contain:
> SHA512 (accumulo-2.1.0-bin.tar.gz) =
> 9b619fa56f5d3532c226aa0fa0d88a0ee53692f890c047f7e0436f6e6814397562201302cfbfb2531f5705750f3576480bc522a8ba662c55c4242def801351dd
>
> Release notes (in progress) can be found at:
> https://accumulo.staged.apache.org/release/accumulo-2.1.0
>
> Release testing instructions:
> https://accumulo.apache.org/contributor/verifying-release
>
> Please vote one of:
> [ ] +1 - I have verified and accept...
> [ ] +0 - I have reservations, but not strong enough to vote against...
> [ ] -1 - Because..., I do not accept...
> ... these artifacts as the 2.1.0 release of Apache Accumulo.
>
> This vote will remain open until at least Thu Oct 27 09:30:00 PM UTC 2022.
> (Thu Oct 27 05:30:00 PM EDT 2022 / Thu Oct 27 02:30:00 PM PDT 2022)
> Voting can continue after this deadline until the release manager
> sends an email ending the vote.
>
> Thanks!
>
> P.S. Hint: download the whole staging repo with
>     wget -erobots=off -r -l inf -np -nH \
>
> https://repository.apache.org/content/repositories/orgapacheaccumulo-1094/
>     # note the trailing slash is needed

Reply via email to