cshannon commented on code in PR #5071:
URL: https://github.com/apache/accumulo/pull/5071#discussion_r1845234382
##########
server/manager/src/main/java/org/apache/accumulo/manager/split/Splitter.java:
##########
@@ -191,6 +186,29 @@ public FileInfo getCachedFileInfo(TableId tableId,
TabletFile tabletFile) {
}
public void initiateSplit(SeedSplitTask seedSplitTask) {
- splitExecutor.execute(seedSplitTask);
+ // Want to avoid queuing the same tablet multiple times, it would not
cause bugs but would waste
+ // work. Use the metadata row to identify a tablet because the KeyExtent
also includes the prev
+ // end row which may change when splits happen. The metaRow is
conceptually tableId+endRow and
+ // that does not change for a split.
+ Text metaRow = seedSplitTask.getExtent().toMetaRow();
+ int qsize = queuedTablets.size();
+ if (qsize < 10_000 && queuedTablets.add(metaRow)) {
Review Comment:
This is not something that needs to be changed now as it was already hard
coded, but I am wondering if there would ever be a use case requiring tuning
the size of max number of queued tablets that would make creating a property
for this worthwhile
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]