[GitHub] milleruntime closed pull request #6: ACCUMULO-4718 Add CLASSPATH to accumulo-testing script

2017-10-17 Thread git
milleruntime closed pull request #6: ACCUMULO-4718 Add CLASSPATH to 
accumulo-testing script
URL: https://github.com/apache/accumulo-testing/pull/6
 
 
   

This is a PR merged from a forked repository.
As GitHub hides the original diff on merge, it is displayed below for
the sake of provenance:

As this is a foreign pull request (from a fork), the diff is supplied
below (as it won't show otherwise due to GitHub magic):

diff --git a/bin/accumulo-testing b/bin/accumulo-testing
index 84a1d40..3412564 100755
--- a/bin/accumulo-testing
+++ b/bin/accumulo-testing
@@ -70,6 +70,7 @@ fi
 
 function build_shade_jar() {
   export 
at_shaded_jar="$at_home/core/target/accumulo-testing-core-$at_version-shaded.jar"
+  export CLASSPATH="$at_shaded_jar:$CLASSPATH"
   if [ ! -f "$at_shaded_jar" ]; then
 echo "Building $at_shaded_jar"
 cd "$at_home" || exit 1
@@ -185,7 +186,7 @@ 
randomwalk_main="org.apache.accumulo.testing.core.randomwalk.Framework"
 case "$1" in
 ci-createtable)
   build_shade_jar
-  java -Dlog4j.configuration="file:$log4j_config" -cp "$at_shaded_jar" 
org.apache.accumulo.testing.core.continuous.CreateTable "$at_props"
+  java -Dlog4j.configuration="file:$log4j_config" 
org.apache.accumulo.testing.core.continuous.CreateTable "$at_props"
   ;;
 ci-local)
   if [ -z "$2" ]; then
@@ -195,7 +196,7 @@ ci-local)
   fi
   determine_app_main "$2"
   build_shade_jar
-  java -Dlog4j.configuration="file:$log4j_config" -cp "$at_shaded_jar" 
"$ci_main" "$at_props"
+  java -Dlog4j.configuration="file:$log4j_config" "$ci_main" "$at_props"
   ;;
 ci-yarn)
   if [ -z "$2" ]; then
@@ -229,7 +230,7 @@ rw-local)
 exit 1
   fi
   build_shade_jar
-  java -Dlog4j.configuration="file:$log4j_config" -cp "$at_shaded_jar" 
"$randomwalk_main" "$at_props" "$2"
+  java -Dlog4j.configuration="file:$log4j_config" "$randomwalk_main" 
"$at_props" "$2"
   ;;
 rw-yarn)
   if [ -z "$2" ]; then
diff --git a/conf/accumulo-testing-env.sh.example 
b/conf/accumulo-testing-env.sh.example
index e03c973..677c23d 100644
--- a/conf/accumulo-testing-env.sh.example
+++ b/conf/accumulo-testing-env.sh.example
@@ -20,6 +20,8 @@ test -z "$HADOOP_PREFIX" && export 
HADOOP_PREFIX=/path/to/hadoop
 export ACCUMULO_VERSION=`accumulo version`
 export HADOOP_VERSION=`hadoop version | head -n1 | awk '{print $2}'`
 export ZOOKEEPER_VERSION=3.4.9
+# Make sure Hadoop configuration directory is on the classpath
+export CLASSPATH=$HADOOP_PREFIX/etc/hadoop
 
 # Agitator
 # 


 

----
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 opened a new pull request #309: ACCUMULO-4722 Use Objects.equals in Pairs class"

2017-10-15 Thread git
jmark99 opened a new pull request #309: ACCUMULO-4722 Use Objects.equals in 
Pairs class"
URL: https://github.com/apache/accumulo/pull/309
 
 
   ACCUMULO-4722 Use Objects.equals() in Pair equals method
   
   Removed private equals method in Pairs class and instead made use of the
   java.util.Objects.equals method. Updated the test class to verify
   several variations of the equals method.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] milleruntime commented on issue #6: ACCUMULO-4718 Add HADOOP_CONF_DIR to rw-local cp

2017-10-13 Thread git
milleruntime commented on issue #6: ACCUMULO-4718 Add HADOOP_CONF_DIR to 
rw-local cp
URL: https://github.com/apache/accumulo-testing/pull/6#issuecomment-336525460
 
 
   @mikewalch As far as I can tell, I believe my changes in 52baec5 will also 
satisfy ACCUMULO-4723
   
   It seemed to run fine with the quick test I ran using: accumulo-testing 
rw-yarn 1 Bulk.xml
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #308: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-13 Thread git
keith-turner commented on issue #308: ACCUMULO-4170 Clarify ClientConfiguration 
javadocs
URL: https://github.com/apache/accumulo/pull/308#issuecomment-336489706
 
 
   This is closed by 63f5713708900e50b2177405be699cd4d07e97a0
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner closed pull request #308: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-13 Thread git
keith-turner closed pull request #308: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/308
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-13 Thread git
keith-turner commented on issue #306: ACCUMULO-4170 Clarify ClientConfiguration 
javadocs
URL: https://github.com/apache/accumulo/pull/306#issuecomment-336489561
 
 
   @jmark99 I squashed+merged the changes.   There was one diff, I ran mvn 
compile and it reformatted some stuff.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner closed pull request #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-13 Thread git
keith-turner closed pull request #306: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/306
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner closed pull request #307: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-13 Thread git
keith-turner closed pull request #307: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/307
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on a change in pull request #6: ACCUMULO-4718 Add HADOOP_CONF_DIR to rw-local cp

2017-10-12 Thread git
ctubbsii commented on a change in pull request #6: ACCUMULO-4718 Add 
HADOOP_CONF_DIR to rw-local cp
URL: https://github.com/apache/accumulo-testing/pull/6#discussion_r144427141
 
 

 ##
 File path: bin/accumulo-testing
 ##
 @@ -229,7 +229,7 @@ rw-local)
 exit 1
   fi
   build_shade_jar
-  java -Dlog4j.configuration="file:$log4j_config" -cp "$at_shaded_jar" 
"$randomwalk_main" "$at_props" "$2"
+  java -Dlog4j.configuration="file:$log4j_config" -cp 
"$at_shaded_jar:$HADOOP_CONF_DIR" "$randomwalk_main" "$at_props" "$2"
 
 Review comment:
   One way we can decouple this from specific environment variables (which may 
or may not exist), is to simply remove the `-cp` section, and set `CLASSPATH` 
above the call to `java`.
   Something like `export CLASSPATH="$at_shaded_jar:$CLASSPATH"`
   
   That way, the caller can simply set `CLASSPATH=/path/to/hadoop/confdir` 
prior to running the `accumulo-testing` command.
   
   We could also add a sanity check prior to running java to make sure that the 
`$CLASSPATH` contains a directory with `core-site.xml` and/or `hdfs-site.xml` 
(or whatever the Hadoop client configuration uses by default).
 
----
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
jmark99 commented on issue #306: ACCUMULO-4170 Clarify ClientConfiguration 
javadocs
URL: https://github.com/apache/accumulo/pull/306#issuecomment-336292989
 
 
   Yes, @keith-turner , #307 has the correct documentation for 1.8. This 
documentation is also valid for the current master line of development. These 
was a slight modification to the loadDefaults() method between 1.7 and 1.8 that 
necessitated a small change to the documentation.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
keith-turner commented on issue #306: ACCUMULO-4170 Clarify ClientConfiguration 
javadocs
URL: https://github.com/apache/accumulo/pull/306#issuecomment-336292123
 
 
   I can merge this.  @jmark99  does #307 currently have the changes you want 
for 1.8?  If so I will look at that when merging forward.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
jmark99 commented on issue #306: ACCUMULO-4170 Clarify ClientConfiguration 
javadocs
URL: https://github.com/apache/accumulo/pull/306#issuecomment-336289581
 
 
   Thanks @keith-turner. I've rendered the javadocs for all versions on my 
local machine and they render correctly. 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on a change in pull request #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
jmark99 commented on a change in pull request #306: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/306#discussion_r144411823
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/client/ClientConfiguration.java
 ##
 @@ -199,15 +199,23 @@ public ClientConfiguration(Configuration... configs) {
   }
 
   /**
-   * Attempts to load a configuration file from the system. Uses the 
"ACCUMULO_CLIENT_CONF_PATH" environment variable, split on File.pathSeparator, 
for a list
-   * of target files. If not set, uses the following in this order- 
~/.accumulo/config $ACCUMULO_CONF_DIR/client.conf -OR- 
$ACCUMULO_HOME/conf/client.conf
-   * (depending on whether $ACCUMULO_CONF_DIR is set) /etc/accumulo/client.conf
+   * Attempts to load a configuration file from the system using the default 
search paths. Uses the ACCUMULO_CLIENT_CONF_PATH environment variable,
+   * split on File.pathSeparator, for a list of target files.
+   * 
+   * If ACCUMULO_CLIENT_CONF_PATH is not set, uses the following in 
this order:
+   * 
+   * ~/.accumulo/config
+   * either:
+   * 
+   * $ACCUMULO_CONF_DIR/client.conf, if 
$ACCUMULO_CONF_DIR is defined.
+   * $ACCUMULO_HOME/conf/client.conf, otherwise.
+   * 
+   * /etc/accumulo/client.conf
+   * /etc/accumulo/conf/client.conf
+   * 
+   * 
*
-   * A client configuration will then be read from each location using 
PropertiesConfiguration to construct a configuration. That means the latest 
item will be
-   * the one in the configuration.
-   *
-   * @see PropertiesConfiguration
-   * @see File#pathSeparator
+   * @since 1.6.0
 
 Review comment:
   Thanks for the catch! I realized I had inadvertently removed a line of 
documentation when I was performing the previous revision.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
keith-turner commented on a change in pull request #306: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/306#discussion_r144407156
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/client/ClientConfiguration.java
 ##
 @@ -199,15 +199,23 @@ public ClientConfiguration(Configuration... configs) {
   }
 
   /**
-   * Attempts to load a configuration file from the system. Uses the 
"ACCUMULO_CLIENT_CONF_PATH" environment variable, split on File.pathSeparator, 
for a list
-   * of target files. If not set, uses the following in this order- 
~/.accumulo/config $ACCUMULO_CONF_DIR/client.conf -OR- 
$ACCUMULO_HOME/conf/client.conf
-   * (depending on whether $ACCUMULO_CONF_DIR is set) /etc/accumulo/client.conf
+   * Attempts to load a configuration file from the system using the default 
search paths. Uses the ACCUMULO_CLIENT_CONF_PATH environment variable,
+   * split on File.pathSeparator, for a list of target files.
+   * 
+   * If ACCUMULO_CLIENT_CONF_PATH is not set, uses the following in 
this order:
+   * 
+   * ~/.accumulo/config
+   * either:
+   * 
+   * $ACCUMULO_CONF_DIR/client.conf, if 
$ACCUMULO_CONF_DIR is defined.
+   * $ACCUMULO_HOME/conf/client.conf, otherwise.
+   * 
+   * /etc/accumulo/client.conf
+   * /etc/accumulo/conf/client.conf
+   * 
+   * 
*
-   * A client configuration will then be read from each location using 
PropertiesConfiguration to construct a configuration. That means the latest 
item will be
-   * the one in the configuration.
-   *
-   * @see PropertiesConfiguration
-   * @see File#pathSeparator
+   * @since 1.6.0
 
 Review comment:
   This since tag is not needed because its the same as the class level since 
tag.  Methods inherit the since tag from the class.   Only need to set it on 
method if it differs from class since tag.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on issue #220: ACCUMULO-4591 replication latency metrics added

2017-10-12 Thread git
ctubbsii commented on issue #220: ACCUMULO-4591 replication latency metrics 
added
URL: https://github.com/apache/accumulo/pull/220#issuecomment-336260108
 
 
   Closing this, as it was superseded by #305
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii closed pull request #220: ACCUMULO-4591 replication latency metrics added

2017-10-12 Thread git
ctubbsii closed pull request #220: ACCUMULO-4591 replication latency metrics 
added
URL: https://github.com/apache/accumulo/pull/220
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] milleruntime opened a new pull request #6: ACCUMULO-4718 Add HADOOP_CONF_DIR to rw-local cp

2017-10-12 Thread git
milleruntime opened a new pull request #6: ACCUMULO-4718 Add HADOOP_CONF_DIR to 
rw-local cp
URL: https://github.com/apache/accumulo-testing/pull/6
 
 
   The test "rw-local Bulk.xml" was throwing errors because it couldn't find 
the HDFS containing the files to import.  The hadoop conf wasn't on the 
classpath so it was defaulting to the disk "file:///" file system.  This 
doesn't seem to affect the rest of the tests.  Not sure if this is the best 
place to include HADOOP_CONF_DIR...
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] adamjshook commented on issue #220: ACCUMULO-4591 replication latency metrics added

2017-10-12 Thread git
adamjshook commented on issue #220: ACCUMULO-4591 replication latency metrics 
added
URL: https://github.com/apache/accumulo/pull/220#issuecomment-336228003
 
 
   This PR can be closed with #305 being merged.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
keith-turner commented on a change in pull request #306: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/306#discussion_r144368491
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/client/ClientConfiguration.java
 ##
 @@ -41,7 +41,22 @@
 
 /**
  * Contains a list of property keys recognized by the Accumulo client and 
convenience methods for setting them.
- *
+ * 
+ * When loading a configuration file from the system, Accumulo uses the 
ACCUMULO_CLIENT_CONF_PATH environment variable,
 
 Review comment:
   >  One for the 1.7 change and another for the 1.8 change and then the 1.8 
change will be merged forward to 2.0?
   
   So you want the changes in 1.8 to be slightly different?  If so it would be 
nice to end up with the following on 1.8 branch.
   
* Your commit with 1.8 changes
* Merge commit with your changes from 1.7 
* Your 1.7 commit
   
   Lets go for the easiest way to get to those changes.  Lets get the changes 
in 1.7 squared away and then we can get the 1.8 changes figured out.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on a change in pull request #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
jmark99 commented on a change in pull request #306: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/306#discussion_r144362198
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/client/ClientConfiguration.java
 ##
 @@ -41,7 +41,22 @@
 
 /**
  * Contains a list of property keys recognized by the Accumulo client and 
convenience methods for setting them.
- *
+ * 
+ * When loading a configuration file from the system, Accumulo uses the 
ACCUMULO_CLIENT_CONF_PATH environment variable,
 
 Review comment:
   Sounds good. I'll update to make the adjustments.  So if I understand you 
correctly I will only need 2 PR's.  One for the 1.7 change and another for the 
1.8 change and then the 1.8 change will be merged forward to 2.0? Thanks.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
keith-turner commented on a change in pull request #306: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/306#discussion_r144360720
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/client/ClientConfiguration.java
 ##
 @@ -41,7 +41,22 @@
 
 /**
  * Contains a list of property keys recognized by the Accumulo client and 
convenience methods for setting them.
- *
+ * 
+ * When loading a configuration file from the system, Accumulo uses the 
ACCUMULO_CLIENT_CONF_PATH environment variable,
 
 Review comment:
   >  It can be moved back into the method call only if you think that would be 
more appropriate.
   
   Personally I would put the docs on the loadDefault method and then any 
constuctors that call load default I would state in the constuctors javadoc 
that it follows the search behavior described in loadDefatult using s javadoc 
link.  
   
   If the docs are at the class level, I think the class level docs would need 
to detail which constuctors and methods do and/or do not follow the search 
behavior.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #304: ACCUMULO-4716 Don't cache blks over max array size

2017-10-12 Thread git
jmark99 commented on issue #304: ACCUMULO-4716 Don't cache blks over max array 
size
URL: https://github.com/apache/accumulo/pull/304#issuecomment-336212445
 
 
   Thanks, Keith.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #304: ACCUMULO-4716 Don't cache blks over max array size

2017-10-12 Thread git
keith-turner commented on issue #304: ACCUMULO-4716 Don't cache blks over max 
array size
URL: https://github.com/apache/accumulo/pull/304#issuecomment-336211364
 
 
   @jmark99  thanks for the contribution.  I squashed these commits and cherry 
picked them back to 1.7 in commit 6ead4740dbc24485b51d73d0a006d44bb2f52833.  I 
then merged the changes forward in commits 
ae3aa10027cc0926cd5b319d31bc3fd005ffb50f and 
47d13beff82e547019faf551e9bdc76c4429e576.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner closed pull request #304: ACCUMULO-4716 Don't cache blks over max array size

2017-10-12 Thread git
keith-turner closed pull request #304: ACCUMULO-4716 Don't cache blks over max 
array size
URL: https://github.com/apache/accumulo/pull/304
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on a change in pull request #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
jmark99 commented on a change in pull request #306: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/306#discussion_r144357873
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/client/ClientConfiguration.java
 ##
 @@ -41,7 +41,22 @@
 
 /**
  * Contains a list of property keys recognized by the Accumulo client and 
convenience methods for setting them.
- *
+ * 
+ * When loading a configuration file from the system, Accumulo uses the 
ACCUMULO_CLIENT_CONF_PATH environment variable,
 
 Review comment:
   Not all the constructors do the search. The described behavior occurs when 
load configurations use a default load path. Previously the path order was 
described only in the method. I moved it up to class level based upon the 
suggestion given by @madrob in the ticket comment.  It can be moved back into 
the method call only if you think that would be more appropriate.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #307: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
jmark99 commented on issue #307: ACCUMULO-4170 Clarify ClientConfiguration 
javadocs
URL: https://github.com/apache/accumulo/pull/307#issuecomment-336207583
 
 
   The change for 1.8 is slightly different than for 1.7. Although the 1.8 and 
2.0 changes are identical so the #307 could be merged forward.  I wasn't sure 
how that situation was handled so I ended up creating  a pull request for 
master as well.  
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] asfgit closed pull request #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-12 Thread git
asfgit closed pull request #305: [ACCUMULO-4591] Add replication latency metrics
URL: https://github.com/apache/accumulo/pull/305
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #307: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
keith-turner commented on issue #307: ACCUMULO-4170 Clarify ClientConfiguration 
javadocs
URL: https://github.com/apache/accumulo/pull/307#issuecomment-336189538
 
 
   Is this the same change as #306?  We usually make the change in the oldest 
branch and then merge it forward.  
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
keith-turner commented on a change in pull request #306: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/306#discussion_r144339507
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/client/ClientConfiguration.java
 ##
 @@ -41,7 +41,22 @@
 
 /**
  * Contains a list of property keys recognized by the Accumulo client and 
convenience methods for setting them.
- *
+ * 
+ * When loading a configuration file from the system, Accumulo uses the 
ACCUMULO_CLIENT_CONF_PATH environment variable,
 
 Review comment:
   Does the behavior described here only happen when calling `loadDefault()`?  
Or do all of the constuctors do this search?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 opened a new pull request #308: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
jmark99 opened a new pull request #308: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/308
 
 
   ACCUMULO-4170 Clarify ClientConfiguration javadocs
   
   Updated the javadoc information with the ClientConfiguration class.
   Specifically reworked the default search path information to be
   displayed as a list rather than inline, thereby easing readability.
   Additionally moved the search path list to the class level vice the
   method level.
   
   
![classv2](https://user-images.githubusercontent.com/31573345/31499040-9479f3de-af31-11e7-93b0-10d7d3fd5447.png)
   
   
![methodv2](https://user-images.githubusercontent.com/31573345/31499043-978f5456-af31-11e7-8c7d-caff1e2b23f2.png)
   
   
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 opened a new pull request #307: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
jmark99 opened a new pull request #307: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/307
 
 
   ACCUMULO-4170 Clarify ClientConfiguration javadocs
   
   Updated the javadoc information with the ClientConfiguration class.
   Specifically reworked the default search path information to be
   displayed as a list rather than inline, thereby easing readability.
   
   Reworded a few of the sentences and moved the path order up to the class
   level rather than the method level.
   
   
![classv18](https://user-images.githubusercontent.com/31573345/31498872-21324e30-af31-11e7-9f82-a63ed0a5609f.png)
   
   
![methodv18](https://user-images.githubusercontent.com/31573345/31498873-25908910-af31-11e7-9664-6f9243e14665.png)
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 opened a new pull request #306: ACCUMULO-4170 Clarify ClientConfiguration javadocs

2017-10-12 Thread git
jmark99 opened a new pull request #306: ACCUMULO-4170 Clarify 
ClientConfiguration javadocs
URL: https://github.com/apache/accumulo/pull/306
 
 
   ACCUMULO-4170 Clarify ClientConfiguration javadocs
   
   Updated the javadoc information with the ClientConfiguration class.
   Specifically reworked the default search path information to be
   displayed as a list rather than inline, thereby easing readability.
   
   Reworded a few of the sentences and moved the path order up to the class
   level rather than the method level.
   
   
![classv17](https://user-images.githubusercontent.com/31573345/31498730-cd75b5d4-af30-11e7-99c1-7a4b3c1c85c9.png)
   
   
![methodv17](https://user-images.githubusercontent.com/31573345/31498736-d2ee7da2-af30-11e7-8a00-6bada949cd3a.png)
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] joshelser commented on issue #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
joshelser commented on issue #305: [ACCUMULO-4591] Add replication latency 
metrics
URL: https://github.com/apache/accumulo/pull/305#issuecomment-335938107
 
 
   No problem, dude. Thanks for the contribution!
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] adamjshook commented on issue #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
adamjshook commented on issue #305: [ACCUMULO-4591] Add replication latency 
metrics
URL: https://github.com/apache/accumulo/pull/305#issuecomment-335930968
 
 
   @joshelser Thanks, if it wouldn't be too much of a hassle.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] adamjshook commented on issue #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
adamjshook commented on issue #305: [ACCUMULO-4591] Add replication latency 
metrics
URL: https://github.com/apache/accumulo/pull/305#issuecomment-335895480
 
 
   @joshelser Back at you!
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add 
replication latency metrics
URL: https://github.com/apache/accumulo/pull/305#discussion_r144061407
 
 

 ##
 File path: 
server/master/src/main/java/org/apache/accumulo/master/metrics/Metrics2ReplicationMetrics.java
 ##
 @@ -124,4 +142,55 @@ protected int getNumConfiguredPeers() {
   protected int getMaxReplicationThreads() {
 return 
replicationUtil.getMaxReplicationThreads(master.getMasterMonitorInfo());
   }
+
+  protected void addReplicationQueueTimeMetrics() {
+// Exit early if replication table is offline
+if (TableState.ONLINE != Tables.getTableState(master.getInstance(), 
ReplicationTable.ID)) {
+  return;
+}
+
+// Exit early if we have no replication peers configured
+if (replicationUtil.getPeers().isEmpty()) {
+  return;
+}
+
+Set paths = replicationUtil.getPendingReplicationPaths();
+
+// We'll take a snap of the current time and use this as a diff between 
any deleted
+// file's modification time and now. The reported latency will be off by 
at most a
+// number of seconds equal to the metric polling period
+long currentTime = System.currentTimeMillis();
+
+// Iterate through all the pending paths and update the mod time if we 
don't know it yet
+for (Path path : paths) {
+  if (!pathModTimes.containsKey(path)) {
+try {
+  pathModTimes.put(path, 
master.getFileSystem().getFileStatus(path).getModificationTime());
+} catch (IOException e) {
+  // Ignore all IOExceptions
+  // Either the system is unavailable or the file was deleted
+  // since the initial scan and this check
+}
+  }
+}
+
+// Remove all currently pending files
+Set deletedPaths = new HashSet<>(pathModTimes.keySet());
+deletedPaths.removeAll(paths);
+
+// Exit early if we have no replicated files to report on
+if (deletedPaths.isEmpty()) {
+  return;
+}
+
+replicationQueueTimeStat.resetMinMax();
+
+for (Path path : deletedPaths) {
+  // Remove this path and add the latency
+  long modTime = pathModTimes.remove(path);
 
 Review comment:
   It's just the unboxing you really have to worry about. Something like the 
following would be fine:
   
   ```java
   Long modTime = pathModTimes.remove(path);
   if (modTime != null) {
 ...
   }
   ```
   
   Java will handle the unboxing for you once you guaranteed that `modTime` is 
definitely non-null.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] adamjshook commented on a change in pull request #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
adamjshook commented on a change in pull request #305: [ACCUMULO-4591] Add 
replication latency metrics
URL: https://github.com/apache/accumulo/pull/305#discussion_r144060426
 
 

 ##
 File path: 
server/master/src/main/java/org/apache/accumulo/master/metrics/Metrics2ReplicationMetrics.java
 ##
 @@ -124,4 +142,55 @@ protected int getNumConfiguredPeers() {
   protected int getMaxReplicationThreads() {
 return 
replicationUtil.getMaxReplicationThreads(master.getMasterMonitorInfo());
   }
+
+  protected void addReplicationQueueTimeMetrics() {
+// Exit early if replication table is offline
+if (TableState.ONLINE != Tables.getTableState(master.getInstance(), 
ReplicationTable.ID)) {
+  return;
+}
+
+// Exit early if we have no replication peers configured
+if (replicationUtil.getPeers().isEmpty()) {
+  return;
+}
+
+Set paths = replicationUtil.getPendingReplicationPaths();
+
+// We'll take a snap of the current time and use this as a diff between 
any deleted
+// file's modification time and now. The reported latency will be off by 
at most a
+// number of seconds equal to the metric polling period
+long currentTime = System.currentTimeMillis();
+
+// Iterate through all the pending paths and update the mod time if we 
don't know it yet
+for (Path path : paths) {
+  if (!pathModTimes.containsKey(path)) {
+try {
+  pathModTimes.put(path, 
master.getFileSystem().getFileStatus(path).getModificationTime());
+} catch (IOException e) {
+  // Ignore all IOExceptions
+  // Either the system is unavailable or the file was deleted
+  // since the initial scan and this check
+}
+  }
+}
+
+// Remove all currently pending files
+Set deletedPaths = new HashSet<>(pathModTimes.keySet());
+deletedPaths.removeAll(paths);
+
+// Exit early if we have no replicated files to report on
+if (deletedPaths.isEmpty()) {
+  return;
+}
+
+replicationQueueTimeStat.resetMinMax();
+
+for (Path path : deletedPaths) {
+  // Remove this path and add the latency
+  long modTime = pathModTimes.remove(path);
 
 Review comment:
   I was actually thinking how best to handle this.  It'd be an implementation 
error if the path to be removed wasn't in `pathModTimes`, since `deletedPaths` 
is a subset of `pathModTimes`.  Maybe check for existence and log an error if 
it doesn't exist?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add 
replication latency metrics
URL: https://github.com/apache/accumulo/pull/305#discussion_r144058257
 
 

 ##
 File path: 
server/master/src/main/java/org/apache/accumulo/master/metrics/Metrics2ReplicationMetrics.java
 ##
 @@ -124,4 +142,55 @@ protected int getNumConfiguredPeers() {
   protected int getMaxReplicationThreads() {
 return 
replicationUtil.getMaxReplicationThreads(master.getMasterMonitorInfo());
   }
+
+  protected void addReplicationQueueTimeMetrics() {
+// Exit early if replication table is offline
+if (TableState.ONLINE != Tables.getTableState(master.getInstance(), 
ReplicationTable.ID)) {
+  return;
+}
+
+// Exit early if we have no replication peers configured
+if (replicationUtil.getPeers().isEmpty()) {
+  return;
+}
+
+Set paths = replicationUtil.getPendingReplicationPaths();
+
+// We'll take a snap of the current time and use this as a diff between 
any deleted
+// file's modification time and now. The reported latency will be off by 
at most a
+// number of seconds equal to the metric polling period
+long currentTime = System.currentTimeMillis();
+
+// Iterate through all the pending paths and update the mod time if we 
don't know it yet
+for (Path path : paths) {
+  if (!pathModTimes.containsKey(path)) {
+try {
+  pathModTimes.put(path, 
master.getFileSystem().getFileStatus(path).getModificationTime());
+} catch (IOException e) {
+  // Ignore all IOExceptions
+  // Either the system is unavailable or the file was deleted
+  // since the initial scan and this check
+}
+  }
+}
+
+// Remove all currently pending files
+Set deletedPaths = new HashSet<>(pathModTimes.keySet());
+deletedPaths.removeAll(paths);
+
+// Exit early if we have no replicated files to report on
+if (deletedPaths.isEmpty()) {
+  return;
+}
+
+replicationQueueTimeStat.resetMinMax();
+
+for (Path path : deletedPaths) {
+  // Remove this path and add the latency
+  long modTime = pathModTimes.remove(path);
 
 Review comment:
   This will throw an error is `pathModTimes` ever does not have a mapping for 
`path` because you're unboxing the `Long` into a `long`.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add 
replication latency metrics
URL: https://github.com/apache/accumulo/pull/305#discussion_r144058388
 
 

 ##
 File path: 
server/master/src/main/java/org/apache/accumulo/master/metrics/Metrics2ReplicationMetrics.java
 ##
 @@ -124,4 +142,55 @@ protected int getNumConfiguredPeers() {
   protected int getMaxReplicationThreads() {
 return 
replicationUtil.getMaxReplicationThreads(master.getMasterMonitorInfo());
   }
+
+  protected void addReplicationQueueTimeMetrics() {
 
 Review comment:
   Could you add a simple test case for this, please?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add 
replication latency metrics
URL: https://github.com/apache/accumulo/pull/305#discussion_r144058516
 
 

 ##
 File path: 
server/master/src/main/java/org/apache/accumulo/master/metrics/Metrics2ReplicationMetrics.java
 ##
 @@ -26,38 +29,51 @@
 import org.apache.accumulo.master.Master;
 import org.apache.accumulo.server.metrics.Metrics;
 import org.apache.accumulo.server.replication.ReplicationUtil;
+import org.apache.hadoop.fs.Path;
 import org.apache.hadoop.metrics2.MetricsCollector;
 import org.apache.hadoop.metrics2.MetricsRecordBuilder;
 import org.apache.hadoop.metrics2.MetricsSource;
 import org.apache.hadoop.metrics2.MetricsSystem;
 import org.apache.hadoop.metrics2.lib.Interns;
 import org.apache.hadoop.metrics2.lib.MetricsRegistry;
+import org.apache.hadoop.metrics2.lib.MutableQuantiles;
+import org.apache.hadoop.metrics2.lib.MutableStat;
 
 /**
  *
  */
 public class Metrics2ReplicationMetrics implements Metrics, MetricsSource {
   public static final String NAME = MASTER_NAME + ",sub=Replication", 
DESCRIPTION = "Data-Center Replication Metrics", CONTEXT = "master",
   RECORD = "MasterReplication";
-  public static final String PENDING_FILES = "filesPendingReplication", 
NUM_PEERS = "numPeers", MAX_REPLICATION_THREADS = "maxReplicationThreads";
+  public static final String PENDING_FILES = "filesPendingReplication", 
NUM_PEERS = "numPeers", MAX_REPLICATION_THREADS = "maxReplicationThreads",
+  REPLICATION_QUEUE_TIME_QUANTILES = "replicationQueue10m", 
REPLICATION_QUEUE_TIME = "replicationQueue";
 
   private final Master master;
   private final MetricsSystem system;
   private final MetricsRegistry registry;
   private final ReplicationUtil replicationUtil;
+  private final MutableQuantiles replicationQueueTimeQuantiles;
+  private final MutableStat replicationQueueTimeStat;
+  private final Map<Path,Long> pathModTimes;
 
   Metrics2ReplicationMetrics(Master master, MetricsSystem system) {
 this.master = master;
 this.system = system;
 
+pathModTimes = new HashMap<>();
+
 registry = new MetricsRegistry(Interns.info(NAME, DESCRIPTION));
 replicationUtil = new ReplicationUtil(master);
+replicationQueueTimeQuantiles = 
registry.newQuantiles(REPLICATION_QUEUE_TIME_QUANTILES, "replication queue time 
quantiles in milliseconds", "ops",
 
 Review comment:
   nit: capitalize "Replication" please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add 
replication latency metrics
URL: https://github.com/apache/accumulo/pull/305#discussion_r144057720
 
 

 ##
 File path: 
server/master/src/main/java/org/apache/accumulo/master/metrics/Metrics2ReplicationMetrics.java
 ##
 @@ -124,4 +142,55 @@ protected int getNumConfiguredPeers() {
   protected int getMaxReplicationThreads() {
 return 
replicationUtil.getMaxReplicationThreads(master.getMasterMonitorInfo());
   }
+
+  protected void addReplicationQueueTimeMetrics() {
+// Exit early if replication table is offline
+if (TableState.ONLINE != Tables.getTableState(master.getInstance(), 
ReplicationTable.ID)) {
+  return;
+}
+
+// Exit early if we have no replication peers configured
+if (replicationUtil.getPeers().isEmpty()) {
+  return;
+}
+
+Set paths = replicationUtil.getPendingReplicationPaths();
+
+// We'll take a snap of the current time and use this as a diff between 
any deleted
+// file's modification time and now. The reported latency will be off by 
at most a
+// number of seconds equal to the metric polling period
+long currentTime = System.currentTimeMillis();
+
+// Iterate through all the pending paths and update the mod time if we 
don't know it yet
+for (Path path : paths) {
+  if (!pathModTimes.containsKey(path)) {
+try {
+  pathModTimes.put(path, 
master.getFileSystem().getFileStatus(path).getModificationTime());
+} catch (IOException e) {
+  // Ignore all IOExceptions
+  // Either the system is unavailable or the file was deleted
+  // since the initial scan and this check
 
 Review comment:
   Add a `log.trace` in case you/someone ever runs into troubles and wants to 
try to debug this.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-11 Thread git
joshelser commented on a change in pull request #305: [ACCUMULO-4591] Add 
replication latency metrics
URL: https://github.com/apache/accumulo/pull/305#discussion_r144058956
 
 

 ##
 File path: 
server/master/src/main/java/org/apache/accumulo/master/metrics/Metrics2ReplicationMetrics.java
 ##
 @@ -26,38 +29,51 @@
 import org.apache.accumulo.master.Master;
 import org.apache.accumulo.server.metrics.Metrics;
 import org.apache.accumulo.server.replication.ReplicationUtil;
+import org.apache.hadoop.fs.Path;
 import org.apache.hadoop.metrics2.MetricsCollector;
 import org.apache.hadoop.metrics2.MetricsRecordBuilder;
 import org.apache.hadoop.metrics2.MetricsSource;
 import org.apache.hadoop.metrics2.MetricsSystem;
 import org.apache.hadoop.metrics2.lib.Interns;
 import org.apache.hadoop.metrics2.lib.MetricsRegistry;
+import org.apache.hadoop.metrics2.lib.MutableQuantiles;
+import org.apache.hadoop.metrics2.lib.MutableStat;
 
 /**
  *
  */
 public class Metrics2ReplicationMetrics implements Metrics, MetricsSource {
   public static final String NAME = MASTER_NAME + ",sub=Replication", 
DESCRIPTION = "Data-Center Replication Metrics", CONTEXT = "master",
   RECORD = "MasterReplication";
-  public static final String PENDING_FILES = "filesPendingReplication", 
NUM_PEERS = "numPeers", MAX_REPLICATION_THREADS = "maxReplicationThreads";
+  public static final String PENDING_FILES = "filesPendingReplication", 
NUM_PEERS = "numPeers", MAX_REPLICATION_THREADS = "maxReplicationThreads",
+  REPLICATION_QUEUE_TIME_QUANTILES = "replicationQueue10m", 
REPLICATION_QUEUE_TIME = "replicationQueue";
 
   private final Master master;
   private final MetricsSystem system;
   private final MetricsRegistry registry;
   private final ReplicationUtil replicationUtil;
+  private final MutableQuantiles replicationQueueTimeQuantiles;
+  private final MutableStat replicationQueueTimeStat;
+  private final Map<Path,Long> pathModTimes;
 
   Metrics2ReplicationMetrics(Master master, MetricsSystem system) {
 this.master = master;
 this.system = system;
 
+pathModTimes = new HashMap<>();
+
 registry = new MetricsRegistry(Interns.info(NAME, DESCRIPTION));
 replicationUtil = new ReplicationUtil(master);
+replicationQueueTimeQuantiles = 
registry.newQuantiles(REPLICATION_QUEUE_TIME_QUANTILES, "replication queue time 
quantiles in milliseconds", "ops",
+"latency", 600);
+replicationQueueTimeStat = registry.newStat(REPLICATION_QUEUE_TIME, 
"replication queue time stat in milliseconds", "ops", "latency", true);
 
 Review comment:
   nit: "statistics" instead of "stat" (and capitalize Replication too)
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] adamjshook opened a new pull request #305: [ACCUMULO-4591] Add replication latency metrics

2017-10-10 Thread git
adamjshook opened a new pull request #305: [ACCUMULO-4591] Add replication 
latency metrics
URL: https://github.com/apache/accumulo/pull/305
 
 
   Supersedes #220 
   
   Trying to capture the latency based on WAL modification time and the current 
time once the file disappears from the status section of the replication table.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] milleruntime closed pull request #5: ACCUMULO-4717 Add checkstyle plugin to check API

2017-10-06 Thread git
milleruntime closed pull request #5: ACCUMULO-4717 Add checkstyle plugin to 
check API
URL: https://github.com/apache/accumulo-testing/pull/5
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #304: ACCUMULO-4716 Don't cache blks over max array size

2017-10-06 Thread git
keith-turner commented on a change in pull request #304: ACCUMULO-4716 Don't 
cache blks over max array size
URL: https://github.com/apache/accumulo/pull/304#discussion_r143228666
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/file/blockfile/impl/CachableBlockFile.java
 ##
 @@ -156,6 +156,10 @@ public long getStartPos() throws IOException {
 private boolean closed = false;
 private AccumuloConfiguration accumuloConfiguration = null;
 
+// ACCUMULO-4716 - Define MAX_ARRAY_SIZE smaller than Integer.MAX_VALUE to 
prevent possible OutOfMemory
+// errors when allocating arrays - described in stackoverflow post: 
https://stackoverflow.com/a/8381338
 
 Review comment:
   I like this comment.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #304: ACCUMULO-4716 Don't cache blks over max array size

2017-10-06 Thread git
jmark99 commented on issue #304: ACCUMULO-4716 Don't cache blks over max array 
size
URL: https://github.com/apache/accumulo/pull/304#issuecomment-334795821
 
 
   ACCUMULO-4716 Don't cache blocks over max array size
   
   Modified to move MAX_ARRAY_SIZE from within the method to a constant class 
value instead.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #304: ACCUMULO-4716 Don't cache blks over max array size

2017-10-06 Thread git
keith-turner commented on a change in pull request #304: ACCUMULO-4716 Don't 
cache blks over max array size
URL: https://github.com/apache/accumulo/pull/304#discussion_r143213486
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/file/blockfile/impl/CachableBlockFile.java
 ##
 @@ -342,7 +342,11 @@ private BlockRead getBlock(String _lookup, BlockCache 
cache, BlockLoader loader)
 
 private BlockRead cacheBlock(String _lookup, BlockCache cache, BlockReader 
_currBlock, String block) throws IOException {
 
-  if ((cache == null) || (_currBlock.getRawSize() > cache.getMaxSize())) {
+  // ACCUMULO-4716 - Define MAX_ARRAY_SIZE smaller than Integer.MAX_VALUE 
to prevent possible OutOfMemory
+  // error as described in stackoverflow post: 
https://stackoverflow.com/a/8381338.
+  int MAX_ARRAY_SIZE = Integer.MAX_VALUE - 8;
 
 Review comment:
   Personally, I would make this a static constants because all caps usually 
implies that by convention.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 opened a new pull request #304: ACCUMULO-4716 Don't cache blks over max array size

2017-10-05 Thread git
jmark99 opened a new pull request #304: ACCUMULO-4716 Don't cache blks over max 
array size
URL: https://github.com/apache/accumulo/pull/304
 
 
   ACCUMULO-4716 Don't cache blocks over max array size
   
   Prevents byte array from caching up to Integer.MAX_VALUE to prevent
   possible OutofMemory error as described in StackOverflow post
   https://stackoverflow.com/a/8381338
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] milleruntime opened a new pull request #5: ACCUMULO-4717 Add checkstyle plugin to check API

2017-10-05 Thread git
milleruntime opened a new pull request #5: ACCUMULO-4717 Add checkstyle plugin 
to check API
URL: https://github.com/apache/accumulo-testing/pull/5
 
 
   Will only run with "mvn checkstyle:checkstyle" but would be good to check 
usage of non public API.  This is more of a long term goal since there are a 
lot.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] milleruntime closed pull request #27: Add FineAndDandy as contributor

2017-10-05 Thread git
milleruntime closed pull request #27: Add FineAndDandy as contributor
URL: https://github.com/apache/accumulo-website/pull/27
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] FineAndDandy opened a new pull request #27: Add FineAndDandy as contributor

2017-10-05 Thread git
FineAndDandy opened a new pull request #27: Add FineAndDandy as contributor
URL: https://github.com/apache/accumulo-website/pull/27
 
 
   Add FineAndDandy as Accumulo contributor for PR apache/accumulo#303 and 
issue ACCUMULO-4713
   
   Thanks!
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] FineAndDandy commented on issue #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-05 Thread git
FineAndDandy commented on issue #303: ACCUMULO-4713: Correctly set ranges for 
IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#issuecomment-334448560
 
 
   Thanks @keith-turner @mjwall I'd love to be added to the people page. I'll 
open a pr
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] mjwall commented on issue #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-04 Thread git
mjwall commented on issue #303: ACCUMULO-4713: Correctly set ranges for 
IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#issuecomment-334290617
 
 
   too slow, but I reviewed this too and approve.  Thanks @FineAndDandy 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner closed pull request #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-04 Thread git
keith-turner closed pull request #303: ACCUMULO-4713: Correctly set ranges for 
IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-04 Thread git
keith-turner commented on issue #303: ACCUMULO-4713: Correctly set ranges for 
IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#issuecomment-334275091
 
 
   Merged in 0241958ec3cd5e764b94a1fbd485fc859fdc393f
   
   Thanks for the contribution @FineAndDandy, nice work.  Would you like to be 
added to our [people page](http://accumulo.apache.org/people/)?  If so let me 
know what info you would like added or open a pr to the [website 
repo](https://github.com/apache/accumulo-website) to modify `pages/people.md`.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-04 Thread git
keith-turner commented on issue #303: ACCUMULO-4713: Correctly set ranges for 
IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#issuecomment-334250968
 
 
   I'm working on merging this.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-04 Thread git
jmark99 commented on issue #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#issuecomment-334236976
 
 
   Thanks, @keith-turner. I'll take a look at the ticket you provide.
   
   On Wed, Oct 4, 2017 at 1:44 PM, Keith Turner <notificati...@github.com>
   wrote:
   
   > @jmark99 <https://github.com/jmark99> I updated the people page. I didn't
   > include you username because I think those are ASF usernames. If you are
   > interested I just opened ACCUMULO-4716
   > <https://issues.apache.org/jira/browse/ACCUMULO-4716>. Its something I
   > ran into recently.
   >
   > ?
   > You are receiving this because you were mentioned.
   > Reply to this email directly, view it on GitHub
   > <https://github.com/apache/accumulo/pull/302#issuecomment-334234886>, or 
mute
   > the thread
   > 
<https://github.com/notifications/unsubscribe-auth/AeHFYUQTQ3iMtEQjNJj6DG2WPqtwQyUfks5so8PygaJpZM4PrDna>
   > .
   >
   
 
--------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-04 Thread git
keith-turner commented on issue #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#issuecomment-334234886
 
 
   @jmark99 I updated the people page.  I didn't include you username because I 
think those are ASF usernames.  If you are interested I just opened 
[ACCUMULO-4716](https://issues.apache.org/jira/browse/ACCUMULO-4716).  Its 
something I ran into recently.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-04 Thread git
jmark99 commented on issue #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#issuecomment-334204287
 
 
   Keith, currently I've been starting with some of the 'newbie' issues as I'm
   attempting to come up to speed on the project. If you have any suggestions
   for tickets that would help me become more familiar with the project
   internals but wouldn't be too overwhelming initially I would be glad to
   hear of some.
   Thanks
   
   On Wed, Oct 4, 2017 at 11:53 AM, Keith Turner <notificati...@github.com>
   wrote:
   
   > I look forward to continuing to supply contributions to the project.
   >
   > Great! If you would like assistance finding issues to work on let me know.
   >
   > ?
   > You are receiving this because you were mentioned.
   > Reply to this email directly, view it on GitHub
   > <https://github.com/apache/accumulo/pull/302#issuecomment-334202417>, or 
mute
   > the thread
   > 
<https://github.com/notifications/unsubscribe-auth/AeHFYRnPIjXaFNc_hkRQyZdKrf2vlRR0ks5so6oSgaJpZM4PrDna>
   > .
   >
   
 
----
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-04 Thread git
keith-turner commented on issue #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#issuecomment-334202417
 
 
   >  I look forward to continuing to supply contributions to the project.
   
   Great! If you would like assistance finding issues to work on let me know. 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-04 Thread git
jmark99 commented on issue #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#issuecomment-334171419
 
 
   Thanks Keith,
   
   Not sure about the email change. No problem though.
   
   Yes, I would be interested in being added to the Accumulo people page.  I
   look forward to continuing to supply contributions to the project.  You can
   add the following info if you would:
   username: jmark99
   name: Mark Owens
   organization: 
   timezone: ET
   
   Thanks,
   mark
   
   
   On Wed, Oct 4, 2017 at 9:56 AM, Keith Turner <notificati...@github.com>
   wrote:
   
   > @jmark99 <https://github.com/jmark99> I merged this through Github and
   > for some reason it changed your email address in the commit. I'm sorry
   > about that, not sure why GH did that.
   >
   > Thanks for the contribution. Would you like to be added to the Accumulo 
people
   > page <http://accumulo.apache.org/people/> as a contributor? If so let me
   > know what info you would like there and I can add it, or you can make a PR
   > to the website repo <https://github.com/apache/accumulo-website> to add
   > yourself. You can just edit pages/people.md
   >
   > ?
   > You are receiving this because you were mentioned.
   > Reply to this email directly, view it on GitHub
   > <https://github.com/apache/accumulo/pull/302#issuecomment-334163552>, or 
mute
   > the thread
   > 
<https://github.com/notifications/unsubscribe-auth/AeHFYWkEVFD5CfiwwJ8NE_13k1H7to3Hks5so46igaJpZM4PrDna>
   > .
   >
   
 
----
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-04 Thread git
keith-turner commented on issue #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#issuecomment-334163552
 
 
   @jmark99  I merged this through Github and for some reason it changed your 
email address in the commit.  I'm sorry about that, not sure why GH did that. 
   
   Thanks for the contribution.  Would you like to be added to the Accumulo 
[people page][1] as a contributor?  If so let me know what info you would like 
there and I can add it, or you can make a PR to the [website repo][2] to add 
yourself. You can just edit `pages/people.md`
   
   [1]: http://accumulo.apache.org/people/
   [2]: https://github.com/apache/accumulo-website
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner closed pull request #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-04 Thread git
keith-turner closed pull request #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] FineAndDandy commented on a change in pull request #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-04 Thread git
FineAndDandy commented on a change in pull request #303: ACCUMULO-4713: 
Correctly set ranges for IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#discussion_r142669137
 
 

 ##
 File path: 
core/src/test/java/org/apache/accumulo/core/iterators/system/TimeSettingIteratorTest.java
 ##
 @@ -106,4 +117,135 @@ public void testAvoidKeyCopy() throws Exception {
 assertFalse(tsi.hasTop());
   }
 
+  @Test
+  public void testExclusiveEndKey() throws IOException {
+Text row = new Text("a");
+Text colf = new Text("b");
+Text colq = new Text("c");
+Text cv = new Text();
+
+List<Map.Entry<Key,Value>> sources = new ArrayList<>();
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("00".getBytes(;
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("11".getBytes(;
+
+TimeSettingIterator it = new TimeSettingIterator(new 
IteratorToSortedKeyValueIterator(sources.iterator()), 111L);
+IteratorSetting is = new IteratorSetting(1, TimeSettingIterator.class);
+it.init(null, is.getOptions(), null);
+
+Key startKey = new Key();
+Key endKey = new Key(row, colf, colq, cv, Long.MIN_VALUE);
+Range testRange = new Range(startKey, false, endKey, false);
+it.seek(testRange, new HashSet(), false);
+
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("00".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("11".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertFalse(it.hasTop());
+  }
+
+  @Test
+  public void testInclusiveEndKey() throws IOException {
+Text row = new Text("a");
+Text colf = new Text("b");
+Text colq = new Text("c");
+Text cv = new Text();
+
+List<Map.Entry<Key,Value>> sources = new ArrayList<>();
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("00".getBytes(;
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("11".getBytes(;
+
+TimeSettingIterator it = new TimeSettingIterator(new 
IteratorToSortedKeyValueIterator(sources.iterator()), 111L);
+IteratorSetting is = new IteratorSetting(1, TimeSettingIterator.class);
+it.init(null, is.getOptions(), null);
+
+Key startKey = new Key();
+Key endKey = new Key(row, colf, colq, cv, Long.MIN_VALUE);
+Range testRange = new Range(startKey, false, endKey, true);
 
 Review comment:
   will fix
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] FineAndDandy commented on a change in pull request #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-04 Thread git
FineAndDandy commented on a change in pull request #303: ACCUMULO-4713: 
Correctly set ranges for IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#discussion_r142669050
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/iterators/IteratorUtil.java
 ##
 @@ -312,10 +317,15 @@ public static Range maximizeStartKeyTimeStamp(Range 
range) {
   public static Range minimizeEndKeyTimeStamp(Range range) {
 Range seekRange = range;
 
-if (range.getEndKey() != null && range.getEndKey().getTimestamp() != 
Long.MIN_VALUE) {
+if (range.getEndKey() != null) {
   Key seekKey = new Key(seekRange.getEndKey());
 
 Review comment:
   will fix
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] FineAndDandy commented on a change in pull request #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-04 Thread git
FineAndDandy commented on a change in pull request #303: ACCUMULO-4713: 
Correctly set ranges for IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#discussion_r142668228
 
 

 ##
 File path: 
core/src/test/java/org/apache/accumulo/core/iterators/system/TimeSettingIteratorTest.java
 ##
 @@ -106,4 +117,135 @@ public void testAvoidKeyCopy() throws Exception {
 assertFalse(tsi.hasTop());
   }
 
+  @Test
+  public void testExclusiveEndKey() throws IOException {
+Text row = new Text("a");
+Text colf = new Text("b");
+Text colq = new Text("c");
+Text cv = new Text();
+
+List<Map.Entry<Key,Value>> sources = new ArrayList<>();
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("00".getBytes(;
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("11".getBytes(;
+
+TimeSettingIterator it = new TimeSettingIterator(new 
IteratorToSortedKeyValueIterator(sources.iterator()), 111L);
+IteratorSetting is = new IteratorSetting(1, TimeSettingIterator.class);
+it.init(null, is.getOptions(), null);
+
+Key startKey = new Key();
+Key endKey = new Key(row, colf, colq, cv, Long.MIN_VALUE);
+Range testRange = new Range(startKey, false, endKey, false);
+it.seek(testRange, new HashSet(), false);
+
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("00".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("11".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertFalse(it.hasTop());
+  }
+
+  @Test
+  public void testInclusiveEndKey() throws IOException {
+Text row = new Text("a");
+Text colf = new Text("b");
+Text colq = new Text("c");
+Text cv = new Text();
+
+List<Map.Entry<Key,Value>> sources = new ArrayList<>();
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("00".getBytes(;
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("11".getBytes(;
+
+TimeSettingIterator it = new TimeSettingIterator(new 
IteratorToSortedKeyValueIterator(sources.iterator()), 111L);
+IteratorSetting is = new IteratorSetting(1, TimeSettingIterator.class);
+it.init(null, is.getOptions(), null);
+
+Key startKey = new Key();
+Key endKey = new Key(row, colf, colq, cv, Long.MIN_VALUE);
+Range testRange = new Range(startKey, false, endKey, true);
+it.seek(testRange, new HashSet(), false);
+
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("00".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("11".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertFalse(it.hasTop());
+  }
+
+  public class IteratorToSortedKeyValueIterator implements 
SortedKeyValueIterator<Key,Value> {
 
 Review comment:
   I think I can rework this to use a deleted flag rather than the internal 
class for testing. The important part was that I have two keys with the same 
Row/CF/CQ/MIN_LONG in a row to reproduce the problem case. 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-04 Thread git
jmark99 commented on issue #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#issuecomment-334116454
 
 
   Thanks for the review, Keith. Comments have been removed and changes pushed 
to github.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-03 Thread git
keith-turner commented on a change in pull request #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#discussion_r142532855
 
 

 ##
 File path: 
server/tserver/src/main/java/org/apache/accumulo/tserver/tablet/Tablet.java
 ##
 @@ -1686,7 +1686,9 @@ public synchronized boolean needsSplit() {
   // BEGIN PRIVATE METHODS RELATED TO MAJOR COMPACTION
 
   private boolean isCompactionEnabled() {
-return !isClosing() && !getTabletServer().isMajorCompactionDisabled();
+// ACCUMULO-2968: Removed reference to !isMajorCompactionDisabled 
in return
 
 Review comment:
   I think this comment can be omitted 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-03 Thread git
keith-turner commented on a change in pull request #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#discussion_r142532990
 
 

 ##
 File path: 
server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServer.java
 ##
 @@ -1925,20 +1924,11 @@ public SplitRunner(Tablet tablet) {
 
 @Override
 public void run() {
-  if (majorCompactorDisabled) {
-// this will make split task that were queued when shutdown was
-// initiated exit
-return;
-  }
-
+  // ACCUMULO-2968: Removed if condition that always returned 
false.
 
 Review comment:
   I think this comment can be omitted 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-03 Thread git
keith-turner commented on a change in pull request #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#discussion_r142532833
 
 

 ##
 File path: 
server/tserver/src/main/java/org/apache/accumulo/tserver/tablet/CompactionRunner.java
 ##
 @@ -34,12 +34,9 @@ public CompactionRunner(Tablet tablet, 
MajorCompactionReason reason) {
 
   @Override
   public void run() {
-if (tablet.getTabletServer().isMajorCompactionDisabled()) {
-  // this will make compaction tasks that were queued when shutdown was
-  // initiated exit
-  tablet.removeMajorCompactionQueuedReason(reason);
-  return;
-}
+// ACCUMULO-2968: isMajorCompactionDisabled() previously always returned 
false.
 
 Review comment:
   I think this comment can be omitted 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-03 Thread git
keith-turner commented on a change in pull request #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302#discussion_r142532884
 
 

 ##
 File path: 
server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServer.java
 ##
 @@ -1963,7 +1953,10 @@ public MajorCompactor(AccumuloConfiguration config) {
 
 @Override
 public void run() {
-  while (!majorCompactorDisabled) {
+  // ACCUMULO-2968: While loop previously checked for value of 
!majorCompactionDisabled.
 
 Review comment:
   I think this comment can be omitted 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #300: ACCUMULO-4708 Limit RFile block size to 2GB

2017-10-03 Thread git
keith-turner commented on issue #300: ACCUMULO-4708 Limit RFile block size to 
2GB
URL: https://github.com/apache/accumulo/pull/300#issuecomment-333985010
 
 
   I ran the following test against master and it was successful after I made 
the changes mentioned in the comment at the end of the patch.  Based on this it 
seems that entries large than 2G can currently work.  The small changes I made 
were related to the fact that it seems that MAX_INT-2 is the largest array you 
can allocate in java.
   
   https://gist.github.com/keith-turner/90e318057f55b080dd93aa00277faa4b
   
   I was surprised the test with uncompressible data worked.  I suspect it 
created a compressed block larger than 2G which I was thinking would fail.  
Need to dig and understand why it worked.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-03 Thread git
keith-turner commented on a change in pull request #303: ACCUMULO-4713: 
Correctly set ranges for IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#discussion_r142518818
 
 

 ##
 File path: 
core/src/test/java/org/apache/accumulo/core/iterators/system/TimeSettingIteratorTest.java
 ##
 @@ -106,4 +117,135 @@ public void testAvoidKeyCopy() throws Exception {
 assertFalse(tsi.hasTop());
   }
 
+  @Test
+  public void testExclusiveEndKey() throws IOException {
+Text row = new Text("a");
+Text colf = new Text("b");
+Text colq = new Text("c");
+Text cv = new Text();
+
+List<Map.Entry<Key,Value>> sources = new ArrayList<>();
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("00".getBytes(;
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("11".getBytes(;
+
+TimeSettingIterator it = new TimeSettingIterator(new 
IteratorToSortedKeyValueIterator(sources.iterator()), 111L);
+IteratorSetting is = new IteratorSetting(1, TimeSettingIterator.class);
+it.init(null, is.getOptions(), null);
+
+Key startKey = new Key();
+Key endKey = new Key(row, colf, colq, cv, Long.MIN_VALUE);
+Range testRange = new Range(startKey, false, endKey, false);
+it.seek(testRange, new HashSet(), false);
+
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("00".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("11".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertFalse(it.hasTop());
+  }
+
+  @Test
+  public void testInclusiveEndKey() throws IOException {
+Text row = new Text("a");
+Text colf = new Text("b");
+Text colq = new Text("c");
+Text cv = new Text();
+
+List<Map.Entry<Key,Value>> sources = new ArrayList<>();
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("00".getBytes(;
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("11".getBytes(;
+
+TimeSettingIterator it = new TimeSettingIterator(new 
IteratorToSortedKeyValueIterator(sources.iterator()), 111L);
+IteratorSetting is = new IteratorSetting(1, TimeSettingIterator.class);
+it.init(null, is.getOptions(), null);
+
+Key startKey = new Key();
+Key endKey = new Key(row, colf, colq, cv, Long.MIN_VALUE);
+Range testRange = new Range(startKey, false, endKey, true);
+it.seek(testRange, new HashSet(), false);
+
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("00".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("11".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertFalse(it.hasTop());
+  }
+
+  public class IteratorToSortedKeyValueIterator implements 
SortedKeyValueIterator<Key,Value> {
 
 Review comment:
   Why not use the 
[SortedMapIterator](https://github.com/apache/accumulo/blob/rel/1.8.1/core/src/main/java/org/apache/accumulo/core/iterators/SortedMapIterator.java)?
  It actually does something for seek, which seems like it would be better for 
the test?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-03 Thread git
keith-turner commented on a change in pull request #303: ACCUMULO-4713: 
Correctly set ranges for IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#discussion_r142524428
 
 

 ##
 File path: 
core/src/test/java/org/apache/accumulo/core/iterators/system/TimeSettingIteratorTest.java
 ##
 @@ -106,4 +117,135 @@ public void testAvoidKeyCopy() throws Exception {
 assertFalse(tsi.hasTop());
   }
 
+  @Test
+  public void testExclusiveEndKey() throws IOException {
+Text row = new Text("a");
+Text colf = new Text("b");
+Text colq = new Text("c");
+Text cv = new Text();
+
+List<Map.Entry<Key,Value>> sources = new ArrayList<>();
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("00".getBytes(;
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("11".getBytes(;
+
+TimeSettingIterator it = new TimeSettingIterator(new 
IteratorToSortedKeyValueIterator(sources.iterator()), 111L);
+IteratorSetting is = new IteratorSetting(1, TimeSettingIterator.class);
+it.init(null, is.getOptions(), null);
+
+Key startKey = new Key();
+Key endKey = new Key(row, colf, colq, cv, Long.MIN_VALUE);
+Range testRange = new Range(startKey, false, endKey, false);
+it.seek(testRange, new HashSet(), false);
+
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("00".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertTrue(it.hasTop());
+assertTrue(it.getTopValue().equals(new Value("11".getBytes(;
+assertTrue(it.getTopKey().getTimestamp() == 111L);
+it.next();
+assertFalse(it.hasTop());
+  }
+
+  @Test
+  public void testInclusiveEndKey() throws IOException {
+Text row = new Text("a");
+Text colf = new Text("b");
+Text colq = new Text("c");
+Text cv = new Text();
+
+List<Map.Entry<Key,Value>> sources = new ArrayList<>();
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("00".getBytes(;
+sources.add(new AbstractMap.SimpleImmutableEntry<>(new Key(row, colf, 
colq, cv, Long.MIN_VALUE), new Value("11".getBytes(;
+
+TimeSettingIterator it = new TimeSettingIterator(new 
IteratorToSortedKeyValueIterator(sources.iterator()), 111L);
+IteratorSetting is = new IteratorSetting(1, TimeSettingIterator.class);
+it.init(null, is.getOptions(), null);
+
+Key startKey = new Key();
+Key endKey = new Key(row, colf, colq, cv, Long.MIN_VALUE);
+Range testRange = new Range(startKey, false, endKey, true);
 
 Review comment:
   Is the inclusiveness of the end key the only diff with these test?  If so, 
personally I would combine the code somehow.  Could have a function that each 
test calls or a single test with a loop like :
   
   ```java
   for(boolean endKeyInclusive : new boolean[]{false,true}){
 //all test code here
   }
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on a change in pull request #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-03 Thread git
keith-turner commented on a change in pull request #303: ACCUMULO-4713: 
Correctly set ranges for IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#discussion_r142518372
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/iterators/IteratorUtil.java
 ##
 @@ -312,10 +317,15 @@ public static Range maximizeStartKeyTimeStamp(Range 
range) {
   public static Range minimizeEndKeyTimeStamp(Range range) {
 Range seekRange = range;
 
-if (range.getEndKey() != null && range.getEndKey().getTimestamp() != 
Long.MIN_VALUE) {
+if (range.getEndKey() != null) {
   Key seekKey = new Key(seekRange.getEndKey());
 
 Review comment:
   This always creates a new Key, but its not always used. Also, in the 
following if stmt block it recreates the key and does not used the one created 
here.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on issue #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-03 Thread git
ctubbsii commented on issue #303: ACCUMULO-4713: Correctly set ranges for 
IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#issuecomment-333978496
 
 
   @FineAndDandy I was mostly trying to helpfully suggest better log messages 
for future. We can always clean up log messages when we merge, if we want. I 
wouldn't worry about it for this one, unless you really want to, in which case 
a quick `git commit --amend` and force push to your branch would do it.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] FineAndDandy commented on issue #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-03 Thread git
FineAndDandy commented on issue #303: ACCUMULO-4713: Correctly set ranges for 
IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#issuecomment-333961927
 
 
   No problem. Do you prefer separate commits or should I fix/rebase and push 
the PR again?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on issue #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-03 Thread git
ctubbsii commented on issue #303: ACCUMULO-4713: Correctly set ranges for 
IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#issuecomment-333949909
 
 
   Haven't reviewed the change yet (hopefully will have time later), but wanted 
to immediately suggest to please use shorter subject lines in git log messages. 
Can provide as much detail as needed in the commit log body, but the subject 
should be short, or it messes up the readability of tons of git tooling. Here's 
one (of many) guide which could be used: 
https://chris.beams.io/posts/git-commit/
   
   A hint: if the first line of the git log message cannot fit into the GitHub 
PR subject, then it's too long. :)
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on issue #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-03 Thread git
ctubbsii commented on issue #303: ACCUMULO-4713: Correctly set ranges for 
IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303#issuecomment-333949909
 
 
   Haven't reviewed the change yet (hopefully will have time later), but wanted 
to immediately suggest to please use shorter subject lines in git log messages. 
Can provide as much detail as needed in the commit log body, but the subject 
should be short, or it messes up the readability of tons of git tooling. Here's 
one (of many) guide which could be used: 
https://chris.beams.io/posts/git-commit/
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] FineAndDandy opened a new pull request #303: ACCUMULO-4713: Correctly set ranges for IteratorUtil.maximizeStartKey?

2017-10-03 Thread git
FineAndDandy opened a new pull request #303: ACCUMULO-4713: Correctly set 
ranges for IteratorUtil.maximizeStartKey?
URL: https://github.com/apache/accumulo/pull/303
 
 
   ?TimeStamp() and IteratorUtil.minimizeEndKeyTimeStamp()
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] keith-turner commented on issue #300: ACCUMULO-4708 Limit RFile block size to 2GB

2017-10-02 Thread git
keith-turner commented on issue #300: ACCUMULO-4708 Limit RFile block size to 
2GB
URL: https://github.com/apache/accumulo/pull/300#issuecomment-333680449
 
 
   I took another in depth look at this change and I am not sure the premise of 
this fix is correct.  Looking at the code it seems that in the case where the 
key+value exceeds 2G but compresses to less than 2G that it may be ok.  I need 
to do more investigation.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] mikewalch closed pull request #6: ACCUMULO-4516: TeraSortIngest splits argument is ignored if less that 10

2017-10-02 Thread git
mikewalch closed pull request #6: ACCUMULO-4516: TeraSortIngest splits argument 
is ignored if less that 10
URL: https://github.com/apache/accumulo-examples/pull/6
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 opened a new pull request #302: ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled

2017-10-02 Thread git
jmark99 opened a new pull request #302: ACCUMULO-2968: Axe 
TabletServer.majorCompactorDisabled
URL: https://github.com/apache/accumulo/pull/302
 
 
   ACCUMULO-2968: Axe TabletServer.majorCompactorDisabled
   
   Removed references to majorCompatorDisabled variable defined in
   accumulo/tserver/TabletServer.java.  This private variable was
   defined as 'false' and never modified in source and additionally
   no means were provided to alter the value in the code.
   
   Several locations within TabletServer.java where updated. This
   included a couple of 'if' statement that would never be executed as well
   as some return statement where a constant value of true was
   unnecessarily included.
   
   There was one reference each within CompationRunner.java and Tablet.java
   which were removed also.
   
   The change of most note is probably the while loop within the run method
   of TableServer. Previously the while check read:
  while(!majorCompatorDisabled)
   
   whereas now it reads:
  while(true)...
   
   which was what it would always equate too. This could be seen to be a
   loss of information perhaps. A comment was added to describe what
   existed prior to the update.
   
   After update all unit tests continued to pass as well as the 'sunny-day' 
integration tests.
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #6: ACCUMULO-4516: TeraSortIngest splits argument is ignored if less that 10

2017-10-02 Thread git
jmark99 commented on issue #6: ACCUMULO-4516: TeraSortIngest splits argument is 
ignored if less that 10
URL: https://github.com/apache/accumulo-examples/pull/6#issuecomment-333563921
 
 
   Thanks, Mike
   
   On Mon, Oct 2, 2017 at 11:09 AM, Mike Walch <notificati...@github.com>
   wrote:
   
   > *@mikewalch* approved this pull request.
   >
   > Thanks for the PR! Looks good to me. I will merge it in soon but I want to
   > give others a chance to review it first.
   >
   > ?
   > You are receiving this because you authored the thread.
   > Reply to this email directly, view it on GitHub
   > 
<https://github.com/apache/accumulo-examples/pull/6#pullrequestreview-66488115>,
   > or mute the thread
   > 
<https://github.com/notifications/unsubscribe-auth/AeHFYYhid-RQYwYWUXJvHx0Zcj3O_va1ks5soPyOgaJpZM4Pqp81>
   > .
   >
   
 
----
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 opened a new pull request #6: ACCUMULO-4516: TeraSortIngest splits argumnet is ignored if less that 10

2017-10-02 Thread git
jmark99 opened a new pull request #6: ACCUMULO-4516: TeraSortIngest splits 
argumnet is ignored if less that 10
URL: https://github.com/apache/accumulo-examples/pull/6
 
 
   ACCUMULO-4516: TeraSortIngest splits argumnet is ignored if less that 10 
arguments are provided.
   
   Modified TeraSortIngest.java to remove arg length check as a condition
   for setting the number of splits.  Instead check to see if the splits option
   is non-zero and if so, set the number of splits.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii closed pull request #301: ACCUMULO-2907 Invalidate "this may not be applicable for your security setup" warning from initialize

2017-09-27 Thread git
ctubbsii closed pull request #301: ACCUMULO-2907 Invalidate "this may not be 
applicable for your security setup" warning from initialize
URL: https://github.com/apache/accumulo/pull/301
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on a change in pull request #301: ACCUMULO-2907 Invalidate "this may not be applicable for your security setup" warning from initialize

2017-09-27 Thread git
jmark99 commented on a change in pull request #301: ACCUMULO-2907 Invalidate 
"this may not be applicable for your security setup" warning from initialize
URL: https://github.com/apache/accumulo/pull/301#discussion_r141462552
 
 

 ##
 File path: 
server/base/src/main/java/org/apache/accumulo/server/init/Initialize.java
 ##
 @@ -644,6 +644,25 @@ private String getRootUserName(Opts opts) throws 
IOException {
 return rootpass.getBytes(UTF_8);
   }
 
+  /**
+   * Create warning message related to initial password, if appropriate.
+   *
+   * ACCUMULO-2907 Remove unnecessary security warning from console message 
unless its actually appropriate.
+   * The warning message should only be displayed when the value of 
instance.security.authenticator
+   * differs between the SiteConfiguration and the DefaultConfiguration values.
+   *
+   * @return String containing warning portion of console message.
+   */
+  private String getInitialPasswordWarning() {
+StringBuilder optionalWarning = new StringBuilder();
+Property authenticatorProperty = Property.INSTANCE_SECURITY_AUTHENTICATOR;
+if 
(SiteConfiguration.getInstance().get(authenticatorProperty).equals(authenticatorProperty.getDefaultValue()))
+  optionalWarning.append(": ");
+else
+  optionalWarning.append(" (this may not be applicable for your security 
setup): ");
+return optionalWarning.toString();
 
 Review comment:
   ACCUMULO-2907: Updated to replace StringBuilder usage with String.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on a change in pull request #301: ACCUMULO-2907 Invalidate "this may not be applicable for your security setup" warning from initialize

2017-09-27 Thread git
ctubbsii commented on a change in pull request #301: ACCUMULO-2907 Invalidate 
"this may not be applicable for your security setup" warning from initialize
URL: https://github.com/apache/accumulo/pull/301#discussion_r141460788
 
 

 ##
 File path: 
server/base/src/main/java/org/apache/accumulo/server/init/Initialize.java
 ##
 @@ -644,6 +644,25 @@ private String getRootUserName(Opts opts) throws 
IOException {
 return rootpass.getBytes(UTF_8);
   }
 
+  /**
+   * Create warning message related to initial password, if appropriate.
+   *
+   * ACCUMULO-2907 Remove unnecessary security warning from console message 
unless its actually appropriate.
+   * The warning message should only be displayed when the value of 
instance.security.authenticator
+   * differs between the SiteConfiguration and the DefaultConfiguration values.
+   *
+   * @return String containing warning portion of console message.
+   */
+  private String getInitialPasswordWarning() {
+StringBuilder optionalWarning = new StringBuilder();
+Property authenticatorProperty = Property.INSTANCE_SECURITY_AUTHENTICATOR;
+if 
(SiteConfiguration.getInstance().get(authenticatorProperty).equals(authenticatorProperty.getDefaultValue()))
+  optionalWarning.append(": ");
+else
+  optionalWarning.append(" (this may not be applicable for your security 
setup): ");
+return optionalWarning.toString();
 
 Review comment:
   Can probably avoid use of StringBuilder here, since there is only a single 
string to return in each case, and it's not being concatenated with anything 
else until outside the method. Other than that, this change looks good to me. 
Thanks for the contribution!
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 commented on issue #301: ACCUMULO-2907 Invalidate "this may not be applicable for your security setup" warning from initialize

2017-09-27 Thread git
jmark99 commented on issue #301: ACCUMULO-2907 Invalidate "this may not be 
applicable for your security setup" warning from initialize
URL: https://github.com/apache/accumulo/pull/301#issuecomment-332635247
 
 
   ACCUMULO-2907: Updated pull request to display security warning message only 
when applicable.  Compare the value of instance.security.authenticator in the  
Site configuration to the default configuration. If the values differ then the 
warning message is displayed.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] mikewalch closed pull request #26: Minor updates to 2.0 docs

2017-09-26 Thread git
mikewalch closed pull request #26: Minor updates to 2.0 docs
URL: https://github.com/apache/accumulo-website/pull/26
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] mikewalch opened a new pull request #26: Minor updates to 2.0 docs

2017-09-25 Thread git
mikewalch opened a new pull request #26: Minor updates to 2.0 docs
URL: https://github.com/apache/accumulo-website/pull/26
 
 
   * /docs/2.0/ redirects to getting-started/design
   * Updated layout
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] milleruntime closed pull request #299: ACCUMULO-4702 Removed Beta Guava dependencies

2017-09-25 Thread git
milleruntime closed pull request #299: ACCUMULO-4702 Removed Beta Guava 
dependencies
URL: https://github.com/apache/accumulo/pull/299
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] milleruntime commented on issue #299: ACCUMULO-4702 Removed Beta Guava dependencies

2017-09-25 Thread git
milleruntime commented on issue #299: ACCUMULO-4702 Removed Beta Guava 
dependencies
URL: https://github.com/apache/accumulo/pull/299#issuecomment-331901805
 
 
   If there are no objections for this solution to 1.7 I will merge this today. 
 There will need to be follow on work for 1.8 and master.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile block size to 2GB

2017-09-22 Thread git
ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile 
block size to 2GB
URL: https://github.com/apache/accumulo/pull/300#discussion_r140602218
 
 

 ##
 File path: 
core/src/test/java/org/apache/accumulo/core/file/rfile/RFileTest.java
 ##
 @@ -2271,4 +2273,40 @@ public void testRootTabletEncryption() throws Exception 
{
 
 conf = null;
   }
+
+  @Rule
+  public ExpectedException exception = ExpectedException.none();
+
+  @Test
+  public void testKeyTooLarge() throws Exception {
+// If we don't have enough memory (6G) skip these very memory-intensive 
unit tests
+Assume.assumeTrue(Runtime.getRuntime().maxMemory() >= 5726797824L);
 
 Review comment:
   It's not clear where these numeric literals come from. Perhaps it can be put 
in a variable, and their computation shown, so it's clear where they come from? 
If they are arbitrary, then a comment explaining that (eg. "arbitrary size in 
the range of ... ") would be helpful.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile block size to 2GB

2017-09-22 Thread git
ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile 
block size to 2GB
URL: https://github.com/apache/accumulo/pull/300#discussion_r140601051
 
 

 ##
 File path: pom.xml
 ##
 @@ -1548,6 +1548,7 @@
 true
 true
 true
+-Xmx1G
 
 Review comment:
   I think this was intended to be placed in the mail properties section, not 
the one inside this profile. A simple mistake. :)
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile block size to 2GB

2017-09-22 Thread git
ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile 
block size to 2GB
URL: https://github.com/apache/accumulo/pull/300#discussion_r140601051
 
 

 ##
 File path: pom.xml
 ##
 @@ -1548,6 +1548,7 @@
 true
 true
 true
+-Xmx1G
 
 Review comment:
   I think this was intended to be placed in the main properties section, not 
the one inside this profile. A simple mistake. :)
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] mikewalch closed pull request #25: Simplified documentation

2017-09-22 Thread git
mikewalch closed pull request #25: Simplified documentation
URL: https://github.com/apache/accumulo-website/pull/25
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] jmark99 opened a new pull request #301: Accumulo 2907 Invalidate "this may not be applicable for your security setup" warning from initialize

2017-09-21 Thread git
jmark99 opened a new pull request #301: Accumulo 2907 Invalidate "this may not 
be applicable for your security setup" warning from initialize
URL: https://github.com/apache/accumulo/pull/301
 
 
   ACCUMULO-2907 Invalidate "this may not be applicable for your security 
setup" warning from initialize
   
   Updated console message in the getRootPassword method to remove unnecessary 
warning about the users security setup.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile block size to 2GB

2017-09-21 Thread git
ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile 
block size to 2GB
URL: https://github.com/apache/accumulo/pull/300#discussion_r140381442
 
 

 ##
 File path: 
core/src/test/java/org/apache/accumulo/core/conf/DefaultConfigurationTest.java
 ##
 @@ -51,4 +51,11 @@ public void testGetProperties() {
   public void testSanityCheck() {
 ConfigSanityCheck.validate(c);
   }
+
+  @Test
+  public void testDefaultBlockSizeIsAllowed() {
+long bsi = c.getAsBytes(Property.TABLE_FILE_COMPRESSED_BLOCK_SIZE_INDEX);
+assert (bsi > 0);
+assert (bsi < Integer.MAX_VALUE);
 
 Review comment:
   Probably best to avoid assert keyword statements, because assertions can be 
disabled in the JVM. Best to use JUnit provided APIs for assertions: 
`Assert.assertTrue( expr )`
   
   Also, might be good to add a small one-line comment explaining what we're 
checking for. As I understand it, this is mostly just a sanity check on our 
default.
   
   We can check more generally that any value in the config is suitable by 
putting this check in:
   
   
https://github.com/apache/accumulo/blob/master/core/src/main/java/org/apache/accumulo/core/conf/ConfigSanityCheck.java#L46
 in the `validate()` method. This gets executed whenever we read various 
configurations. We could add a check to ensure that whatever the user has 
configured, whichever configuration they've configured it in, that it has a 
reasonable value. This is very similar to how we're already checking that the 
INSTANCE_ZK_TIMEOUT value is within an allowable range.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile block size to 2GB

2017-09-21 Thread git
ctubbsii commented on a change in pull request #300: ACCUMULO-4708 Limit RFile 
block size to 2GB
URL: https://github.com/apache/accumulo/pull/300#discussion_r140379315
 
 

 ##
 File path: 
core/src/main/java/org/apache/accumulo/core/file/rfile/bcfile/BCFile.java
 ##
 @@ -287,8 +287,13 @@ public void finish() throws IOException {
*/
   public long getRawSize() throws IOException {
 /**
- * Expecting the size() of a block not exceeding 4GB. Assuming the 
size() will wrap to negative integer if it exceeds 2GB.
+ * size() comes from DataOutputStream, which returns Integer.MAX_VALUE 
on an overflow, which means we do not know if 2GB or more data has been written.
+ * Because the data is of an unknown length, we cannot know the block 
size. To avoid corrupt RFiles, we throw an exception. This should be addressed 
by
+ * whatever object is putting data into the stream to ensure this 
condition is never reached.
  */
+if (size() == Integer.MAX_VALUE) {
 
 Review comment:
   Just to be safe, could also do `>=` in case implementation changes. We know 
it should never be larger than max int, but if implementation starts returning 
some larger value (for whatever reason), we don't want to let that through 
either.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


  1   2   3   4   5   6   7   8   9   10   >