[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155703687 **[Test build #45619 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45619/consoleFull)** for PR 9565 at commit [`c910e6e`](https://github.com/apache/spark/commit/c910e6edf9e0d9b7307c981413602706be2f14de). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11500][SQL] Not deterministic order of ...
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/9517 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11564][SQL][follow-up] clean up java tu...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9567#issuecomment-155716817 **[Test build #45624 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45624/consoleFull)** for PR 9567 at commit [`9fc5456`](https://github.com/apache/spark/commit/9fc5456812482f52b02a2d11b8b14c5bc89534b5). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11651] [ML] LinearRegressionSummary sho...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9626#issuecomment-155718044 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155717828 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155726984 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155727018 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11651] [ML] LinearRegressionSummary sho...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9626#issuecomment-155730382 **[Test build #45625 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45625/consoleFull)** for PR 9626 at commit [`90d2d7a`](https://github.com/apache/spark/commit/90d2d7aaea54ecd6c5eee0f8df125ba976682964). * This patch passes all tests. * This patch merges cleanly. * This patch adds the following public classes _(experimental)_:\n * `sealed abstract class State[S] `\n * `sealed abstract class StateSpec[KeyType, ValueType, StateType, EmittedType] extends Serializable `\n * `case class StateSpecImpl[K, V, S, T](`\n * `sealed abstract class TrackStateDStream[KeyType, ValueType, StateType, EmittedType: ClassTag](`\n * `class InternalTrackStateDStream[K: ClassTag, V: ClassTag, S: ClassTag, E: ClassTag](`\n * ` case class StateInfo[S](`\n * ` class LimitMarker(val num: Int) extends Serializable`\n --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155738406 **[Test build #45627 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45627/consoleFull)** for PR 9347 at commit [`1636f72`](https://github.com/apache/spark/commit/1636f72d3630697e0f6e0f93275e809673fa8962). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds the following public classes _(experimental)_:\n * `case class ToUnixTimestamp(timeExp: Expression, format: Expression)`\n * `abstract class UnixTime(timeExp: Expression, format: Expression)`\n --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155738456 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45627/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11564][SQL][follow-up] clean up java tu...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9567#issuecomment-155740707 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45624/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user davies commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155703086 Please fix the bug for SparkR ``` java.io.InvalidClassException: org.apache.spark.sql.catalyst.expressions.UnixTimestamp; no valid constructor at java.io.ObjectStreamClass$ExceptionInfo.newInvalidClassException(ObjectStreamClass.java:150) at java.io.ObjectStreamClass.checkDeserialize(ObjectStreamClass.java:768) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1772) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370) at scala.collection.immutable.$colon$colon.readObject(List.scala:362) at sun.reflect.GeneratedMethodAccessor5.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350) at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370) at org.apache.spark.serializer.JavaDeserializationStream.readObject(JavaSerializer.scala:76) at org.apache.spark.serializer.JavaSerializerInstance.deserialize(JavaSerializer.scala:115) at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61) at org.apache.spark.scheduler.Task.run(Task.scala:88) at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:214) at
[GitHub] spark pull request: [HOTFIX][SPARK-10192] Fix NPE in test that was...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9620#issuecomment-155703205 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [HOTFIX][SPARK-10192] Fix NPE in test that was...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9620#issuecomment-155703209 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45605/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9866][SQL] Speed up VersionsSuite by us...
GitHub user JoshRosen opened a pull request: https://github.com/apache/spark/pull/9624 [SPARK-9866][SQL] Speed up VersionsSuite by using standard Ivy cache This patch attempts to speed up VersionsSuite by storing fetched Hive JARs in the standard Ivy cache instead of copying them to a temporary directory. The only concern here is stability; in #7026, @vanzin mentioned that Ivy could become confused by existing caches. I'm curious to know whether this is still a problem; if not, this might be a cheap way to save a few minutes of build time. You can merge this pull request into a Git repository by running: $ git pull https://github.com/JoshRosen/spark SPARK-9866 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/9624.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #9624 commit 6172d7abc39da5cf9db2ebcc19cd219d776acf5a Author: Josh RosenDate: 2015-11-11T08:34:24Z [SPARK-9866][SQL] Speed up VersionsSuite by using standard Ivy cache. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11646] WholeTextFileRDD should return T...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9622#issuecomment-155704722 **[Test build #45617 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45617/consoleFull)** for PR 9622 at commit [`4d412dd`](https://github.com/apache/spark/commit/4d412dde4289c6fe55233c79f38d4c3ad5ee6f21). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-6152] Use shaded ASM5 to support closur...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9512#issuecomment-155704822 **[Test build #45598 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45598/consoleFull)** for PR 9512 at commit [`9833667`](https://github.com/apache/spark/commit/9833667c88000d895edbc90439cd2729325d6e4a). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-6328] [Python] Python API for Streaming...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9186#issuecomment-155710041 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-6152] Use shaded ASM5 to support closur...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9512#issuecomment-155712847 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45609/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-6152] Use shaded ASM5 to support closur...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9512#issuecomment-155712845 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11564][SQL][follow-up] clean up java tu...
Github user cloud-fan commented on the pull request: https://github.com/apache/spark/pull/9567#issuecomment-155715175 retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11564][SQL][follow-up] clean up java tu...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9567#issuecomment-155716389 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11651] [ML] LinearRegressionSummary sho...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9626#issuecomment-155718026 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11044][SQL] Parquet writer version fixe...
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/9060#issuecomment-155718158 @HyukjinKwon Oh yeah, sorry. Finally got sometime to clean my review queue :) I wonder is there an easy way to add a test case for this? At first I thought `WriterVersion` corresponds to the the `version` field of the Thrift struct `FileMetaData` described in [parquet-format] [1], but it's not. I only found that when `WriterVersion` is set to v2, the Thrift field `PageHeader.type` is set to `DATA_PAGE_V2`. [1]: https://github.com/apache/parquet-format#metadata --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11044][SQL] Parquet writer version fixe...
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/9060#issuecomment-155718167 ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11044][SQL] Parquet writer version fixe...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9060#issuecomment-155718924 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11044][SQL] Parquet writer version fixe...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9060#issuecomment-155718954 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11102] [SQL] Uninformative exception wh...
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/9490#discussion_r44516927 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/sources/interfaces.scala --- @@ -604,10 +609,33 @@ abstract class HadoopFsRelation private[sql](maybePartitionSpec: Option[Partitio } } -buildInternalScan(requiredColumns, filters, inputStatuses, broadcastedConf) +if (!inputExists) { + throw new IOException("Input paths do not exist, input paths=" ++ inputPaths.mkString("[", ",", "]")) +} else { + if (inputStatuses.isEmpty && readFromHDFS) { +logWarning("Input paths are empty, input paths=" + inputPaths.mkString("[", ",", "]")) +sqlContext.sparkContext.emptyRDD[InternalRow] + } else { +buildInternalScan(requiredColumns, filters, inputStatuses, broadcastedConf) + } +} } /** + * Most of time, HadoopFsRelation should check the inputPaths, but for some cases it is not, + * e.g. JsonRelation may read from RDD[String] + */ + def inputExists: Boolean = fileStatusCache.inputExists + + /** + * Most of time, HadoopFsRelation should read from hdfs, but some cases it is not, + * e.g. JsonRelation may read from RDD[String] + * @return + */ + def readFromHDFS: Boolean = true --- End diff -- Is there any way to fix this issue without adding any public interface methods? Especially, it's a little bit weird that a `HadoopFsRelation` doesn't `readFromHDFS`. Can we special case `JSONRelation` without affecting existing public APIs? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9866][SQL] Speed up VersionsSuite by us...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9624#issuecomment-155737574 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45620/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9866][SQL] Speed up VersionsSuite by us...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9624#issuecomment-155737570 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155739429 retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11645][SQL] Remove OpenHashSet for the ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9621#issuecomment-155710441 **[Test build #2038 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2038/consoleFull)** for PR 9621 at commit [`bfdc937`](https://github.com/apache/spark/commit/bfdc9375d62c2d05d0d360afd643730a54b97d8f). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11594][SQL][REPL] Cannot create UDAF in...
Github user hvanhovell closed the pull request at: https://github.com/apache/spark/pull/9568 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [Build][Minor] Remove non-exist yarnStable mod...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9625#issuecomment-155712178 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9552] Add force control for killExecuto...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7888#issuecomment-155713875 **[Test build #45613 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45613/consoleFull)** for PR 7888 at commit [`01c236a`](https://github.com/apache/spark/commit/01c236ad3cb435c8b63f8be59c3f5d099b797cf3). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9552] Add force control for killExecuto...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7888#issuecomment-155713956 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9552] Add force control for killExecuto...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7888#issuecomment-155713960 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45613/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11625][SQL] add java test for typed agg...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9591#issuecomment-155716403 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11564][SQL][follow-up] clean up java tu...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9567#issuecomment-155716404 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11625][SQL] add java test for typed agg...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9591#issuecomment-155716387 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11651] [ML] LinearRegressionSummary sho...
GitHub user yanboliang opened a pull request: https://github.com/apache/spark/pull/9626 [SPARK-11651] [ML] LinearRegressionSummary should support get residuals by type LinearRegressionSummary should support get residuals by type. You can merge this pull request into a Git repository by running: $ git pull https://github.com/yanboliang/spark spark-11651 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/9626.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #9626 commit 90d2d7aaea54ecd6c5eee0f8df125ba976682964 Author: Yanbo LiangDate: 2015-11-11T09:39:13Z LinearRegressionSummary should support get residuals by type --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11651] [ML] LinearRegressionSummary sho...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9626#issuecomment-155718700 **[Test build #45625 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45625/consoleFull)** for PR 9626 at commit [`90d2d7a`](https://github.com/apache/spark/commit/90d2d7aaea54ecd6c5eee0f8df125ba976682964). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11044][SQL] Parquet writer version fixe...
Github user HyukjinKwon commented on the pull request: https://github.com/apache/spark/pull/9060#issuecomment-155720490 I will try to find and test them first tommorow before adding a commit! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11259] [ML] Params.validateParams() sho...
Github user yanboliang commented on the pull request: https://github.com/apache/spark/pull/9224#issuecomment-155721642 @jkbradley --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-8029] Robust shuffle writer
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9610#issuecomment-155706511 **[Test build #45618 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45618/consoleFull)** for PR 9610 at commit [`6deccff`](https://github.com/apache/spark/commit/6deccff9d322b92538a470329581c8abeb8f7e6a). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11500][SQL] Not deterministic order of ...
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/9517#issuecomment-155707374 Also backported to branch-1.6. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11651] [ML] LinearRegressionSummary sho...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9626#issuecomment-155730500 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11651] [ML] LinearRegressionSummary sho...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9626#issuecomment-155730501 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45625/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11646] WholeTextFileRDD should return T...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9622#issuecomment-155730521 **[Test build #45617 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45617/consoleFull)** for PR 9622 at commit [`4d412dd`](https://github.com/apache/spark/commit/4d412dde4289c6fe55233c79f38d4c3ad5ee6f21). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11646] WholeTextFileRDD should return T...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9622#issuecomment-155730626 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9866][SQL] Speed up VersionsSuite by us...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9624#issuecomment-155737425 **[Test build #45620 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45620/consoleFull)** for PR 9624 at commit [`6172d7a`](https://github.com/apache/spark/commit/6172d7abc39da5cf9db2ebcc19cd219d776acf5a). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155740990 **[Test build #45628 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45628/consoleFull)** for PR 9565 at commit [`1234515`](https://github.com/apache/spark/commit/12345150cff5c02780e0055600b584a0aeaaf441). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11448][SQL] Skip caching part-files in ...
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/9405#discussion_r44523864 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRelation.scala --- @@ -208,7 +208,15 @@ private[sql] class ParquetRelation( // Parquet data source always uses Catalyst internal representations. override val needConversion: Boolean = false - override def sizeInBytes: Long = metadataCache.dataStatuses.map(_.getLen).sum + override def sizeInBytes: Long = +if (shouldMergeSchemas && mergeRespectSummaries) { + // If we are going to merge schema and this relation is configured to + // respect summaries (i.e., skip part-files), we will assume that the size of + // this relation is large and can't be broadcasted. --- End diff -- Right. But compared to set a smaller `sizeInBytes`, a larger one seems better as wrongly broadcasting large relation should be worse? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11617] [network] Fix leak in TransportF...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9619#issuecomment-155703979 **[Test build #45608 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45608/consoleFull)** for PR 9619 at commit [`ed0c1d7`](https://github.com/apache/spark/commit/ed0c1d7e6df3357344574b3ef2dccb526ed3f9b7). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11595] [SQL] [BRANCH-1.5] Fixes ADD JAR...
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/9570#issuecomment-155703857 The failure is because of another bug that has been fixed in master by PR #9277. Will update the test case here and see whether we should backport #9277 to branch-1.5. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9866][SQL] Speed up VersionsSuite by us...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9624#issuecomment-155706724 **[Test build #45620 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45620/consoleFull)** for PR 9624 at commit [`6172d7a`](https://github.com/apache/spark/commit/6172d7abc39da5cf9db2ebcc19cd219d776acf5a). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11500][SQL] Not deterministic order of ...
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/9517#issuecomment-155706699 LGTM, merged to master. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-6328] [Python] Python API for Streaming...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9186#issuecomment-155710043 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45611/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11594][SQL][REPL] Cannot create UDAF in...
Github user hvanhovell commented on the pull request: https://github.com/apache/spark/pull/9568#issuecomment-155710292 Move to scala 2.10.5 fixed this. Closing PR. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [Build][Minor] Remove non-exist yarnStable mod...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9625#issuecomment-155712196 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [Build][Minor] Remove non-exist yarnStable mod...
GitHub user jerryshao opened a pull request: https://github.com/apache/spark/pull/9625 [Build][Minor] Remove non-exist yarnStable module related module in Sbt project You can merge this pull request into a Git repository by running: $ git pull https://github.com/jerryshao/apache-spark remove-old-module Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/9625.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #9625 commit 0d1cc637e0957b7fa4dd7328f08f5150f80a6192 Author: jerryshaoDate: 2015-11-11T09:17:48Z Remove old unexisted module --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [Build][Minor] Remove non-exist yarnStable mod...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9625#issuecomment-155714236 **[Test build #45622 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45622/consoleFull)** for PR 9625 at commit [`0d1cc63`](https://github.com/apache/spark/commit/0d1cc637e0957b7fa4dd7328f08f5150f80a6192). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11647][WIP] Attempt to reduce time/flak...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9623#issuecomment-155729734 **[Test build #45615 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45615/consoleFull)** for PR 9623 at commit [`801afe7`](https://github.com/apache/spark/commit/801afe7d1b0666c3f84c1e030ede72d0dc35300c). * This patch passes all tests. * This patch merges cleanly. * This patch adds the following public classes _(experimental)_:\n * `class CliSuite extends SparkFunSuite with BeforeAndAfterAll with Logging `\n --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155729648 **[Test build #45627 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45627/consoleFull)** for PR 9347 at commit [`1636f72`](https://github.com/apache/spark/commit/1636f72d3630697e0f6e0f93275e809673fa8962). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [Build][Minor] Remove non-exist yarnStable mod...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9625#issuecomment-155744537 **[Test build #45622 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45622/consoleFull)** for PR 9625 at commit [`0d1cc63`](https://github.com/apache/spark/commit/0d1cc637e0957b7fa4dd7328f08f5150f80a6192). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155744119 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155744229 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11102] [SQL] Uninformative exception wh...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9490#issuecomment-155702715 **[Test build #45606 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45606/consoleFull)** for PR 9490 at commit [`219db87`](https://github.com/apache/spark/commit/219db877bc4a599e76ae1a0541db94b387540a05). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11646] WholeTextFileRDD should return T...
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/9622#issuecomment-155702728 Jenkins, retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11102] [SQL] Uninformative exception wh...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9490#issuecomment-155702826 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45606/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9866][SQL] Speed up VersionsSuite by us...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9624#issuecomment-155705102 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-6152] Use shaded ASM5 to support closur...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9512#issuecomment-155704997 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-9866][SQL] Speed up VersionsSuite by us...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9624#issuecomment-155705042 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-6152] Use shaded ASM5 to support closur...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9512#issuecomment-155705002 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45598/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155709206 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11644][SQL] Remove the option to turn o...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9618#issuecomment-155709277 **[Test build #2039 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2039/consoleFull)** for PR 9618 at commit [`dd1fe92`](https://github.com/apache/spark/commit/dd1fe927fe50f397bd6bf90e6015e8e57f06b26a). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155709195 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11647][WIP] Attempt to reduce time/flak...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9623#issuecomment-155729834 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11646] WholeTextFileRDD should return T...
Github user srowen commented on the pull request: https://github.com/apache/spark/pull/9622#issuecomment-155729923 LGTM since it's internal --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11647][WIP] Attempt to reduce time/flak...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9623#issuecomment-155729835 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45615/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user adrian-wang commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155742279 retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11617] [network] Fix leak in TransportF...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9619#issuecomment-155704108 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45608/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155704019 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45619/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155704013 **[Test build #45619 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45619/consoleFull)** for PR 9565 at commit [`c910e6e`](https://github.com/apache/spark/commit/c910e6edf9e0d9b7307c981413602706be2f14de). * This patch **fails Scala style tests**. * This patch merges cleanly. * This patch adds the following public classes _(experimental)_:\n * `sealed abstract class State[S] `\n * `sealed abstract class StateSpec[KeyType, ValueType, StateType, EmittedType] extends Serializable `\n * `case class StateSpecImpl[K, V, S, T](`\n * `sealed abstract class TrackStateDStream[KeyType, ValueType, StateType, EmittedType: ClassTag](`\n * `class InternalTrackStateDStream[K: ClassTag, V: ClassTag, S: ClassTag, E: ClassTag](`\n * ` case class StateInfo[S](`\n * ` class LimitMarker(val num: Int) extends Serializable`\n --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155704015 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11617] [network] Fix leak in TransportF...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9619#issuecomment-155704106 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11102] [SQL] Uninformative exception wh...
Github user zjffdu commented on the pull request: https://github.com/apache/spark/pull/9490#issuecomment-155713852 Comments for the change: The case of empty or non-exist inputs is a little tricky. Here's the several cases I summarize * Only parse the inputs at execution stage. e.g. TextRelation * Need parse the inputs at analysis stage. e.g. JsonRelation, ParquetRelation & OrcRelation * Don't need to parse the inputs if the schema is provided. (when creating table) e.g. ParquetRelation & OrcRelation * Empty is also valid. e.g. JsonRelation can accept RDD[String] rather than from hdfs * Empty inputs is valid for creating table. So for these cases, I do the following changes * Add 2 api in HadoopFsRelation. sub classes can override it. Now only JsonRelation will override it. ** def inputExists: Boolean = fileStatusCache.inputExists ** def readFromHDFS: Boolean = true * If the inputs are only empty directories, it should be valid, just return EmptyRDD * If the inputs are not-existed directories/files, it is invalid, just throw exception. * If it needs to parse data in the analysis stage, it is sub classes' responsibility to check whether inputs is empty or not. Parent class (HadoopFsRelation) only check the inputs at execution stage. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11625][SQL] add java test for typed agg...
Github user cloud-fan commented on the pull request: https://github.com/apache/spark/pull/9591#issuecomment-15571 retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11625][SQL] add java test for typed agg...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9591#issuecomment-155717018 **[Test build #45623 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45623/consoleFull)** for PR 9591 at commit [`ae55976`](https://github.com/apache/spark/commit/ae55976c11d7084606e39f49f0753b701f18f8bb). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155717783 **[Test build #45621 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45621/consoleFull)** for PR 9565 at commit [`1234515`](https://github.com/apache/spark/commit/12345150cff5c02780e0055600b584a0aeaaf441). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155717830 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/45621/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11102] [SQL] Uninformative exception wh...
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/9490#discussion_r44516708 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/sources/interfaces.scala --- @@ -431,7 +436,7 @@ abstract class HadoopFsRelation private[sql](maybePartitionSpec: Option[Partitio val hdfsPath = new Path(path) val fs = hdfsPath.getFileSystem(hadoopConf) val qualified = hdfsPath.makeQualified(fs.getUri, fs.getWorkingDirectory) - + inputExists = inputExists && fs.exists(qualified) --- End diff -- This can be quite expensive since each `fs.exists(qualified)` call invokes `FileSystem.getFileStatus()`, which is an RPC call. On the other hand, we've already called `fs.listStatus(qualified)` below. Would be better to merge these two. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11448][SQL] Skip caching part-files in ...
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/9405#discussion_r44518421 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRelation.scala --- @@ -208,7 +208,15 @@ private[sql] class ParquetRelation( // Parquet data source always uses Catalyst internal representations. override val needConversion: Boolean = false - override def sizeInBytes: Long = metadataCache.dataStatuses.map(_.getLen).sum + override def sizeInBytes: Long = +if (shouldMergeSchemas && mergeRespectSummaries) { + // If we are going to merge schema and this relation is configured to + // respect summaries (i.e., skip part-files), we will assume that the size of + // this relation is large and can't be broadcasted. --- End diff -- This assumption doesn't seem right to me... It's perfectly OK for a small Parquet dataset to require schema merging and to be configured to respect summaries. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155739762 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11593][SQL] Replace catalyst converter ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9565#issuecomment-155739783 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11044][SQL] Parquet writer version fixe...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9060#issuecomment-155719264 **[Test build #45626 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45626/consoleFull)** for PR 9060 at commit [`2eee7e3`](https://github.com/apache/spark/commit/2eee7e37b6f366336cbe19bd9545f07abb13f7db). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11102] [SQL] Uninformative exception wh...
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/9490#discussion_r44517000 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/sources/interfaces.scala --- @@ -431,7 +436,7 @@ abstract class HadoopFsRelation private[sql](maybePartitionSpec: Option[Partitio val hdfsPath = new Path(path) val fs = hdfsPath.getFileSystem(hadoopConf) val qualified = hdfsPath.makeQualified(fs.getUri, fs.getWorkingDirectory) - + inputExists = inputExists && fs.exists(qualified) --- End diff -- Another issue is that this block doesn't handle the case of parallel file listing (the other `if` block above). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11564][SQL][follow-up] clean up java tu...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9567#issuecomment-155740586 **[Test build #45624 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45624/consoleFull)** for PR 9567 at commit [`9fc5456`](https://github.com/apache/spark/commit/9fc5456812482f52b02a2d11b8b14c5bc89534b5). * This patch **fails PySpark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11564][SQL][follow-up] clean up java tu...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9567#issuecomment-155740703 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-11396] [SQL] add native implementation ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9347#issuecomment-155747220 **[Test build #45629 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45629/consoleFull)** for PR 9347 at commit [`1636f72`](https://github.com/apache/spark/commit/1636f72d3630697e0f6e0f93275e809673fa8962). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org