[jira] [Commented] (SPARK-18021) Refactor file name specification for data sources

2016-10-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590874#comment-15590874 ] Reynold Xin commented on SPARK-18021: - cc [~tejas.patil] fyi > Refactor file name specification for

[jira] [Created] (SPARK-18021) Refactor file name specification for data sources

2016-10-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18021: --- Summary: Refactor file name specification for data sources Key: SPARK-18021 URL: https://issues.apache.org/jira/browse/SPARK-18021 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18012) Simplify WriterContainer code

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590862#comment-15590862 ] Apache Spark commented on SPARK-18012: -- User 'rxin' has created a pull request for this issue:

[jira] [Resolved] (SPARK-18012) Simplify WriterContainer code

2016-10-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-18012. Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15551

[jira] [Commented] (SPARK-18013) R cross join API similar to python and Scala

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590802#comment-15590802 ] Apache Spark commented on SPARK-18013: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-18013) R cross join API similar to python and Scala

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18013: Assignee: Apache Spark > R cross join API similar to python and Scala >

[jira] [Assigned] (SPARK-18013) R cross join API similar to python and Scala

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18013: Assignee: (was: Apache Spark) > R cross join API similar to python and Scala >

[jira] [Commented] (SPARK-17998) Reading Parquet files coalesces parts into too few in-memory partitions

2016-10-19 Thread Shea Parkes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590680#comment-15590680 ] Shea Parkes commented on SPARK-17998: - That definitely answers it. I would say the default of 128MB

[jira] [Closed] (SPARK-17998) Reading Parquet files coalesces parts into too few in-memory partitions

2016-10-19 Thread Shea Parkes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shea Parkes closed SPARK-17998. --- Resolution: Information Provided > Reading Parquet files coalesces parts into too few in-memory

[jira] [Commented] (SPARK-17998) Reading Parquet files coalesces parts into too few in-memory partitions

2016-10-19 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590675#comment-15590675 ] Liwei Lin commented on SPARK-17998: --- Hi [~shea.parkes], for your case, the number is determined at

[jira] [Commented] (SPARK-17357) Simplified predicates can't be pushed down through operators because of the rule order in Optimizer

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590649#comment-15590649 ] Apache Spark commented on SPARK-17357: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-6624) Convert filters into CNF for data sources

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590650#comment-15590650 ] Apache Spark commented on SPARK-6624: - User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-11301) filter on partitioned column is case sensitive even the context is case insensitive

2016-10-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590647#comment-15590647 ] Dongjoon Hyun commented on SPARK-11301: --- Thank you! > filter on partitioned column is case

[jira] [Updated] (SPARK-17755) Master may ask a worker to launch an executor before the worker actually got the response of registration

2016-10-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-17755: - Summary: Master may ask a worker to launch an executor before the worker actually got the response of

[jira] [Commented] (SPARK-11301) filter on partitioned column is case sensitive even the context is case insensitive

2016-10-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590580#comment-15590580 ] Wenchen Fan commented on SPARK-11301: - thanks, I changed fixed version to 1.6.3 > filter on

[jira] [Updated] (SPARK-11301) filter on partitioned column is case sensitive even the context is case insensitive

2016-10-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-11301: Fix Version/s: (was: 1.6.2) 1.6.3 > filter on partitioned column is case

[jira] [Resolved] (SPARK-17989) Check ascendingOrder type in sort_array function ahead

2016-10-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17989. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.1.0

[jira] [Updated] (SPARK-18013) R cross join API similar to python and Scala

2016-10-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18013: Target Version/s: 2.1.0 > R cross join API similar to python and Scala >

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-19 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590521#comment-15590521 ] Liwei Lin commented on SPARK-16845: --- Hi [~dondrake], the latest commit

[jira] [Assigned] (SPARK-14393) monotonicallyIncreasingId not monotonically increasing with downstream coalesce

2016-10-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-14393: - Assignee: Xiangrui Meng > monotonicallyIncreasingId not monotonically increasing with

[jira] [Updated] (SPARK-18020) Kinesis receiver does not snapshot when shard completes

2016-10-19 Thread Yonathan Randolph (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonathan Randolph updated SPARK-18020: -- Description: When a kinesis shard is split or combined and the old shard ends, the

[jira] [Updated] (SPARK-18020) Kinesis receiver does not snapshot when shard completes

2016-10-19 Thread Yonathan Randolph (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonathan Randolph updated SPARK-18020: -- Description: When a kinesis shard is split or combined and the old shard ends, the

[jira] [Created] (SPARK-18020) Kinesis receiver does not snapshot when shard completes

2016-10-19 Thread Yonathan Randolph (JIRA)
Yonathan Randolph created SPARK-18020: - Summary: Kinesis receiver does not snapshot when shard completes Key: SPARK-18020 URL: https://issues.apache.org/jira/browse/SPARK-18020 Project: Spark

[jira] [Created] (SPARK-18019) Log instrumentation in GBTs

2016-10-19 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18019: Summary: Log instrumentation in GBTs Key: SPARK-18019 URL: https://issues.apache.org/jira/browse/SPARK-18019 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-19 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590256#comment-15590256 ] Don Drake commented on SPARK-16845: --- [~lwlin] I saw your PR, but noticed it's failing some tests. Just

[jira] [Updated] (SPARK-13135) Don't print expressions recursively in generated code

2016-10-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-13135: -- Fix Version/s: 2.0.0 > Don't print expressions recursively in generated code >

[jira] [Updated] (SPARK-14350) explain output should be in a single cell rather than one line per cell

2016-10-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-14350: -- Fix Version/s: 2.0.0 > explain output should be in a single cell rather than one line per cell

[jira] [Commented] (SPARK-18018) Specify alternate escape character in 'LIKE' expression

2016-10-19 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590136#comment-15590136 ] Jakob Odersky commented on SPARK-18018: --- I've started a very early prototype

[jira] [Created] (SPARK-18018) Specify alternate escape character in 'LIKE' expression

2016-10-19 Thread Jakob Odersky (JIRA)
Jakob Odersky created SPARK-18018: - Summary: Specify alternate escape character in 'LIKE' expression Key: SPARK-18018 URL: https://issues.apache.org/jira/browse/SPARK-18018 Project: Spark

[jira] [Commented] (SPARK-15777) Catalog federation

2016-10-19 Thread Yan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590055#comment-15590055 ] Yan commented on SPARK-15777: - There is a paragraph in the design doc about the ordering of rule application

[jira] [Commented] (SPARK-17131) Code generation fails when running SQL expressions against a wide dataset (thousands of columns)

2016-10-19 Thread Aleksander Eskilson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590024#comment-15590024 ] Aleksander Eskilson commented on SPARK-17131: - [~sowen], [~melentye] I'm not so certain this

[jira] [Created] (SPARK-18017) Changing Hadoop parameter through sparkSession.sparkContext.hadoopConfiguration doesn't work

2016-10-19 Thread Yuehua Zhang (JIRA)
Yuehua Zhang created SPARK-18017: Summary: Changing Hadoop parameter through sparkSession.sparkContext.hadoopConfiguration doesn't work Key: SPARK-18017 URL: https://issues.apache.org/jira/browse/SPARK-18017

[jira] [Updated] (SPARK-18011) SparkR serialize "NA" throws exception

2016-10-19 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hossein Falaki updated SPARK-18011: --- Description: For some versions of R, if Date has "NA" field, backend will throw negative

[jira] [Assigned] (SPARK-17944) sbin/start-* scripts use of `hostname -f` fail with Solaris

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17944: Assignee: Apache Spark > sbin/start-* scripts use of `hostname -f` fail with Solaris >

[jira] [Assigned] (SPARK-17944) sbin/start-* scripts use of `hostname -f` fail with Solaris

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17944: Assignee: (was: Apache Spark) > sbin/start-* scripts use of `hostname -f` fail with

[jira] [Commented] (SPARK-17944) sbin/start-* scripts use of `hostname -f` fail with Solaris

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590009#comment-15590009 ] Apache Spark commented on SPARK-17944: -- User 'JnyJny' has created a pull request for this issue:

[jira] [Created] (SPARK-18016) Code Generation Fails When Encoding Large Object to Wide Dataset

2016-10-19 Thread Aleksander Eskilson (JIRA)
Aleksander Eskilson created SPARK-18016: --- Summary: Code Generation Fails When Encoding Large Object to Wide Dataset Key: SPARK-18016 URL: https://issues.apache.org/jira/browse/SPARK-18016

[jira] [Updated] (SPARK-17949) Introduce a JVM object based aggregate operator

2016-10-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-17949: --- Attachment: [Design Doc] Support for Arbitrary Aggregation States.pdf > Introduce a JVM object based

[jira] [Updated] (SPARK-17949) Introduce a JVM object based aggregate operator

2016-10-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-17949: --- Attachment: (was: [Design Doc] Support for Arbitrary Aggregation States.pdf) > Introduce a JVM

[jira] [Updated] (SPARK-17949) Introduce a JVM object based aggregate operator

2016-10-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-17949: --- Attachment: [Design Doc] Support for Arbitrary Aggregation States.pdf > Introduce a JVM object based

[jira] [Commented] (SPARK-18015) CLONE - ClassCastException in instance of org.apache.spark.rdd.MapPartitionsRDD

2016-10-19 Thread Nick Orka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589920#comment-15589920 ] Nick Orka commented on SPARK-18015: --- I've just found that the exception may not reflect real error in

[jira] [Updated] (SPARK-18015) CLONE - ClassCastException in instance of org.apache.spark.rdd.MapPartitionsRDD

2016-10-19 Thread Nick Orka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Orka updated SPARK-18015: -- Description: I've decided to clone the ticket because it had the same problem for anothe spark

[jira] [Commented] (SPARK-18010) Remove unneeded heavy work performed by FsHistoryProvider for building up the application listing UI page

2016-10-19 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589887#comment-15589887 ] Alex Bozarth commented on SPARK-18010: -- [~srowen] [~tgraves] This seems to "fix" the long ongoing

[jira] [Commented] (SPARK-11301) filter on partitioned column is case sensitive even the context is case insensitive

2016-10-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589866#comment-15589866 ] Dongjoon Hyun commented on SPARK-11301: --- Hi, [~cloud_fan]. The last commit is included 1.6.3. I'm

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2016-10-19 Thread nirav patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589825#comment-15589825 ] nirav patel commented on SPARK-4105: I hit this error as well but I also noticed lot of

[jira] [Updated] (SPARK-18015) CLONE - ClassCastException in instance of org.apache.spark.rdd.MapPartitionsRDD

2016-10-19 Thread Nick Orka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Orka updated SPARK-18015: -- Description: I've decided to clone the ticket because it had the same problem for anothe spark

[jira] [Updated] (SPARK-18015) CLONE - ClassCastException in instance of org.apache.spark.rdd.MapPartitionsRDD

2016-10-19 Thread Nick Orka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Orka updated SPARK-18015: -- Description: I've decided to clone the ticket because it had the same problem for anothe spark

[jira] [Updated] (SPARK-18015) CLONE - ClassCastException in instance of org.apache.spark.rdd.MapPartitionsRDD

2016-10-19 Thread Nick Orka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Orka updated SPARK-18015: -- Affects Version/s: (was: 1.4.1) > CLONE - ClassCastException in instance of >

[jira] [Updated] (SPARK-18015) CLONE - ClassCastException in instance of org.apache.spark.rdd.MapPartitionsRDD

2016-10-19 Thread Nick Orka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Orka updated SPARK-18015: -- Affects Version/s: 2.0.0 > CLONE - ClassCastException in instance of >

[jira] [Created] (SPARK-18015) CLONE - ClassCastException in instance of org.apache.spark.rdd.MapPartitionsRDD

2016-10-19 Thread Nick Orka (JIRA)
Nick Orka created SPARK-18015: - Summary: CLONE - ClassCastException in instance of org.apache.spark.rdd.MapPartitionsRDD Key: SPARK-18015 URL: https://issues.apache.org/jira/browse/SPARK-18015 Project:

[jira] [Commented] (SPARK-9219) ClassCastException in instance of org.apache.spark.rdd.MapPartitionsRDD

2016-10-19 Thread Nick Orka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589732#comment-15589732 ] Nick Orka commented on SPARK-9219: -- I have the same issue with Spark 2.0.0

[jira] [Commented] (SPARK-17463) Serialization of accumulators in heartbeats is not thread-safe

2016-10-19 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589729#comment-15589729 ] Shixiong Zhu commented on SPARK-17463: -- I could not figure out the cause. Is it easy to reproduce in

[jira] [Resolved] (SPARK-10541) Allow ApplicationHistoryProviders to provide their own text when there aren't any complete apps

2016-10-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-10541. Resolution: Fixed Assignee: Alex Bozarth Fix Version/s: 2.1.0 > Allow

[jira] [Commented] (SPARK-15777) Catalog federation

2016-10-19 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589723#comment-15589723 ] Nattavut Sutyanyong commented on SPARK-15777: - It is not clear to me how we will apply

[jira] [Commented] (SPARK-17995) Use new attributes for columns from outer joins

2016-10-19 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589692#comment-15589692 ] Ryan Blue commented on SPARK-17995: --- I'm not sure how that would work. Here's an example. Say I have

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Created] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
Michael Patterson created SPARK-18014: - Summary: Filters are incorrectly being grouped together when there is processing in between Key: SPARK-18014 URL: https://issues.apache.org/jira/browse/SPARK-18014

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between

2016-10-19 Thread Michael Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Patterson updated SPARK-18014: -- Description: I created a dataframe that needed to filter the data on columnA, create a

[jira] [Commented] (SPARK-17630) jvm-exit-on-fatal-error handler for spark.rpc.netty like there is available for akka

2016-10-19 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589463#comment-15589463 ] Shixiong Zhu commented on SPARK-17630: -- Just set up SparkUncaughtExceptionHandler as the default

[jira] [Created] (SPARK-18013) R cross join API similar to python and Scala

2016-10-19 Thread Srinath (JIRA)
Srinath created SPARK-18013: --- Summary: R cross join API similar to python and Scala Key: SPARK-18013 URL: https://issues.apache.org/jira/browse/SPARK-18013 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-18012) Simplify WriterContainer code

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18012: Assignee: Reynold Xin (was: Apache Spark) > Simplify WriterContainer code >

[jira] [Commented] (SPARK-18012) Simplify WriterContainer code

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589397#comment-15589397 ] Apache Spark commented on SPARK-18012: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18012) Simplify WriterContainer code

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18012: Assignee: Apache Spark (was: Reynold Xin) > Simplify WriterContainer code >

[jira] [Created] (SPARK-18012) Simplify WriterContainer code

2016-10-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18012: --- Summary: Simplify WriterContainer code Key: SPARK-18012 URL: https://issues.apache.org/jira/browse/SPARK-18012 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-18011) SparkR serialize "NA" throws exception

2016-10-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589331#comment-15589331 ] Miao Wang commented on SPARK-18011: --- We have detailed discussions on PR

[jira] [Created] (SPARK-18011) SparkR serialize "NA" throws exception

2016-10-19 Thread Miao Wang (JIRA)
Miao Wang created SPARK-18011: - Summary: SparkR serialize "NA" throws exception Key: SPARK-18011 URL: https://issues.apache.org/jira/browse/SPARK-18011 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-17219) QuantileDiscretizer should handle NaN values gracefully

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17219: Assignee: Apache Spark (was: Vincent) > QuantileDiscretizer should handle NaN values

[jira] [Assigned] (SPARK-17219) QuantileDiscretizer should handle NaN values gracefully

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17219: Assignee: Vincent (was: Apache Spark) > QuantileDiscretizer should handle NaN values

[jira] [Updated] (SPARK-18003) RDD zipWithIndex generate wrong result when one partition contains more than 2147483647 records.

2016-10-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-18003: --- Labels: correctness (was: ) > RDD zipWithIndex generate wrong result when one partition contains

[jira] [Commented] (SPARK-18010) Remove unneeded heavy work performed by FsHistoryProvider for building up the application listing UI page

2016-10-19 Thread Vinayak Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588976#comment-15588976 ] Vinayak Joshi commented on SPARK-18010: --- pinging [~ajbozarth] > Remove unneeded heavy work

[jira] [Commented] (SPARK-18010) Remove unneeded heavy work performed by FsHistoryProvider for building up the application listing UI page

2016-10-19 Thread Vinayak Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588973#comment-15588973 ] Vinayak Joshi commented on SPARK-18010: --- Created pull request:

[jira] [Assigned] (SPARK-18010) Remove unneeded heavy work performed by FsHistoryProvider for building up the application listing UI page

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18010: Assignee: Apache Spark > Remove unneeded heavy work performed by FsHistoryProvider for

[jira] [Commented] (SPARK-18010) Remove unneeded heavy work performed by FsHistoryProvider for building up the application listing UI page

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588960#comment-15588960 ] Apache Spark commented on SPARK-18010: -- User 'vijoshi' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18010) Remove unneeded heavy work performed by FsHistoryProvider for building up the application listing UI page

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18010: Assignee: (was: Apache Spark) > Remove unneeded heavy work performed by

[jira] [Created] (SPARK-18010) Remove unneeded heavy work performed by FsHistoryProvider for building up the application listing UI page

2016-10-19 Thread Vinayak Joshi (JIRA)
Vinayak Joshi created SPARK-18010: - Summary: Remove unneeded heavy work performed by FsHistoryProvider for building up the application listing UI page Key: SPARK-18010 URL:

[jira] [Created] (SPARK-18009) Spark 2.0.1 SQL Thrift Error

2016-10-19 Thread Jerryjung (JIRA)
Jerryjung created SPARK-18009: - Summary: Spark 2.0.1 SQL Thrift Error Key: SPARK-18009 URL: https://issues.apache.org/jira/browse/SPARK-18009 Project: Spark Issue Type: Bug Components:

[jira] [Updated] (SPARK-18008) Support skipping test compilation

2016-10-19 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan updated SPARK-18008: Description: Add support for skipping compilation of test code through

[jira] [Updated] (SPARK-18008) Support skipping test compilation

2016-10-19 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan updated SPARK-18008: Summary: Support skipping test compilation (was: Support test compilation and

[jira] [Commented] (SPARK-15687) Columnar execution engine

2016-10-19 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588641#comment-15588641 ] Kazuaki Ishizaki commented on SPARK-15687: -- [#15219|https://github.com/apache/spark/pull/15219]

[jira] [Updated] (SPARK-18005) optional binary CertificateChains (UTF8) is not a group while loading a Dataframe

2016-10-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18005: -- Target Version/s: (was: 2.0.0) > optional binary CertificateChains (UTF8) is not a group while

[jira] [Updated] (SPARK-16078) from_utc_timestamp/to_utc_timestamp may give different result in different timezone

2016-10-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16078: -- Fix Version/s: 2.0.0 > from_utc_timestamp/to_utc_timestamp may give different result in different >

[jira] [Commented] (SPARK-16078) from_utc_timestamp/to_utc_timestamp may give different result in different timezone

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588594#comment-15588594 ] Apache Spark commented on SPARK-16078: -- User 'srowen' has created a pull request for this issue:

[jira] [Commented] (SPARK-18008) Support test compilation and skipping javadoc generation

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588466#comment-15588466 ] Apache Spark commented on SPARK-18008: -- User 'mridulm' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18008) Support test compilation and skipping javadoc generation

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18008: Assignee: Apache Spark (was: Mridul Muralidharan) > Support test compilation and

[jira] [Assigned] (SPARK-18008) Support test compilation and skipping javadoc generation

2016-10-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18008: Assignee: Mridul Muralidharan (was: Apache Spark) > Support test compilation and

[jira] [Commented] (SPARK-18008) Support test compilation and skipping javadoc generation

2016-10-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588441#comment-15588441 ] Sean Owen commented on SPARK-18008: --- You can just pass those as flags to the build already. I use

[jira] [Created] (SPARK-18008) Support test compilation and skipping javadoc generation

2016-10-19 Thread Mridul Muralidharan (JIRA)
Mridul Muralidharan created SPARK-18008: --- Summary: Support test compilation and skipping javadoc generation Key: SPARK-18008 URL: https://issues.apache.org/jira/browse/SPARK-18008 Project: Spark

[jira] [Updated] (SPARK-17645) Add feature selector methods based on: False Discovery Rate (FDR) and Family Wise Error rate (FWE)

2016-10-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17645: Shepherd: Yanbo Liang Assignee: Peng Meng > Add feature selector methods based on: False

[jira] [Commented] (SPARK-18006) When union, spark SQL didn't complain about schema mismatch

2016-10-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588277#comment-15588277 ] Sean Owen commented on SPARK-18006: --- See also https://issues.apache.org/jira/browse/SPARK-9813 among

[jira] [Commented] (SPARK-18006) When union, spark SQL didn't complain about schema mismatch

2016-10-19 Thread Shawn Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588258#comment-15588258 ] Shawn Zhang commented on SPARK-18006: - I see, I will close the issue > When union, spark SQL didn't

[jira] [Closed] (SPARK-18006) When union, spark SQL didn't complain about schema mismatch

2016-10-19 Thread Shawn Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shawn Zhang closed SPARK-18006. --- Resolution: Not A Problem > When union, spark SQL didn't complain about schema mismatch >

[jira] [Commented] (SPARK-17982) Spark 2.0.0 CREATE VIEW statement fails :: java.lang.RuntimeException: Failed to analyze the canonicalized SQL. It is possible there is a bug in Spark.

2016-10-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15588225#comment-15588225 ] Dongjoon Hyun commented on SPARK-17982: --- Never mind! > Spark 2.0.0 CREATE VIEW statement fails ::

  1   2   >