[jira] [Created] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-14006: Summary: Builds of 1.6 branch fail R style check Key: SPARK-14006 URL: https://issues.apache.org/jira/browse/SPARK-14006 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-13999) Run 'group by' before building cube

2016-03-19 Thread lichenglin (JIRA)
lichenglin created SPARK-13999: -- Summary: Run 'group by' before building cube Key: SPARK-13999 URL: https://issues.apache.org/jira/browse/SPARK-13999 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-13972) hive tests should fail if SQL generation failed

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13972: Assignee: Apache Spark > hive tests should fail if SQL generation failed >

[jira] [Updated] (SPARK-13942) Remove Shark-related docs and visibility for 2.x

2016-03-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-13942: -- Description: `Shark` was merged into `Spark SQL` since [July

[jira] [Updated] (SPARK-13691) Scala and Python generate inconsistent results

2016-03-19 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-13691: - Description: Here is an example that Scala and Python generate different results {code} Scala:

[jira] [Commented] (SPARK-13041) Add a driver ui link and a mesos sandbox link on the dispatcher's ui page for each driver

2016-03-19 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201461#comment-15201461 ] Stavros Kontopoulos commented on SPARK-13041: - WIP > Add a driver ui link and a mesos

[jira] [Commented] (SPARK-14005) Make RDD more compatible with Scala's collection

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202762#comment-15202762 ] Sean Owen commented on SPARK-14005: --- That's not the argument: it's the additional API overhead for very

[jira] [Updated] (SPARK-14023) Make exceptions consistent regarding fields and columns

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14023: -- Priority: Trivial (was: Minor) I tend to agree. The first message comes because the underlying

[jira] [Commented] (SPARK-14024) SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder" while saving LinearRegressionModel

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202756#comment-15202756 ] Sean Owen commented on SPARK-14024: --- You can ignore it, it's just a warning from slf4j and has nothing

[jira] [Resolved] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14022. --- Resolution: Invalid Let's start questions on user@. Read

[jira] [Commented] (SPARK-13978) [GSoC 2016] Build monitoring UI and infrastructure for Spark SQL and structured streaming

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200998#comment-15200998 ] Yin Huai commented on SPARK-13978: -- Hi [~Vero], thank you for the interests! Unfortunately, we already

[jira] [Commented] (SPARK-13905) Change signature of as.data.frame() to be consistent with the R base package

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200950#comment-15200950 ] Apache Spark commented on SPARK-13905: -- User 'sun-rui' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14012) Extract VectorizedColumnReader from VectorizedParquetRecordReader

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14012: Assignee: Apache Spark > Extract VectorizedColumnReader from

[jira] [Updated] (SPARK-13764) Parse modes in JSON data source

2016-03-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13764: - Description: Currently, JSON data source just fails to read if some JSON documents are

[jira] [Commented] (SPARK-13989) Remove non-vectorized/unsafe-row parquet record reader

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200527#comment-15200527 ] Apache Spark commented on SPARK-13989: -- User 'sameeragarwal' has created a pull request for this

[jira] [Assigned] (SPARK-13928) Move org.apache.spark.Logging into org.apache.spark.internal.Logging

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13928: Assignee: Apache Spark > Move org.apache.spark.Logging into

[jira] [Assigned] (SPARK-13974) sub-query names do not need to be globally unique while generate SQL

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13974: Assignee: (was: Apache Spark) > sub-query names do not need to be globally unique

[jira] [Commented] (SPARK-14007) Manage the memory for hash map for shuffle hash join

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201874#comment-15201874 ] Apache Spark commented on SPARK-14007: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13874) Move docs of streaming-flume, streaming-mqtt, streaming-zeromq, streaming-akka, streaming-twitter to Spark packages

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13874: Assignee: (was: Apache Spark) > Move docs of streaming-flume, streaming-mqtt,

[jira] [Created] (SPARK-14003) Multi-session can not work when run one session is running "INSERT ... SELECT" move files step

2016-03-19 Thread Weizhong (JIRA)
Weizhong created SPARK-14003: Summary: Multi-session can not work when run one session is running "INSERT ... SELECT" move files step Key: SPARK-14003 URL: https://issues.apache.org/jira/browse/SPARK-14003

[jira] [Assigned] (SPARK-13995) Constraints should take care of Cast

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13995: Assignee: Apache Spark > Constraints should take care of Cast >

[jira] [Created] (SPARK-13998) HashingTF should extend UnaryTransformer

2016-03-19 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-13998: --- Summary: HashingTF should extend UnaryTransformer Key: SPARK-13998 URL: https://issues.apache.org/jira/browse/SPARK-13998 Project: Spark Issue Type:

[jira] [Created] (SPARK-14026) Subquery not brodcasted

2016-03-19 Thread Younes (JIRA)
Younes created SPARK-14026: -- Summary: Subquery not brodcasted Key: SPARK-14026 URL: https://issues.apache.org/jira/browse/SPARK-14026 Project: Spark Issue Type: Bug Components: Optimizer,

[jira] [Assigned] (SPARK-14025) Fix streaming job descriptions on the event line

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14025: Assignee: Apache Spark > Fix streaming job descriptions on the event line >

[jira] [Assigned] (SPARK-14025) Fix streaming job descriptions on the event line

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14025: Assignee: (was: Apache Spark) > Fix streaming job descriptions on the event line >

[jira] [Commented] (SPARK-14025) Fix streaming job descriptions on the event line

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202746#comment-15202746 ] Apache Spark commented on SPARK-14025: -- User 'lw-lin' has created a pull request for this issue:

[jira] [Commented] (SPARK-13629) Add binary toggle Param to CountVectorizer

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202146#comment-15202146 ] Apache Spark commented on SPARK-13629: -- User 'hhbyyh' has created a pull request for this issue:

[jira] [Commented] (SPARK-13843) Move streaming-flume, streaming-mqtt, streaming-zeromq, streaming-akka, streaming-twitter to Spark packages

2016-03-19 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202138#comment-15202138 ] Chris A. Mattmann commented on SPARK-13843: --- Hi - what is being done about concerns from at

[jira] [Created] (SPARK-14025) Fix streaming job descriptions on the event line

2016-03-19 Thread Liwei Lin (JIRA)
Liwei Lin created SPARK-14025: - Summary: Fix streaming job descriptions on the event line Key: SPARK-14025 URL: https://issues.apache.org/jira/browse/SPARK-14025 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-2418) Custom checkpointing with an external function as parameter

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2418. -- Resolution: Duplicate > Custom checkpointing with an external function as parameter >

[jira] [Commented] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197676#comment-15197676 ] Xiangrui Meng commented on SPARK-7992: -- [~jodersky] Do you mind taking a look at this issue? We used

[jira] [Commented] (SPARK-13975) Cannot specify extra libs for executor from /extra-lib

2016-03-19 Thread Leonid Poliakov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199705#comment-15199705 ] Leonid Poliakov commented on SPARK-13975: - Thanks for you quick response, Sean The problem is,

[jira] [Commented] (SPARK-13936) PushPredicateThroughProject using Constraints

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197764#comment-15197764 ] Apache Spark commented on SPARK-13936: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-13845) BlockStatus and StreamBlockId keep on growing result driver OOM

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199194#comment-15199194 ] Apache Spark commented on SPARK-13845: -- User 'jeanlyn' has created a pull request for this issue:

[jira] [Commented] (SPARK-13973) `ipython notebook` is going away...

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199653#comment-15199653 ] Sean Owen commented on SPARK-13973: --- I think {{jupyter}} wouldn't work with IPython 3.x. But you could

[jira] [Created] (SPARK-13969) Extend input format that feature hashing can handle

2016-03-19 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-13969: -- Summary: Extend input format that feature hashing can handle Key: SPARK-13969 URL: https://issues.apache.org/jira/browse/SPARK-13969 Project: Spark

[jira] [Assigned] (SPARK-13988) Large history files block new applications from showing up in History UI.

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13988: Assignee: Apache Spark > Large history files block new applications from showing up in

[jira] [Updated] (SPARK-13954) spar-shell starts with exceptions

2016-03-19 Thread Pranas Baliuka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pranas Baliuka updated SPARK-13954: --- Priority: Minor (was: Major) > spar-shell starts with exceptions >

[jira] [Commented] (SPARK-4038) Outlier Detection Algorithm for MLlib

2016-03-19 Thread Vishal Mehta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202736#comment-15202736 ] Vishal Mehta commented on SPARK-4038: - Hi All, Anybody working on detecting Outlier using Random

[jira] [Assigned] (SPARK-13986) Make `DeveloperApi`-annotated things public

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13986: Assignee: (was: Apache Spark) > Make `DeveloperApi`-annotated things public >

[jira] [Created] (SPARK-13978) [GSoC 2016] Monitoring UI and infrastructure for Spark SQL and structured streaming

2016-03-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-13978: Summary: [GSoC 2016] Monitoring UI and infrastructure for Spark SQL and structured streaming Key: SPARK-13978 URL: https://issues.apache.org/jira/browse/SPARK-13978 Project:

[jira] [Updated] (SPARK-13937) PySpark ML JavaWrapper, variable _java_obj should not be static

2016-03-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-13937: -- Assignee: Bryan Cutler > PySpark ML JavaWrapper, variable _java_obj should not be

[jira] [Updated] (SPARK-12789) Support order by position

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12789: Summary: Support order by position (was: Support order by index) > Support order by position >

[jira] [Closed] (SPARK-13859) TPCDS query 38 returns wrong results compared to TPC official result set

2016-03-19 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JESSE CHEN closed SPARK-13859. -- Resolution: Not A Bug Fix Version/s: 2.0.0 Solution is to revert back to original TPC query

[jira] [Assigned] (SPARK-13985) WAL for determistic batches with IDs

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13985: Assignee: Apache Spark (was: Michael Armbrust) > WAL for determistic batches with IDs >

[jira] [Resolved] (SPARK-13613) Provide ignored tests to export test dataset into CSV format

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13613. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11463

[jira] [Resolved] (SPARK-13942) Remove Shark-related docs for 2.x

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13942. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Remove

[jira] [Updated] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7992: - Target Version/s: 2.0.0 Component/s: Build > Hide private classes/objects in in

[jira] [Updated] (SPARK-13957) Support group by ordinal in SQL

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13957: Description: This is to support order by position in SQL, e.g. {noformat} select c1, c2, c3,

[jira] [Created] (SPARK-13977) Bring back ShuffledHashJoin

2016-03-19 Thread Davies Liu (JIRA)
Davies Liu created SPARK-13977: -- Summary: Bring back ShuffledHashJoin Key: SPARK-13977 URL: https://issues.apache.org/jira/browse/SPARK-13977 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-13865) TPCDS query 87 returns wrong results compared to TPC official result set

2016-03-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199848#comment-15199848 ] Xiao Li commented on SPARK-13865: - Great! Let us wait for the response of Jesse. We need to know how to

[jira] [Commented] (SPARK-13949) PySpark ml DecisionTreeClassifier, Regressor support export/import

2016-03-19 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198240#comment-15198240 ] Gayathri Murali commented on SPARK-13949: - I can work on this > PySpark ml

[jira] [Updated] (SPARK-13948) MiMa Check should catch if the visibility change to `private`

2016-03-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-13948: --- Summary:MiMa Check should catch if the visibility change to `private` (was: MiMa exclusions

[jira] [Assigned] (SPARK-13936) PushPredicateThroughProject using Constraints

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13936: Assignee: Apache Spark > PushPredicateThroughProject using Constraints >

[jira] [Updated] (SPARK-13963) Add binary toggle Param to ml.HashingTF

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-13963: --- Issue Type: Sub-task (was: New Feature) Parent: SPARK-13964 > Add binary toggle

[jira] [Commented] (SPARK-13975) Cannot specify extra libs for executor from /extra-lib

2016-03-19 Thread Leonid Poliakov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199675#comment-15199675 ] Leonid Poliakov commented on SPARK-13975: - I am trying to avoid adding extra steps for users to

[jira] [Resolved] (SPARK-13719) Bad JSON record raises java.lang.ClassCastException

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-13719. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11752

[jira] [Assigned] (SPARK-13991) Extend mvn enforcer rule

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13991: Assignee: Apache Spark > Extend mvn enforcer rule > > >

[jira] [Updated] (SPARK-13960) HTTP-based JAR Server doesn't respect spark.driver.host and there is no "spark.fileserver.host" option

2016-03-19 Thread Ilya Ostrovskiy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ilya Ostrovskiy updated SPARK-13960: Description: There is no option to specify which hostname/IP address the jar/file server

[jira] [Created] (SPARK-13935) Other clients' connection hang up when someone do huge load

2016-03-19 Thread Tao Wang (JIRA)
Tao Wang created SPARK-13935: Summary: Other clients' connection hang up when someone do huge load Key: SPARK-13935 URL: https://issues.apache.org/jira/browse/SPARK-13935 Project: Spark Issue

[jira] [Commented] (SPARK-13363) Aggregator not working with DataFrame

2016-03-19 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199947#comment-15199947 ] koert kuipers commented on SPARK-13363: --- yes problem is still there > Aggregator not working with

[jira] [Created] (SPARK-13936) PushPredicateThroughProject using Constraints

2016-03-19 Thread Xiao Li (JIRA)
Xiao Li created SPARK-13936: --- Summary: PushPredicateThroughProject using Constraints Key: SPARK-13936 URL: https://issues.apache.org/jira/browse/SPARK-13936 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-13041) Add a driver ui link and a mesos sandbox link on the dispatcher's ui page for each driver

2016-03-19 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stavros Kontopoulos updated SPARK-13041: Description: It would be convenient to have the driver's history uri from the

[jira] [Updated] (SPARK-11891) Model export/import for RFormula and RFormulaModel

2016-03-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11891: -- Shepherd: Joseph K. Bradley Assignee: Xusen Yin Target

[jira] [Updated] (SPARK-13983) HiveThriftServer2 can not get "--hiveconf" or ''--hivevar" variables since 1.6 version (both multi-session and single session)

2016-03-19 Thread Teng Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Teng Qiu updated SPARK-13983: - Description: HiveThriftServer2 should be able to get "\--hiveconf" or ''\-\-hivevar" variables from

[jira] [Resolved] (SPARK-11888) Model export/import for spark.ml: DecisionTreeClassifier,Regressor

2016-03-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11888. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11581

[jira] [Commented] (SPARK-13886) ArrayType of BinaryType not supported in Row.equals method

2016-03-19 Thread MahmoudHanafy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198797#comment-15198797 ] MahmoudHanafy commented on SPARK-13886: --- You are right, but it's mentioned in the documentation

[jira] [Created] (SPARK-13984) Schema verification always fail when using remote Hive metastore

2016-03-19 Thread Jianfeng Hu (JIRA)
Jianfeng Hu created SPARK-13984: --- Summary: Schema verification always fail when using remote Hive metastore Key: SPARK-13984 URL: https://issues.apache.org/jira/browse/SPARK-13984 Project: Spark

[jira] [Commented] (SPARK-12981) Dataframe distinct() followed by a filter(udf) in pyspark throws a casting error

2016-03-19 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198280#comment-15198280 ] Xiu (Joe) Guo commented on SPARK-12981: --- Yes [~fabboe], my PR will fix your scenario too. >

[jira] [Created] (SPARK-13949) PySpark ml DecisionTreeClassifier, Regressor support export/import

2016-03-19 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-13949: - Summary: PySpark ml DecisionTreeClassifier, Regressor support export/import Key: SPARK-13949 URL: https://issues.apache.org/jira/browse/SPARK-13949

[jira] [Created] (SPARK-13986) Make `DeveloperApi`-annotated class/object public

2016-03-19 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-13986: - Summary: Make `DeveloperApi`-annotated class/object public Key: SPARK-13986 URL: https://issues.apache.org/jira/browse/SPARK-13986 Project: Spark Issue

[jira] [Commented] (SPARK-13994) Investigate types that are not supported by vectorized parquet record reader

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200807#comment-15200807 ] Apache Spark commented on SPARK-13994: -- User 'sameeragarwal' has created a pull request for this

[jira] [Created] (SPARK-14011) Enable `LineLength` Java checkstyle rule

2016-03-19 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-14011: - Summary: Enable `LineLength` Java checkstyle rule Key: SPARK-14011 URL: https://issues.apache.org/jira/browse/SPARK-14011 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-13978) [GSoC 2016] Build monitoring UI and related infrastructure for Spark SQL and structured streaming

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-13978: - Summary: [GSoC 2016] Build monitoring UI and related infrastructure for Spark SQL and structured

[jira] [Resolved] (SPARK-13926) Automatically use Kryo serializer when shuffling RDDs with simple types

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13926. - Resolution: Fixed Fix Version/s: 2.0.0 > Automatically use Kryo serializer when shuffling

[jira] [Assigned] (SPARK-13974) sub-query names do not need to be globally unique while generate SQL

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13974: Assignee: Apache Spark > sub-query names do not need to be globally unique while generate

[jira] [Commented] (SPARK-13934) SqlParser.parseTableIdentifier cannot recognize table name start with scientific notation

2016-03-19 Thread Yang Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197244#comment-15197244 ] Yang Wang commented on SPARK-13934: --- A table identifier starting with a number will work here but the

[jira] [Commented] (SPARK-13865) TPCDS query 87 returns wrong results compared to TPC official result set

2016-03-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200867#comment-15200867 ] Xiao Li commented on SPARK-13865: - BTW, I do not think DB2 uses the same query for TPC-DS. This is a very

[jira] [Created] (SPARK-14024) SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder" while saving LinearRegressionModel

2016-03-19 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-14024: --- Summary: SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder" while saving LinearRegressionModel Key: SPARK-14024 URL:

[jira] [Updated] (SPARK-13613) Provide ignored tests to export test dataset into CSV format

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13613: -- Target Version/s: 2.0.0 > Provide ignored tests to export test dataset into CSV format >

[jira] [Commented] (SPARK-13865) TPCDS query 87 returns wrong results compared to TPC official result set

2016-03-19 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200877#comment-15200877 ] JESSE CHEN commented on SPARK-13865: I will open a bug against TPCDS toolkit for this. Will add bug

[jira] [Resolved] (SPARK-13869) Remove redundant conditions while combining filters

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-13869. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11670

[jira] [Updated] (SPARK-12789) Support order by position in SQL

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12789: Description: This is to support order by position in SQL, e.g. {noformat} select c1, c2, c3 from

[jira] [Assigned] (SPARK-13928) Move org.apache.spark.Logging into org.apache.spark.internal.Logging

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13928: Assignee: (was: Apache Spark) > Move org.apache.spark.Logging into

[jira] [Commented] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202076#comment-15202076 ] Yin Huai commented on SPARK-14006: -- [~rekhajoshm] Can you take a look at 1.6 branch

[jira] [Created] (SPARK-14023) Make descriptions in exceptions thrown consistent regarding fields and columns

2016-03-19 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-14023: --- Summary: Make descriptions in exceptions thrown consistent regarding fields and columns Key: SPARK-14023 URL: https://issues.apache.org/jira/browse/SPARK-14023

[jira] [Commented] (SPARK-13973) `ipython notebook` is going away...

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202121#comment-15202121 ] Apache Spark commented on SPARK-13973: -- User 'rekhajoshm' has created a pull request for this issue:

[jira] [Updated] (SPARK-14023) Make exceptions consistent regarding fields and columns

2016-03-19 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski updated SPARK-14023: Summary: Make exceptions consistent regarding fields and columns (was: Make descriptions

[jira] [Updated] (SPARK-13978) [GSoC 2016] Build monitoring UI and infrastructure for Spark SQL and structured streaming

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-13978: - Labels: GSOC2016 gsoc2016 mentor (was: GSOC2016 mentor) > [GSoC 2016] Build monitoring UI and

[jira] [Resolved] (SPARK-6601) Add HDFS NFS gateway module to spark-ec2

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6601. -- Resolution: Won't Fix EC2 issues are pretty much WontFix now that it's moved out of Spark > Add HDFS

[jira] [Updated] (SPARK-13986) Remove `DeveloperApi`-annotation for non-publics

2016-03-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-13986: -- Description: Spark uses `@DeveloperApi` annotation, but sometimes it seems to conflict with

[jira] [Commented] (SPARK-13860) TPCDS query 39 returns wrong results compared to TPC official result set

2016-03-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200473#comment-15200473 ] Xiao Li commented on SPARK-13860: - Great! Excellent work! Thank you, [~tsuresh] BTW, also confirmed that

[jira] [Updated] (SPARK-13960) JAR/File HTTP Server doesn't respect "spark.driver.host" and there is no "spark.fileserver.host" option

2016-03-19 Thread Ilya Ostrovskiy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ilya Ostrovskiy updated SPARK-13960: Summary: JAR/File HTTP Server doesn't respect "spark.driver.host" and there is no

[jira] [Issue Comment Deleted] (SPARK-13629) Add binary toggle Param to CountVectorizer

2016-03-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-13629: -- Comment: was deleted (was: [~mlnick] Thanks for handling these count/hashing

[jira] [Updated] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-14006: - Component/s: SparkR > Builds of 1.6 branch fail R style check > ---

[jira] [Resolved] (SPARK-13038) PySpark ml.pipeline support export/import - non-nested Pipelines

2016-03-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-13038. --- Resolution: Fixed Fix Version/s: 2.0.0 > PySpark ml.pipeline support

[jira] [Created] (SPARK-14010) ColumnPruning is conflict with PushPredicateThroughProject

2016-03-19 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14010: -- Summary: ColumnPruning is conflict with PushPredicateThroughProject Key: SPARK-14010 URL: https://issues.apache.org/jira/browse/SPARK-14010 Project: Spark Issue

[jira] [Updated] (SPARK-13966) Regression using .withColumn() on a parquet

2016-03-19 Thread Federico Ponzi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Federico Ponzi updated SPARK-13966: --- Priority: Critical (was: Major) > Regression using .withColumn() on a parquet >

[jira] [Assigned] (SPARK-13938) word2phrase feature created in ML

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13938: Assignee: Apache Spark > word2phrase feature created in ML >

[jira] [Updated] (SPARK-13939) Kafka createDirectStream not parallelizing properly

2016-03-19 Thread Ben Teeuwen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Teeuwen updated SPARK-13939: Description: I’m trying to get a streaming app running using pyspark (1.6.0), Kafka and the

<    1   2   3   4   5   6   >