[jira] [Resolved] (SPARK-28579) MaxAbsScaler avoids conversion to breeze.vector

2019-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28579. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25311

[jira] [Assigned] (SPARK-28579) MaxAbsScaler avoids conversion to breeze.vector

2019-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28579: - Assignee: zhengruifeng > MaxAbsScaler avoids conversion to breeze.vector >

[jira] [Updated] (SPARK-28521) Fix error message for built-in functions

2019-08-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28521: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) > Fix error message for

[jira] [Assigned] (SPARK-28521) Fix error message for built-in functions

2019-08-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28521: - Assignee: Yuming Wang > Fix error message for built-in functions >

[jira] [Resolved] (SPARK-28521) Fix error message for built-in functions

2019-08-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28521. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25261

[jira] [Assigned] (SPARK-25584) Document libsvm data source in doc site

2019-08-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-25584: - Assignee: zhengruifeng > Document libsvm data source in doc site >

[jira] [Resolved] (SPARK-25584) Document libsvm data source in doc site

2019-08-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-25584. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25286

[jira] [Updated] (SPARK-25584) Document libsvm data source in doc site

2019-08-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-25584: -- Priority: Minor (was: Major) > Document libsvm data source in doc site >

[jira] [Assigned] (SPARK-28399) Impl RobustScaler

2019-07-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28399: - Assignee: zhengruifeng > Impl RobustScaler > - > > Key:

[jira] [Resolved] (SPARK-28399) Impl RobustScaler

2019-07-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28399. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25160

[jira] [Updated] (SPARK-28519) Tests failed on aarch64 due the value of math.log and power function is different

2019-07-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28519: -- Docs Text: The result of `java.lang.Math`'s `log`, `log1p`, `exp`, `expm1`, and `pow` may vary

[jira] [Updated] (SPARK-28519) Tests failed on aarch64 due the value of math.log and power function is different

2019-07-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28519: -- Docs Text: The result of the JVM's Math.log and Math.pow may vary across platforms. In Spark 3.0, the

[jira] [Commented] (SPARK-28519) Tests failed on aarch64 due the value of math.log and power function is different

2019-07-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16895279#comment-16895279 ] Sean Owen commented on SPARK-28519: --- Interesting thread from a JDK port list; it does seem like the

[jira] [Assigned] (SPARK-21481) Add indexOf method in ml.feature.HashingTF similar to mllib.feature.HashingTF

2019-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-21481: - Assignee: Huaxin Gao > Add indexOf method in ml.feature.HashingTF similar to

[jira] [Resolved] (SPARK-21481) Add indexOf method in ml.feature.HashingTF similar to mllib.feature.HashingTF

2019-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21481. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25250

[jira] [Commented] (SPARK-28519) Tests failed on aarch64 due the value of math.log and power function is different

2019-07-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16894524#comment-16894524 ] Sean Owen commented on SPARK-28519: --- A summary of the dev@ discussion: This is almost surely because

[jira] [Assigned] (SPARK-28507) remove deprecated API context(self, sqlContext) from pyspark/ml/util.py

2019-07-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28507: - Assignee: Huaxin Gao > remove deprecated API context(self, sqlContext) from pyspark/ml/util.py

[jira] [Resolved] (SPARK-28507) remove deprecated API context(self, sqlContext) from pyspark/ml/util.py

2019-07-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28507. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25246

[jira] [Resolved] (SPARK-28499) Optimize MinMaxScaler

2019-07-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28499. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25244

[jira] [Assigned] (SPARK-28499) Optimize MinMaxScaler

2019-07-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28499: - Assignee: zhengruifeng > Optimize MinMaxScaler > - > >

[jira] [Updated] (SPARK-28421) SparseVector.apply performance optimization

2019-07-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28421: -- Affects Version/s: 2.4.3 Priority: Minor (was: Major) Fix Version/s: 2.4.4 >

[jira] [Updated] (SPARK-25382) Remove ImageSchema.readImages in 3.0

2019-07-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-25382: -- Docs Text: In Spark 3.0.0, the deprecated ImageSchema class and its readImages methods have been

[jira] [Resolved] (SPARK-28421) SparseVector.apply performance optimization

2019-07-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28421. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25178

[jira] [Assigned] (SPARK-28421) SparseVector.apply performance optimization

2019-07-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28421: - Assignee: zhengruifeng > SparseVector.apply performance optimization >

[jira] [Resolved] (SPARK-28446) Document Kafka Headers support

2019-07-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28446. --- Resolution: Duplicate > Document Kafka Headers support > -- > >

[jira] [Assigned] (SPARK-28243) remove setFeatureSubsetStrategy and setSubsamplingRate from Python TreeEnsembleParams

2019-07-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28243: - Assignee: Huaxin Gao > remove setFeatureSubsetStrategy and setSubsamplingRate from Python >

[jira] [Resolved] (SPARK-28243) remove setFeatureSubsetStrategy and setSubsamplingRate from Python TreeEnsembleParams

2019-07-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28243. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25046

[jira] [Resolved] (SPARK-28416) Use java.time API in timestampAddInterval

2019-07-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28416. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25173

[jira] [Assigned] (SPARK-28416) Use java.time API in timestampAddInterval

2019-07-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28416: - Assignee: Maxim Gekk > Use java.time API in timestampAddInterval >

[jira] [Resolved] (SPARK-24283) Make standard scaler work without legacy MLlib

2019-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-24283. --- Resolution: Duplicate > Make standard scaler work without legacy MLlib >

[jira] [Commented] (SPARK-28086) Adds `random()` sql function

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16886374#comment-16886374 ] Sean Owen commented on SPARK-28086: --- Yeah if this is just an alias... OK that seems simple but is this

[jira] [Commented] (SPARK-28134) Trigonometric Functions

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16886371#comment-16886371 ] Sean Owen commented on SPARK-28134: --- I'm not sure this is worth it. These just take degrees as an

[jira] [Commented] (SPARK-28225) Unexpected behavior for Window functions

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16886271#comment-16886271 ] Sean Owen commented on SPARK-28225: --- That's a weird one and I agree with what you expect. The 'nulls

[jira] [Commented] (SPARK-28366) Logging in driver when loading single large gzipped file via sc.textFile

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16886229#comment-16886229 ] Sean Owen commented on SPARK-28366: --- ... what do you want to log? > Logging in driver when loading

[jira] [Resolved] (SPARK-28368) Row.getAs() return different values in scala and java

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28368. --- Resolution: Not A Problem > Row.getAs() return different values in scala and java >

[jira] [Commented] (SPARK-27781) Tried to access method org.apache.avro.specific.SpecificData.()V

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16886227#comment-16886227 ] Sean Owen commented on SPARK-27781: --- I think this duplicates several general "Avro + Parquet versions

[jira] [Commented] (SPARK-27821) Spark WebUI - show numbers of drivers/apps in waiting/submitted/killed/running state

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16886226#comment-16886226 ] Sean Owen commented on SPARK-27821: --- Likewise I think this is more noise on the UI without a lot of

[jira] [Updated] (SPARK-27822) Spark WebUi - for running applications have a drivername column

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-27822: -- Priority: Minor (was: Major) You can already get this info from within the app. I don't see much

[jira] [Assigned] (SPARK-27944) Unify the behavior of checking empty output column names

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-27944: - Assignee: zhengruifeng > Unify the behavior of checking empty output column names >

[jira] [Resolved] (SPARK-27944) Unify the behavior of checking empty output column names

2019-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-27944. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24793

[jira] [Assigned] (SPARK-28311) Spark Thrift Server protocol version compatibility setup too late

2019-07-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28311: - Assignee: angerszhu > Spark Thrift Server protocol version compatibility setup too late >

[jira] [Resolved] (SPARK-28311) Spark Thrift Server protocol version compatibility setup too late

2019-07-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28311. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25083

[jira] [Resolved] (SPARK-28199) Move Trigger implementations to Triggers.scala and avoid exposing these to the end users

2019-07-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28199. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24996

[jira] [Resolved] (SPARK-28247) Flaky test: "query without test harness" in ContinuousSuite

2019-07-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28247. --- Resolution: Fixed Assignee: Jungtaek Lim Fix Version/s: 3.0.0 Resolved by

[jira] [Assigned] (SPARK-28199) Move Trigger implementations to Triggers.scala and avoid exposing these to the end users

2019-07-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28199: - Docs Text: In Spark 3.0, the deprecated class org.apache.spark.sql.streaming.ProcessingTime

[jira] [Resolved] (SPARK-28337) spark jars do not contain commons-jxpath jar, cause ClassNotFound exception

2019-07-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28337. --- Resolution: Not A Problem > spark jars do not contain commons-jxpath jar, cause ClassNotFound

[jira] [Commented] (SPARK-28324) The LOG function using 10 as the base, but Spark using E

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16882288#comment-16882288 ] Sean Owen commented on SPARK-28324: --- I don't think we should change this as it will break code and

[jira] [Commented] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16882286#comment-16882286 ] Sean Owen commented on SPARK-4591: -- What else would go under this umbrella? > Algorithm/model parity

[jira] [Resolved] (SPARK-24462) Text socket micro-batch reader throws error when a query is restarted with saved state

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-24462. --- Resolution: Duplicate > Text socket micro-batch reader throws error when a query is restarted with

[jira] [Resolved] (SPARK-27560) HashPartitioner uses Object.hashCode which is not seeded

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-27560. --- Resolution: Not A Problem > HashPartitioner uses Object.hashCode which is not seeded >

[jira] [Resolved] (SPARK-26440) Show total CPU time across all tasks on stage pages

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-26440. --- Resolution: Won't Fix > Show total CPU time across all tasks on stage pages >

[jira] [Resolved] (SPARK-26497) Show users where the pre-packaged SparkR and PySpark Dockerfiles are in the image build script.

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-26497. --- Resolution: Later > Show users where the pre-packaged SparkR and PySpark Dockerfiles are in the >

[jira] [Resolved] (SPARK-26097) Show partitioning details in DAG UI

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-26097. --- Resolution: Later > Show partitioning details in DAG UI > --- > >

[jira] [Updated] (SPARK-26097) Show partitioning details in DAG UI

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-26097: -- Priority: Minor (was: Major) This can be reopened with a PR that would address the different

[jira] [Updated] (SPARK-28199) Remove usage of ProcessingTime in Spark codebase

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28199: -- Labels: release-notes (was: ) > Remove usage of ProcessingTime in Spark codebase >

[jira] [Assigned] (SPARK-28267) Update building-spark.md

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28267: - Assignee: Yuming Wang > Update building-spark.md > > >

[jira] [Resolved] (SPARK-28267) Update building-spark.md

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28267. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25063

[jira] [Updated] (SPARK-28267) Update building-spark.md

2019-07-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28267: -- Priority: Trivial (was: Major) > Update building-spark.md > > >

[jira] [Resolved] (SPARK-28140) Pyspark API to create spark.mllib RowMatrix from DataFrame

2019-07-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28140. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24953

[jira] [Updated] (SPARK-28140) Pyspark API to create spark.mllib RowMatrix from DataFrame

2019-07-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28140: -- Priority: Minor (was: Major) > Pyspark API to create spark.mllib RowMatrix from DataFrame >

[jira] [Assigned] (SPARK-28140) Pyspark API to create spark.mllib RowMatrix from DataFrame

2019-07-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28140: - Assignee: Henry Davidge > Pyspark API to create spark.mllib RowMatrix from DataFrame >

[jira] [Resolved] (SPARK-25834) stream stream Outer join with update mode is not throwing exception

2019-07-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-25834. --- Resolution: Duplicate > stream stream Outer join with update mode is not throwing exception >

[jira] [Resolved] (SPARK-28159) Make the transform natively in ml framework to avoid extra conversion

2019-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28159. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24963

[jira] [Assigned] (SPARK-28159) Make the transform natively in ml framework to avoid extra conversion

2019-07-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28159: - Assignee: zhengruifeng > Make the transform natively in ml framework to avoid extra conversion

[jira] [Commented] (SPARK-28264) Revisiting Python / pandas UDF

2019-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16879980#comment-16879980 ] Sean Owen commented on SPARK-28264: --- I generally like the rationalization of the various UDF types, as

[jira] [Assigned] (SPARK-28160) TransportClient.sendRpcSync may hang forever

2019-06-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28160: - Assignee: Lantao Jin > TransportClient.sendRpcSync may hang forever >

[jira] [Resolved] (SPARK-28160) TransportClient.sendRpcSync may hang forever

2019-06-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28160. --- Resolution: Fixed Fix Version/s: 2.4.4 2.3.4 3.0.0

[jira] [Updated] (SPARK-28170) DenseVector .toArray() and .values documentation do not specify they are aliases

2019-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28170: -- Priority: Trivial (was: Minor) > DenseVector .toArray() and .values documentation do not specify

[jira] [Resolved] (SPARK-28145) Executor pods polling source can fail to replace dead executors

2019-06-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28145. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24952

[jira] [Assigned] (SPARK-28145) Executor pods polling source can fail to replace dead executors

2019-06-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28145: - Assignee: Onur Satici > Executor pods polling source can fail to replace dead executors >

[jira] [Updated] (SPARK-28164) usage description does not match with shell scripts

2019-06-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28164: -- Priority: Minor (was: Major) > usage description does not match with shell scripts >

[jira] [Assigned] (SPARK-28164) usage description does not match with shell scripts

2019-06-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28164: - Assignee: Shivu Sondur > usage description does not match with shell scripts >

[jira] [Resolved] (SPARK-28164) usage description does not match with shell scripts

2019-06-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28164. --- Resolution: Fixed Fix Version/s: 2.4.4 2.3.4 3.0.0

[jira] [Updated] (SPARK-28145) Executor pods polling source can fail to replace dead executors

2019-06-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28145: -- Priority: Minor (was: Major) Issue Type: Improvement (was: New Feature) > Executor pods

[jira] [Resolved] (SPARK-26985) Test "access only some column of the all of columns " fails on big endian

2019-06-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-26985. --- Resolution: Fixed Assignee: ketan kunde Fix Version/s: 3.0.0 Resolved by

[jira] [Resolved] (SPARK-28154) GMM fix double caching

2019-06-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28154. --- Resolution: Fixed Assignee: zhengruifeng Fix Version/s: 3.0.0

[jira] [Assigned] (SPARK-28117) LDA and BisectingKMeans cache the input dataset if necessary

2019-06-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28117: - Assignee: zhengruifeng > LDA and BisectingKMeans cache the input dataset if necessary >

[jira] [Resolved] (SPARK-28117) LDA and BisectingKMeans cache the input dataset if necessary

2019-06-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28117. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24920

[jira] [Updated] (SPARK-28117) LDA and BisectingKMeans cache the input dataset if necessary

2019-06-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28117: -- Priority: Minor (was: Major) > LDA and BisectingKMeans cache the input dataset if necessary >

[jira] [Updated] (SPARK-28045) add missing RankingEvaluator

2019-06-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28045: -- Priority: Minor (was: Major) > add missing RankingEvaluator > > >

[jira] [Assigned] (SPARK-28045) add missing RankingEvaluator

2019-06-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28045: - Assignee: zhengruifeng > add missing RankingEvaluator > > >

[jira] [Resolved] (SPARK-28045) add missing RankingEvaluator

2019-06-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28045. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24869

[jira] [Resolved] (SPARK-26896) Add maven profiles for running tests with JDK 11

2019-06-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-26896. --- Resolution: Not A Problem > Add maven profiles for running tests with JDK 11 >

[jira] [Assigned] (SPARK-27018) Checkpointed RDD deleted prematurely when using GBTClassifier

2019-06-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-27018: - Assignee: zhengruifeng > Checkpointed RDD deleted prematurely when using GBTClassifier >

[jira] [Resolved] (SPARK-27018) Checkpointed RDD deleted prematurely when using GBTClassifier

2019-06-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-27018. --- Resolution: Fixed Fix Version/s: 2.4.4 2.3.4 3.0.0

[jira] [Assigned] (SPARK-27989) Add retries on the connection to the driver

2019-06-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-27989: - Assignee: Jose Luis Pedrosa > Add retries on the connection to the driver >

[jira] [Resolved] (SPARK-27989) Add retries on the connection to the driver

2019-06-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-27989. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24702

[jira] [Commented] (SPARK-28114) Add Jenkins job for `Hadoop-3.2` profile

2019-06-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16870710#comment-16870710 ] Sean Owen commented on SPARK-28114: --- [~dongjoon] I didn't see {{--force}} in the job you mentioned,

[jira] [Commented] (SPARK-26839) Work around classloader changes in Java 9 for Hive isolation

2019-06-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16867912#comment-16867912 ] Sean Owen commented on SPARK-26839: --- There is still a classloader and datanucleus and Hive issue here

[jira] [Assigned] (SPARK-28062) HuberAggregator copies coefficients vector every time an instance is added

2019-06-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28062: - Assignee: Andrew Crosby > HuberAggregator copies coefficients vector every time an instance is

[jira] [Resolved] (SPARK-28062) HuberAggregator copies coefficients vector every time an instance is added

2019-06-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28062. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24880

[jira] [Assigned] (SPARK-28044) MulticlassClassificationEvaluator support more metrics

2019-06-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-28044: - Assignee: zhengruifeng > MulticlassClassificationEvaluator support more metrics >

[jira] [Resolved] (SPARK-28044) MulticlassClassificationEvaluator support more metrics

2019-06-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28044. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24868

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28106: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) > Spark SQL add jar with

[jira] [Commented] (SPARK-28093) Built-in function trim/ltrim/rtrim has bug when using trimStr

2019-06-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16867621#comment-16867621 ] Sean Owen commented on SPARK-28093: --- Should we even call it a correctness problem? Like is this

[jira] [Updated] (SPARK-28093) Built-in function trim/ltrim/rtrim has bug when using trimStr

2019-06-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-28093: -- Labels: release-notes (was: ) > Built-in function trim/ltrim/rtrim has bug when using trimStr >

[jira] [Resolved] (SPARK-14409) Investigate adding a RankingEvaluator to ML

2019-06-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14409. --- Resolution: Duplicate > Investigate adding a RankingEvaluator to ML >

[jira] [Resolved] (SPARK-27716) Complete the transactions support for part of jdbc datasource operations.

2019-06-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-27716. --- Resolution: Won't Fix > Complete the transactions support for part of jdbc datasource operations. >

[jira] [Resolved] (SPARK-28081) word2vec 'large' count value too low for very large corpora

2019-06-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-28081. --- Resolution: Fixed Fix Version/s: 2.4.4 2.3.4 3.0.0

[jira] [Created] (SPARK-28081) word2vec 'large' count value too low for very large corpora

2019-06-17 Thread Sean Owen (JIRA)
Sean Owen created SPARK-28081: - Summary: word2vec 'large' count value too low for very large corpora Key: SPARK-28081 URL: https://issues.apache.org/jira/browse/SPARK-28081 Project: Spark Issue

<    1   2   3   4   5   6   7   8   9   10   >