[jira] [Assigned] (SPARK-19658) Set NumPartitions of RepartitionByExpression In Analyzer

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19658: Assignee: Apache Spark (was: Xiao Li) > Set NumPartitions of RepartitionByExpression In A

[jira] [Commented] (SPARK-19658) Set NumPartitions of RepartitionByExpression In Analyzer

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873522#comment-15873522 ] Apache Spark commented on SPARK-19658: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-19658) Set NumPartitions of RepartitionByExpression In Analyzer

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19658: Assignee: Xiao Li (was: Apache Spark) > Set NumPartitions of RepartitionByExpression In A

[jira] [Created] (SPARK-19658) Set NumPartitions of RepartitionByExpression In Analyzer

2017-02-18 Thread Xiao Li (JIRA)
Xiao Li created SPARK-19658: --- Summary: Set NumPartitions of RepartitionByExpression In Analyzer Key: SPARK-19658 URL: https://issues.apache.org/jira/browse/SPARK-19658 Project: Spark Issue Type: Im

[jira] [Commented] (SPARK-19647) Spark query hive is extremelly slow even the result data is small

2017-02-18 Thread wuchang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873490#comment-15873490 ] wuchang commented on SPARK-19647: - Hi , I don't think that it is just a question ,but als

[jira] [Commented] (SPARK-16931) PySpark access to data-frame bucketing api

2017-02-18 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873329#comment-15873329 ] Maciej Szymkiewicz commented on SPARK-16931: [~sowen] Is there any particular

[jira] [Assigned] (SPARK-18891) Support for specific collection types

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18891: Assignee: Apache Spark > Support for specific collection types > -

[jira] [Assigned] (SPARK-18891) Support for specific collection types

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18891: Assignee: (was: Apache Spark) > Support for specific collection types > --

[jira] [Commented] (SPARK-18891) Support for specific collection types

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873300#comment-15873300 ] Apache Spark commented on SPARK-18891: -- User 'michalsenkyr' has created a pull reque

[jira] [Assigned] (SPARK-19122) Unnecessary shuffle+sort added if join predicates ordering differ from bucketing and sorting order

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19122: Assignee: (was: Apache Spark) > Unnecessary shuffle+sort added if join predicates orde

[jira] [Assigned] (SPARK-19122) Unnecessary shuffle+sort added if join predicates ordering differ from bucketing and sorting order

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19122: Assignee: Apache Spark > Unnecessary shuffle+sort added if join predicates ordering differ

[jira] [Commented] (SPARK-19122) Unnecessary shuffle+sort added if join predicates ordering differ from bucketing and sorting order

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873287#comment-15873287 ] Apache Spark commented on SPARK-19122: -- User 'tejasapatil' has created a pull reques

[jira] [Closed] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-18 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro closed SPARK-19638. Resolution: Duplicate > Filter pushdown not working for struct fields > ---

[jira] [Comment Edited] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-18 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873237#comment-15873237 ] Takeshi Yamamuro edited comment on SPARK-19638 at 2/18/17 4:25 PM:

[jira] [Commented] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-18 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873237#comment-15873237 ] Takeshi Yamamuro commented on SPARK-19638: -- I found this ticket is duplicated to

[jira] [Resolved] (SPARK-8510) NumPy arrays and matrices as values in sequence files

2017-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8510. - Resolution: Won't Fix I am resolving this per comments in https://github.com/apache/spark/pull/838

[jira] [Resolved] (SPARK-19263) DAGScheduler should avoid sending conflicting task set.

2017-02-18 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-19263. Resolution: Fixed Assignee: jin xing Fix Version/s: 1.2.0 > DAGScheduler sh

[jira] [Resolved] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19655. --- Resolution: Duplicate > select count(*) , requests 1 for each row > -

[jira] [Commented] (SPARK-19615) Provide Dataset union convenience for divergent schema

2017-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873189#comment-15873189 ] Hyukjin Kwon commented on SPARK-19615: -- Let me leave loosely related JIRAs - SPARK-9

[jira] [Commented] (SPARK-19615) Provide Dataset union convenience for divergent schema

2017-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873188#comment-15873188 ] Hyukjin Kwon commented on SPARK-19615: -- I remember I checked UNION operation in othe

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873184#comment-15873184 ] Hyukjin Kwon commented on SPARK-19655: -- I guess the problematic line is, https://gi

[jira] [Created] (SPARK-19657) start-master.sh accidentally forces the use of a loopback address in master URL

2017-02-18 Thread George Hawkins (JIRA)
George Hawkins created SPARK-19657: -- Summary: start-master.sh accidentally forces the use of a loopback address in master URL Key: SPARK-19657 URL: https://issues.apache.org/jira/browse/SPARK-19657 P

[jira] [Commented] (SPARK-19217) Offer easy cast from vector to array

2017-02-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873159#comment-15873159 ] Liang-Chi Hsieh commented on SPARK-19217: - The native casting of UserDefinedType

[jira] [Commented] (SPARK-19550) Remove reflection, docs, build elements related to Java 7

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873148#comment-15873148 ] Apache Spark commented on SPARK-19550: -- User 'lins05' has created a pull request for

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873140#comment-15873140 ] hosein commented on SPARK-19655: I think I should not use spark for my case... > select

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873131#comment-15873131 ] hosein commented on SPARK-19655: if I want to count 100 million data, 100 million 1 retur

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873130#comment-15873130 ] Herman van Hovell commented on SPARK-19655: --- We current only push filters and c

[jira] [Commented] (SPARK-19550) Remove reflection, docs, build elements related to Java 7

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873125#comment-15873125 ] Apache Spark commented on SPARK-19550: -- User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873121#comment-15873121 ] hosein commented on SPARK-19655: I surprised too : ) if you have Vertica database you can

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873118#comment-15873118 ] hosein commented on SPARK-19655: how can I get count result from my Vertica table? is the

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873110#comment-15873110 ] hosein edited comment on SPARK-19655 at 2/18/17 11:11 AM: -- I con

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873114#comment-15873114 ] Sean Owen commented on SPARK-19655: --- That's more the JDBC-Vertica integration than Spar

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873110#comment-15873110 ] hosein edited comment on SPARK-19655 at 2/18/17 11:09 AM: -- I con

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873110#comment-15873110 ] hosein edited comment on SPARK-19655 at 2/18/17 11:07 AM: -- I con

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873110#comment-15873110 ] hosein edited comment on SPARK-19655 at 2/18/17 11:06 AM: -- I con

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873110#comment-15873110 ] hosein commented on SPARK-19655: I connect to Vertica by JDBC and downloaded it's driver

[jira] [Reopened] (SPARK-19376) CLONE - CheckAnalysis rejects TPCDS query 32

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-19376: --- > CLONE - CheckAnalysis rejects TPCDS query 32 > > >

[jira] [Closed] (SPARK-19376) CLONE - CheckAnalysis rejects TPCDS query 32

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-19376. - Resolution: Invalid Fix Version/s: (was: 2.1.0) Target Version/s: (was: 2.1.0) > C

[jira] [Commented] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873103#comment-15873103 ] Sean Owen commented on SPARK-19653: --- Related to https://issues.apache.org/jira/browse/S

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873102#comment-15873102 ] Sean Owen commented on SPARK-19655: --- There are queries issued to test the existence of

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873093#comment-15873093 ] hosein edited comment on SPARK-19655 at 2/18/17 10:37 AM: -- I hav

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873093#comment-15873093 ] hosein edited comment on SPARK-19655 at 2/18/17 10:36 AM: -- I hav

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873093#comment-15873093 ] hosein commented on SPARK-19655: I have a Vertica database with 100 million rows and I ru

[jira] [Commented] (SPARK-19656) Can't load custom type from avro file to RDD with newAPIHadoopFile

2017-02-18 Thread Nira Amit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873090#comment-15873090 ] Nira Amit commented on SPARK-19656: --- I also tried to do this without writing my own `Av

[jira] [Created] (SPARK-19656) Can't load custom type from avro file to RDD with newAPIHadoopFile

2017-02-18 Thread Nira Amit (JIRA)
Nira Amit created SPARK-19656: - Summary: Can't load custom type from avro file to RDD with newAPIHadoopFile Key: SPARK-19656 URL: https://issues.apache.org/jira/browse/SPARK-19656 Project: Spark

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873087#comment-15873087 ] Sean Owen edited comment on SPARK-19655 at 2/18/17 10:14 AM: -

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15873087#comment-15873087 ] Sean Owen commented on SPARK-19655: --- What is the problem? selecting "1" _is_ an optimiz

[jira] [Updated] (SPARK-19376) CLONE - CheckAnalysis rejects TPCDS query 32

2017-02-18 Thread Mostafa Shahdadi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mostafa Shahdadi updated SPARK-19376: - Priority: Minor (was: Blocker) > CLONE - CheckAnalysis rejects TPCDS query 32 >

[jira] [Closed] (SPARK-19592) Duplication in Test Configuration Relating to SparkConf Settings Should be Removed

2017-02-18 Thread Armin Braun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Armin Braun closed SPARK-19592. --- Resolution: Won't Fix > Duplication in Test Configuration Relating to SparkConf Settings Should be >

[jira] [Updated] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hosein updated SPARK-19655: --- Summary: select count(*) , requests 1 for each row (was: select count(*) , requests 1 foreach row) > select

[jira] [Updated] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hosein updated SPARK-19655: --- Description: when I want query select count( * ) by JDBC and monitor queries in database side, I see spark r

[jira] [Created] (SPARK-19655) select count(*) , requests 1 foreach row

2017-02-18 Thread hosein (JIRA)
hosein created SPARK-19655: -- Summary: select count(*) , requests 1 foreach row Key: SPARK-19655 URL: https://issues.apache.org/jira/browse/SPARK-19655 Project: Spark Issue Type: Bug Compon