[jira] [Assigned] (SPARK-19658) Set NumPartitions of RepartitionByExpression In Analyzer

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19658: Assignee: Apache Spark (was: Xiao Li) > Set NumPartitions of RepartitionByExpression In

[jira] [Commented] (SPARK-19658) Set NumPartitions of RepartitionByExpression In Analyzer

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873522#comment-15873522 ] Apache Spark commented on SPARK-19658: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19658) Set NumPartitions of RepartitionByExpression In Analyzer

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19658: Assignee: Xiao Li (was: Apache Spark) > Set NumPartitions of RepartitionByExpression In

[jira] [Created] (SPARK-19658) Set NumPartitions of RepartitionByExpression In Analyzer

2017-02-18 Thread Xiao Li (JIRA)
Xiao Li created SPARK-19658: --- Summary: Set NumPartitions of RepartitionByExpression In Analyzer Key: SPARK-19658 URL: https://issues.apache.org/jira/browse/SPARK-19658 Project: Spark Issue Type:

[jira] [Commented] (SPARK-19647) Spark query hive is extremelly slow even the result data is small

2017-02-18 Thread wuchang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873490#comment-15873490 ] wuchang commented on SPARK-19647: - Hi, I don't think that it is just a question, but also it may be a

[jira] [Commented] (SPARK-16931) PySpark access to data-frame bucketing api

2017-02-18 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873329#comment-15873329 ] Maciej Szymkiewicz commented on SPARK-16931: [~sowen] Is there any particular reason for

[jira] [Assigned] (SPARK-18891) Support for specific collection types

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18891: Assignee: Apache Spark > Support for specific collection types >

[jira] [Assigned] (SPARK-18891) Support for specific collection types

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18891: Assignee: (was: Apache Spark) > Support for specific collection types >

[jira] [Commented] (SPARK-18891) Support for specific collection types

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873300#comment-15873300 ] Apache Spark commented on SPARK-18891: -- User 'michalsenkyr' has created a pull request for this

[jira] [Assigned] (SPARK-19122) Unnecessary shuffle+sort added if join predicates ordering differ from bucketing and sorting order

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19122: Assignee: Apache Spark > Unnecessary shuffle+sort added if join predicates ordering

[jira] [Assigned] (SPARK-19122) Unnecessary shuffle+sort added if join predicates ordering differ from bucketing and sorting order

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19122: Assignee: (was: Apache Spark) > Unnecessary shuffle+sort added if join predicates

[jira] [Commented] (SPARK-19122) Unnecessary shuffle+sort added if join predicates ordering differ from bucketing and sorting order

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873287#comment-15873287 ] Apache Spark commented on SPARK-19122: -- User 'tejasapatil' has created a pull request for this

[jira] [Closed] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-18 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro closed SPARK-19638. Resolution: Duplicate > Filter pushdown not working for struct fields >

[jira] [Comment Edited] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-18 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873237#comment-15873237 ] Takeshi Yamamuro edited comment on SPARK-19638 at 2/18/17 4:25 PM: --- I

[jira] [Commented] (SPARK-19638) Filter pushdown not working for struct fields

2017-02-18 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873237#comment-15873237 ] Takeshi Yamamuro commented on SPARK-19638: -- I found this ticket is duplicated to SPARK-17636, so

[jira] [Resolved] (SPARK-8510) NumPy arrays and matrices as values in sequence files

2017-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8510. - Resolution: Won't Fix I am resolving this per comments in

[jira] [Resolved] (SPARK-19263) DAGScheduler should avoid sending conflicting task set.

2017-02-18 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-19263. Resolution: Fixed Assignee: jin xing Fix Version/s: 2.2.0 > DAGScheduler

[jira] [Resolved] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19655. --- Resolution: Duplicate > select count(*) , requests 1 for each row >

[jira] [Commented] (SPARK-19615) Provide Dataset union convenience for divergent schema

2017-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873189#comment-15873189 ] Hyukjin Kwon commented on SPARK-19615: -- Let me leave loosely related JIRAs - SPARK-9813 , SPARK-9874

[jira] [Commented] (SPARK-19615) Provide Dataset union convenience for divergent schema

2017-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873188#comment-15873188 ] Hyukjin Kwon commented on SPARK-19615: -- I remember I checked UNION operation in other DBMS and

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873184#comment-15873184 ] Hyukjin Kwon commented on SPARK-19655: -- I guess the problematic line is,

[jira] [Created] (SPARK-19657) start-master.sh accidentally forces the use of a loopback address in master URL

2017-02-18 Thread George Hawkins (JIRA)
George Hawkins created SPARK-19657: -- Summary: start-master.sh accidentally forces the use of a loopback address in master URL Key: SPARK-19657 URL: https://issues.apache.org/jira/browse/SPARK-19657

[jira] [Commented] (SPARK-19217) Offer easy cast from vector to array

2017-02-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873159#comment-15873159 ] Liang-Chi Hsieh commented on SPARK-19217: - The native casting of UserDefinedType from/to other

[jira] [Commented] (SPARK-19550) Remove reflection, docs, build elements related to Java 7

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873148#comment-15873148 ] Apache Spark commented on SPARK-19550: -- User 'lins05' has created a pull request for this issue:

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873140#comment-15873140 ] hosein commented on SPARK-19655: I think I should not use spark for my case... > select count(*) ,

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873131#comment-15873131 ] hosein commented on SPARK-19655: if I want to count 100 million data, 100 million 1 returned over network

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873130#comment-15873130 ] Herman van Hovell commented on SPARK-19655: --- We currently only push filters and columns down into

[jira] [Commented] (SPARK-19550) Remove reflection, docs, build elements related to Java 7

2017-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873125#comment-15873125 ] Apache Spark commented on SPARK-19550: -- User 'srowen' has created a pull request for this issue:

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873121#comment-15873121 ] hosein commented on SPARK-19655: I was surprised too :) If you have a Vertica database you can test this part

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873118#comment-15873118 ] hosein commented on SPARK-19655: how can I get count result from my Vertica table? is there any

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873110#comment-15873110 ] hosein edited comment on SPARK-19655 at 2/18/17 11:11 AM: -- I connect to Vertica

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873114#comment-15873114 ] Sean Owen commented on SPARK-19655: --- That's more the JDBC-Vertica integration than Spark-JDBC

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873110#comment-15873110 ] hosein edited comment on SPARK-19655 at 2/18/17 11:09 AM: -- I connect to Vertica

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873110#comment-15873110 ] hosein edited comment on SPARK-19655 at 2/18/17 11:07 AM: -- I connect to Vertica

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873110#comment-15873110 ] hosein edited comment on SPARK-19655 at 2/18/17 11:06 AM: -- I connect to Vertica

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873110#comment-15873110 ] hosein edited comment on SPARK-19655 at 2/18/17 11:06 AM: -- I connect to Vertica

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873110#comment-15873110 ] hosein commented on SPARK-19655: I connect to Vertica by JDBC and downloaded its driver from this link:

[jira] [Reopened] (SPARK-19376) CLONE - CheckAnalysis rejects TPCDS query 32

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-19376: --- > CLONE - CheckAnalysis rejects TPCDS query 32 > > >

[jira] [Closed] (SPARK-19376) CLONE - CheckAnalysis rejects TPCDS query 32

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-19376. - Resolution: Invalid Fix Version/s: (was: 2.1.0) Target Version/s: (was: 2.1.0) >

[jira] [Commented] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873103#comment-15873103 ] Sean Owen commented on SPARK-19653: --- Related to https://issues.apache.org/jira/browse/SPARK-19217 >

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873102#comment-15873102 ] Sean Owen commented on SPARK-19655: --- There are queries issued to test the existence of tables in some

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873093#comment-15873093 ] hosein edited comment on SPARK-19655 at 2/18/17 10:37 AM: -- I have a Vertica

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873093#comment-15873093 ] hosein edited comment on SPARK-19655 at 2/18/17 10:36 AM: -- I have a Vertica

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873093#comment-15873093 ] hosein edited comment on SPARK-19655 at 2/18/17 10:36 AM: -- I have a Vertica

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873093#comment-15873093 ] hosein commented on SPARK-19655: I have a Vertica database with 100 million rows and I run this code in

[jira] [Commented] (SPARK-19656) Can't load custom type from avro file to RDD with newAPIHadoopFile

2017-02-18 Thread Nira Amit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873090#comment-15873090 ] Nira Amit commented on SPARK-19656: --- I also tried to do this without writing my own `AvroKey` and

[jira] [Created] (SPARK-19656) Can't load custom type from avro file to RDD with newAPIHadoopFile

2017-02-18 Thread Nira Amit (JIRA)
Nira Amit created SPARK-19656: - Summary: Can't load custom type from avro file to RDD with newAPIHadoopFile Key: SPARK-19656 URL: https://issues.apache.org/jira/browse/SPARK-19656 Project: Spark

[jira] [Comment Edited] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873087#comment-15873087 ] Sean Owen edited comment on SPARK-19655 at 2/18/17 10:14 AM: - What is the

[jira] [Commented] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873087#comment-15873087 ] Sean Owen commented on SPARK-19655: --- What is the problem? selecting "1" _is_ an optimization, or at

[jira] [Updated] (SPARK-19376) CLONE - CheckAnalysis rejects TPCDS query 32

2017-02-18 Thread Mostafa Shahdadi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mostafa Shahdadi updated SPARK-19376: - Priority: Minor (was: Blocker) > CLONE - CheckAnalysis rejects TPCDS query 32 >

[jira] [Closed] (SPARK-19592) Duplication in Test Configuration Relating to SparkConf Settings Should be Removed

2017-02-18 Thread Armin Braun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Armin Braun closed SPARK-19592. --- Resolution: Won't Fix > Duplication in Test Configuration Relating to SparkConf Settings Should be

[jira] [Updated] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hosein updated SPARK-19655: --- Summary: select count(*) , requests 1 for each row (was: select count(*) , requests 1 foreach row) >

[jira] [Updated] (SPARK-19655) select count(*) , requests 1 for each row

2017-02-18 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hosein updated SPARK-19655: --- Description: when I want query select count( * ) by JDBC and monitor queries in database side, I see spark

[jira] [Created] (SPARK-19655) select count(*) , requests 1 foreach row

2017-02-18 Thread hosein (JIRA)
hosein created SPARK-19655: -- Summary: select count(*) , requests 1 foreach row Key: SPARK-19655 URL: https://issues.apache.org/jira/browse/SPARK-19655 Project: Spark Issue Type: Bug