[jira] [Comment Edited] (SPARK-18877) Unable to read given csv data. Exception: java.lang.IllegalArgumentException: requirement failed: Decimal precision 28 exceeds max precision 20

2016-12-18 Thread Navya Krishnappa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760487#comment-15760487 ] Navya Krishnappa edited comment on SPARK-18877 at 12/19/16 7:56 AM:

[jira] [Commented] (SPARK-18877) Unable to read given csv data. Exception: java.lang.IllegalArgumentException: requirement failed: Decimal precision 28 exceeds max precision 20

2016-12-18 Thread Navya Krishnappa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760487#comment-15760487 ] Navya Krishnappa commented on SPARK-18877: -- Thank you [~dongjoon] > Unable to read given csv

[jira] [Commented] (SPARK-18924) Improve collect/createDataFrame performance in SparkR

2016-12-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760439#comment-15760439 ] Xiangrui Meng commented on SPARK-18924: --- cc: [~shivaram] [~felixcheung] [~falaki] [~yanboliang] for

[jira] [Created] (SPARK-18924) Improve collect/createDataFrame performance in SparkR

2016-12-18 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-18924: - Summary: Improve collect/createDataFrame performance in SparkR Key: SPARK-18924 URL: https://issues.apache.org/jira/browse/SPARK-18924 Project: Spark

[jira] [Assigned] (SPARK-18871) New test cases for IN subquery

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18871: Assignee: (was: Apache Spark) > New test cases for IN subquery >

[jira] [Assigned] (SPARK-18871) New test cases for IN subquery

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18871: Assignee: Apache Spark > New test cases for IN subquery > --

[jira] [Commented] (SPARK-18871) New test cases for IN subquery

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760370#comment-15760370 ] Apache Spark commented on SPARK-18871: -- User 'kevinyu98' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18923) Support SKIP_PYTHONDOC/RDOC in doc generation

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18923: Assignee: Apache Spark > Support SKIP_PYTHONDOC/RDOC in doc generation >

[jira] [Assigned] (SPARK-18923) Support SKIP_PYTHONDOC/RDOC in doc generation

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18923: Assignee: (was: Apache Spark) > Support SKIP_PYTHONDOC/RDOC in doc generation >

[jira] [Commented] (SPARK-18923) Support SKIP_PYTHONDOC/RDOC in doc generation

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760332#comment-15760332 ] Apache Spark commented on SPARK-18923: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Created] (SPARK-18923) Support SKIP_PYTHONDOC/RDOC in doc generation

2016-12-18 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-18923: - Summary: Support SKIP_PYTHONDOC/RDOC in doc generation Key: SPARK-18923 URL: https://issues.apache.org/jira/browse/SPARK-18923 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18857) SparkSQL ThriftServer hangs while extracting huge data volumes in incremental collect mode

2016-12-18 Thread vishal agrawal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760203#comment-15760203 ] vishal agrawal commented on SPARK-18857: We are unable to use incremental collect in a spark

[jira] [Commented] (SPARK-18922) Fix more resource-closing-related and path-related test failures in identified ones on Windows

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760116#comment-15760116 ] Apache Spark commented on SPARK-18922: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-18922) Fix more resource-closing-related and path-related test failures in identified ones on Windows

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18922: Assignee: (was: Apache Spark) > Fix more resource-closing-related and path-related

[jira] [Assigned] (SPARK-18922) Fix more resource-closing-related and path-related test failures in identified ones on Windows

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18922: Assignee: Apache Spark > Fix more resource-closing-related and path-related test failures

[jira] [Created] (SPARK-18922) Fix more resource-closing-related and path-related test failures in identified ones on Windows

2016-12-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-18922: Summary: Fix more resource-closing-related and path-related test failures in identified ones on Windows Key: SPARK-18922 URL: https://issues.apache.org/jira/browse/SPARK-18922

[jira] [Updated] (SPARK-18703) Insertion/CTAS against Hive Tables: Staging Directories and Data Files Not Dropped Until Normal Termination of JVM

2016-12-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-18703: Fix Version/s: 2.1.1 > Insertion/CTAS against Hive Tables: Staging Directories and Data Files Not

[jira] [Updated] (SPARK-18675) CTAS for hive serde table should work for all hive versions

2016-12-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-18675: Fix Version/s: 2.1.1 > CTAS for hive serde table should work for all hive versions >

[jira] [Assigned] (SPARK-18921) check database existence with Hive.databaseExists instead of getDatabase

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18921: Assignee: Wenchen Fan (was: Apache Spark) > check database existence with

[jira] [Commented] (SPARK-18921) check database existence with Hive.databaseExists instead of getDatabase

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760023#comment-15760023 ] Apache Spark commented on SPARK-18921: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18921) check database existence with Hive.databaseExists instead of getDatabase

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18921: Assignee: Apache Spark (was: Wenchen Fan) > check database existence with

[jira] [Created] (SPARK-18921) check database existence with Hive.databaseExists instead of getDatabase

2016-12-18 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-18921: --- Summary: check database existence with Hive.databaseExists instead of getDatabase Key: SPARK-18921 URL: https://issues.apache.org/jira/browse/SPARK-18921 Project:

[jira] [Closed] (SPARK-18767) Unify Models' toString methods

2016-12-18 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng closed SPARK-18767. Resolution: Won't Fix > Unify Models' toString methods > -- > >

[jira] [Commented] (SPARK-18917) Dataframe - Time Out Issues / Taking long time in append mode on object stores

2016-12-18 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15759931#comment-15759931 ] Dongjoon Hyun commented on SPARK-18917: --- Hi, [~alunarbeach]. Sure, you can make a PR. BTW, please

[jira] [Updated] (SPARK-18917) Dataframe - Time Out Issues / Taking long time in append mode on object stores

2016-12-18 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-18917: -- Fix Version/s: (was: 2.1.1) (was: 2.1.0) > Dataframe - Time Out

[jira] [Updated] (SPARK-18917) Dataframe - Time Out Issues / Taking long time in append mode on object stores

2016-12-18 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-18917: -- Target Version/s: (was: 2.1.0, 2.1.1) > Dataframe - Time Out Issues / Taking long time in

[jira] [Commented] (SPARK-18920) Update outdated date formatting

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15759917#comment-15759917 ] Apache Spark commented on SPARK-18920: -- User 'WangTaoTheTonic' has created a pull request for this

[jira] [Assigned] (SPARK-18920) Update outdated date formatting

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18920: Assignee: (was: Apache Spark) > Update outdated date formatting >

[jira] [Assigned] (SPARK-18920) Update outdated date formatting

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18920: Assignee: Apache Spark > Update outdated date formatting >

[jira] [Created] (SPARK-18920) Update outdated date formatting

2016-12-18 Thread Tao Wang (JIRA)
Tao Wang created SPARK-18920: Summary: Update outdated date formatting Key: SPARK-18920 URL: https://issues.apache.org/jira/browse/SPARK-18920 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18916) Possible bug in Pregel / mergeMsg with hashmaps

2016-12-18 Thread Seth Bromberger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15759870#comment-15759870 ] Seth Bromberger commented on SPARK-18916: - Added to update:

[jira] [Commented] (SPARK-17073) generate basic stats for column

2016-12-18 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15759787#comment-15759787 ] Zhenhua Wang commented on SPARK-17073: -- [~ioana-delaney] Thanks for sharing the information! >

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15759635#comment-15759635 ] Apache Spark commented on SPARK-18817: -- User 'felixcheung' has created a pull request for this

[jira] [Closed] (SPARK-18919) PrimitiveKeyOpenHashMap is boxing values

2016-12-18 Thread Jakub Liska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jakub Liska closed SPARK-18919. --- Resolution: Not A Problem Ahh, my fault, the OpenHashSet is rehashing at 734004 and doubles the size

[jira] [Updated] (SPARK-18919) PrimitiveKeyOpenHashMap is boxing values

2016-12-18 Thread Jakub Liska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jakub Liska updated SPARK-18919: Description: Hey, I was benchmarking PrimitiveKeyOpenHashMap for speed and memory footprint and I

[jira] [Created] (SPARK-18919) PrimitiveKeyOpenHashMap is boxing values

2016-12-18 Thread Jakub Liska (JIRA)
Jakub Liska created SPARK-18919: --- Summary: PrimitiveKeyOpenHashMap is boxing values Key: SPARK-18919 URL: https://issues.apache.org/jira/browse/SPARK-18919 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15759368#comment-15759368 ] Felix Cheung commented on SPARK-18817: -- testing fix, will open a PR shortly. > Ensure nothing is

[jira] [Commented] (SPARK-16046) Add Spark SQL Dataset Tutorial

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15758691#comment-15758691 ] Apache Spark commented on SPARK-16046: -- User 'aokolnychyi' has created a pull request for this

[jira] [Assigned] (SPARK-16046) Add Spark SQL Dataset Tutorial

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16046: Assignee: (was: Apache Spark) > Add Spark SQL Dataset Tutorial >

[jira] [Assigned] (SPARK-16046) Add Spark SQL Dataset Tutorial

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16046: Assignee: Apache Spark > Add Spark SQL Dataset Tutorial > --

[jira] [Commented] (SPARK-12216) Spark failed to delete temp directory

2016-12-18 Thread certman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15758646#comment-15758646 ] certman commented on SPARK-12216: - I can confirm this issue exists in Windows 7 running Spark 2.x

[jira] [Assigned] (SPARK-18808) ml.KMeansModel.transform is very inefficient

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18808: Assignee: (was: Apache Spark) > ml.KMeansModel.transform is very inefficient >

[jira] [Commented] (SPARK-18808) ml.KMeansModel.transform is very inefficient

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15758577#comment-15758577 ] Apache Spark commented on SPARK-18808: -- User 'srowen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18808) ml.KMeansModel.transform is very inefficient

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18808: Assignee: Apache Spark > ml.KMeansModel.transform is very inefficient >

[jira] [Comment Edited] (SPARK-18829) Printing to logger

2016-12-18 Thread David Hodeffi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15758520#comment-15758520 ] David Hodeffi edited comment on SPARK-18829 at 12/18/16 9:10 AM: - If so,

[jira] [Updated] (SPARK-18827) Can't read broadcast if broadcast blocks are stored on-disk

2016-12-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18827: -- Assignee: Yuming Wang > Can't read broadcast if broadcast blocks are stored on-disk >

[jira] [Commented] (SPARK-18829) Printing to logger

2016-12-18 Thread David Hodeffi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15758520#comment-15758520 ] David Hodeffi commented on SPARK-18829: --- If so, I think you should add it to documentation and

[jira] [Resolved] (SPARK-18827) Can't read broadcast if broadcast blocks are stored on-disk

2016-12-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18827. --- Resolution: Fixed Fix Version/s: 2.0.3 2.1.1 Issue resolved by pull

[jira] [Commented] (SPARK-18882) Spark UI , storage tab is always empty.

2016-12-18 Thread David Hodeffi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15758513#comment-15758513 ] David Hodeffi commented on SPARK-18882: --- generate 4 random tables using range() function on

[jira] [Resolved] (SPARK-18918) Missing in Configuration page

2016-12-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18918. --- Resolution: Fixed Fix Version/s: 2.1.1 Issue resolved by pull request 16327

[jira] [Updated] (SPARK-18918) Missing in Configuration page

2016-12-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18918: -- Target Version/s: (was: 2.1.0) Priority: Minor (was: Blocker) > Missing in

[jira] [Assigned] (SPARK-18918) Missing in Configuration page

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18918: Assignee: Apache Spark (was: Xiao Li) > Missing in Configuration page >

[jira] [Assigned] (SPARK-18918) Missing in Configuration page

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18918: Assignee: Xiao Li (was: Apache Spark) > Missing in Configuration page >

[jira] [Commented] (SPARK-18918) Missing in Configuration page

2016-12-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15758459#comment-15758459 ] Apache Spark commented on SPARK-18918: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Created] (SPARK-18918) Missing in Configuration page

2016-12-18 Thread Xiao Li (JIRA)
Xiao Li created SPARK-18918: --- Summary: Missing in Configuration page Key: SPARK-18918 URL: https://issues.apache.org/jira/browse/SPARK-18918 Project: Spark Issue Type: Bug Components:

[jira] [Updated] (SPARK-18915) Return Nothing when Querying a Partitioned Data Source Table without Repairing it

2016-12-18 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-18915: Issue Type: Sub-task (was: Bug) Parent: SPARK-17861 > Return Nothing when Querying a Partitioned