[jira] [Updated] (ARROW-5191) [Rust] Expose schema in readers (CSV, JSON) without reading batches

2019-04-22 Thread Neville Dipale (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale updated ARROW-5191: -- Fix Version/s: 0.14.0 > [Rust] Expose schema in readers (CSV, JSON) without reading batches > -

[jira] [Assigned] (ARROW-5191) [Rust] Expose schema in readers (CSV, JSON) without reading batches

2019-04-22 Thread Neville Dipale (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale reassigned ARROW-5191: - Assignee: Neville Dipale > [Rust] Expose schema in readers (CSV, JSON) without reading b

[jira] [Updated] (ARROW-5191) [Rust] Expose schema in readers (CSV, JSON) without reading batches

2019-04-22 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5191: -- Labels: pull-request-available (was: ) > [Rust] Expose schema in readers (CSV, JSON) without r

[jira] [Commented] (ARROW-4139) [Python] Cast Parquet column statistics to unicode if UTF8 ConvertedType is set

2019-04-22 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823025#comment-16823025 ] Uwe L. Korn commented on ARROW-4139: Have a look at https://github.com/apache/arrow/p

[jira] [Commented] (ARROW-5176) [Python] Automate formatting of python files

2019-04-22 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823033#comment-16823033 ] Uwe L. Korn commented on ARROW-5176: I have used black in many other projects and am

[jira] [Commented] (ARROW-5145) [C++] Release mode lacks convenience input validation

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823048#comment-16823048 ] Wes McKinney commented on ARROW-5145: - [~pitrou] I'm not sure about that. It seems ne

[jira] [Commented] (ARROW-5145) [C++] Release mode lacks convenience input validation

2019-04-22 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823050#comment-16823050 ] Antoine Pitrou commented on ARROW-5145: --- Yes, my suggestion was that some "debug" c

[jira] [Updated] (ARROW-5156) `df.to_parquet('s3://...', partition_cols=...)` fails with `'NoneType' object has no attribute '_isfilestore'`

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5156: Labels: parquet (was: ) > `df.to_parquet('s3://...', partition_cols=...)` fails with `'NoneType' o

[jira] [Commented] (ARROW-5156) `df.to_parquet('s3://...', partition_cols=...)` fails with `'NoneType' object has no attribute '_isfilestore'`

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823052#comment-16823052 ] Wes McKinney commented on ARROW-5156: - [~jreback] [~mdurant] did something change her

[jira] [Updated] (ARROW-5156) `df.to_parquet('s3://...', partition_cols=...)` fails with `'NoneType' object has no attribute '_isfilestore'`

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5156: Fix Version/s: 0.14.0 > `df.to_parquet('s3://...', partition_cols=...)` fails with `'NoneType' obje

[jira] [Commented] (ARROW-5144) [Python] ParquetDataset and ParquetPiece not serializable

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823055#comment-16823055 ] Wes McKinney commented on ARROW-5144: - [~mrocklin] We might be able to do a 0.13.1 re

[jira] [Updated] (ARROW-5160) [C++] ABORT_NOT_OK evalutes expression twice

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5160: Summary: [C++] ABORT_NOT_OK evalutes expression twice (was: ABORT_NOT_OK evalutes expression twice

[jira] [Commented] (ARROW-2461) [Python] Build wheels for manylinux2010 tag

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823059#comment-16823059 ] Wes McKinney commented on ARROW-2461: - There is a docker image for manylinux2010 avai

[jira] [Commented] (ARROW-5165) [Python][Documentation] Build docs don't suggest assigning $ARROW_BUILD_TYPE

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823061#comment-16823061 ] Wes McKinney commented on ARROW-5165: - The logging should also be made more verbose s

[jira] [Commented] (ARROW-3802) [C++] Cast from integer to half float not implemented

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823066#comment-16823066 ] Wes McKinney commented on ARROW-3802: - TensorFlow uses the {{Eigen::half}} type to de

[jira] [Commented] (ARROW-3802) [C++] Cast from integer to half float not implemented

2019-04-22 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823068#comment-16823068 ] Antoine Pitrou commented on ARROW-3802: --- Yes, Numpy has dedicated routines. > [C++

[jira] [Resolved] (ARROW-2796) [C++] Simplify symbols.map file, use when building libarrow_python

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-2796. - Resolution: Fixed Issue resolved by pull request 4142 [https://github.com/apache/arrow/pull/4142]

[jira] [Commented] (ARROW-5156) `df.to_parquet('s3://...', partition_cols=...)` fails with `'NoneType' object has no attribute '_isfilestore'`

2019-04-22 Thread Martin Durant (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823092#comment-16823092 ] Martin Durant commented on ARROW-5156: -- I wasn't involved in the pandas code here. T

[jira] [Resolved] (ARROW-5185) [C++] Add support for Boost with CMake configuration file

2019-04-22 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-5185. --- Resolution: Fixed Fix Version/s: 0.14.0 Issue resolved by pull request 4173 [https://g

[jira] [Assigned] (ARROW-5167) [C++] Upgrade string-view-light to latest

2019-04-22 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-5167: - Assignee: Antoine Pitrou > [C++] Upgrade string-view-light to latest > -

[jira] [Updated] (ARROW-5167) [C++] Upgrade string-view-light to latest

2019-04-22 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5167: -- Labels: pull-request-available (was: ) > [C++] Upgrade string-view-light to latest > -

[jira] [Updated] (ARROW-5166) [Python][Parquet] Statistics for uint64 columns may overflow

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5166: Labels: parquet (was: ) > [Python][Parquet] Statistics for uint64 columns may overflow > -

[jira] [Updated] (ARROW-5166) [Python][Parquet] Statistics for uint64 columns may overflow

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5166: Summary: [Python][Parquet] Statistics for uint64 columns may overflow (was: [Python] Statistics fo

[jira] [Updated] (ARROW-5166) [Python][Parquet] Statistics for uint64 columns may overflow

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5166: Fix Version/s: 0.14.0 > [Python][Parquet] Statistics for uint64 columns may overflow >

[jira] [Commented] (ARROW-4824) [Python] read_csv should accept io.StringIO objects

2019-04-22 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823111#comment-16823111 ] Antoine Pitrou commented on ARROW-4824: --- Actually, only binary files are accepted b

[jira] [Assigned] (ARROW-4824) [Python] read_csv should accept io.StringIO objects

2019-04-22 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-4824: - Assignee: Antoine Pitrou > [Python] read_csv should accept io.StringIO objects > ---

[jira] [Commented] (ARROW-1983) [Python] Add ability to write parquet `_metadata` file

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823112#comment-16823112 ] Wes McKinney commented on ARROW-1983: - I'm just returning from vacation and catching

[jira] [Updated] (ARROW-5169) [Python] non-nullable fields are converted to nullable in {{Table.from_pandas}}

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5169: Summary: [Python] non-nullable fields are converted to nullable in {{Table.from_pandas}} (was: non

[jira] [Updated] (ARROW-5169) non-nullable fields are converted to nullable in {{Table.from_pandas}}

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5169: Fix Version/s: 0.14.0 > non-nullable fields are converted to nullable in {{Table.from_pandas}} > --

[jira] [Updated] (ARROW-2590) [Python] Pyspark python_udf serialization error on grouped map (Amazon EMR)

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2590: Fix Version/s: 0.14.0 > [Python] Pyspark python_udf serialization error on grouped map (Amazon EMR)

[jira] [Updated] (ARROW-2590) [Python] Pyspark python_udf serialization error on grouped map (Amazon EMR)

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2590: Labels: spark (was: ) > [Python] Pyspark python_udf serialization error on grouped map (Amazon EMR

[jira] [Commented] (ARROW-2590) [Python] Pyspark python_udf serialization error on grouped map (Amazon EMR)

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823143#comment-16823143 ] Wes McKinney commented on ARROW-2590: - Thanks, hopefully [~bryanc] or someone more in

[jira] [Commented] (ARROW-4139) [Python] Cast Parquet column statistics to unicode if UTF8 ConvertedType is set

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823146#comment-16823146 ] Wes McKinney commented on ARROW-4139: - I added this to my list of things to look at e

[jira] [Commented] (ARROW-4753) [C++] Extension types and layouts for text-optimized data structures

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823155#comment-16823155 ] Wes McKinney commented on ARROW-4753: - Users could experiment with such data types em

[jira] [Commented] (ARROW-4698) [C++] Let StringBuilder be constructible with a pre allocated buffer for character data

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823157#comment-16823157 ] Wes McKinney commented on ARROW-4698: - This seems hypothetically useful to avoid an e

[jira] [Closed] (ARROW-1431) [Java] JsonFileReader doesn't intialize some vectors approperately

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-1431. --- Resolution: Fixed Fix Version/s: 0.8.0 > [Java] JsonFileReader doesn't intialize some vectors

[jira] [Reopened] (ARROW-1431) [Java] JsonFileReader doesn't intialize some vectors approperately

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reopened ARROW-1431: - Assignee: Li Jin > [Java] JsonFileReader doesn't intialize some vectors approperately > --

[jira] [Resolved] (ARROW-1431) [Java] JsonFileReader doesn't intialize some vectors approperately

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1431. - Resolution: Fixed > [Java] JsonFileReader doesn't intialize some vectors approperately > ---

[jira] [Commented] (ARROW-5144) [Python] ParquetDataset and ParquetPiece not serializable

2019-04-22 Thread Matthew Rocklin (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823161#comment-16823161 ] Matthew Rocklin commented on ARROW-5144: That would be helpful, yes. We're curre

[jira] [Commented] (ARROW-4648) [C++/Question] Naming/organizational inconsistencies in cpp codebase

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823160#comment-16823160 ] Wes McKinney commented on ARROW-4648: - Yes, I would be in favor of underscores (with

[jira] [Updated] (ARROW-4824) [Python] read_csv should accept io.StringIO objects

2019-04-22 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4824: -- Labels: pull-request-available (was: ) > [Python] read_csv should accept io.StringIO objects >

[jira] [Created] (ARROW-5192) [C++] Bundled gRPC fails building (cannot find c-ares)

2019-04-22 Thread Antoine Pitrou (JIRA)
Antoine Pitrou created ARROW-5192: - Summary: [C++] Bundled gRPC fails building (cannot find c-ares) Key: ARROW-5192 URL: https://issues.apache.org/jira/browse/ARROW-5192 Project: Apache Arrow

[jira] [Created] (ARROW-5193) [C++] Linker error with bundled zlib

2019-04-22 Thread Antoine Pitrou (JIRA)
Antoine Pitrou created ARROW-5193: - Summary: [C++] Linker error with bundled zlib Key: ARROW-5193 URL: https://issues.apache.org/jira/browse/ARROW-5193 Project: Apache Arrow Issue Type: Bug

[jira] [Commented] (ARROW-5193) [C++] Linker error with bundled zlib

2019-04-22 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823270#comment-16823270 ] Antoine Pitrou commented on ARROW-5193: --- Passing {{-DZLIB_SOURCE=SYSTEM}} works aro

[jira] [Created] (ARROW-5194) TEST(PlasmaSerialization, GetReply) is failing

2019-04-22 Thread Guillaume Horel (JIRA)
Guillaume Horel created ARROW-5194: -- Summary: TEST(PlasmaSerialization, GetReply) is failing Key: ARROW-5194 URL: https://issues.apache.org/jira/browse/ARROW-5194 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-2590) [Python] Pyspark python_udf serialization error on grouped map (Amazon EMR)

2019-04-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823289#comment-16823289 ] Bryan Cutler commented on ARROW-2590: - I wasn't able to reproduce the exact error fro

[jira] [Resolved] (ARROW-2590) [Python] Pyspark python_udf serialization error on grouped map (Amazon EMR)

2019-04-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved ARROW-2590. - Resolution: Cannot Reproduce > [Python] Pyspark python_udf serialization error on grouped map (Am

[jira] [Comment Edited] (ARROW-2590) [Python] Pyspark python_udf serialization error on grouped map (Amazon EMR)

2019-04-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823289#comment-16823289 ] Bryan Cutler edited comment on ARROW-2590 at 4/22/19 5:55 PM: -

[jira] [Assigned] (ARROW-4702) [C++] Upgrade dependency versions

2019-04-22 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-4702: - Assignee: Antoine Pitrou > [C++] Upgrade dependency versions > -

[jira] [Commented] (ARROW-4935) [C++] Errors from jemalloc when building pyarrow from source on OSX and Debian

2019-04-22 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823401#comment-16823401 ] Uwe L. Korn commented on ARROW-4935: The {{conda_build_config.yaml}} only works when

[jira] [Resolved] (ARROW-4824) [Python] read_csv should accept io.StringIO objects

2019-04-22 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn resolved ARROW-4824. Resolution: Fixed Issue resolved by pull request 4183 [https://github.com/apache/arrow/pull/4183]

[jira] [Resolved] (ARROW-5167) [C++] Upgrade string-view-light to latest

2019-04-22 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn resolved ARROW-5167. Resolution: Fixed Fix Version/s: 0.14.0 Issue resolved by pull request 4182 [https://github.

[jira] [Created] (ARROW-5195) read_csv ignores null_values on string types

2019-04-22 Thread Scott Burns (JIRA)
Scott Burns created ARROW-5195: -- Summary: read_csv ignores null_values on string types Key: ARROW-5195 URL: https://issues.apache.org/jira/browse/ARROW-5195 Project: Apache Arrow Issue Type: Bug

[jira] [Updated] (ARROW-5195) read_csv ignores null_values on string types

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5195: Fix Version/s: 0.14.0 > read_csv ignores null_values on string types >

[jira] [Updated] (ARROW-5195) [Python] read_csv ignores null_values on string types

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5195: Summary: [Python] read_csv ignores null_values on string types (was: read_csv ignores null_values

[jira] [Created] (ARROW-5196) [CPP] Uniform usage of Google cpu_features library accross the codebase

2019-04-22 Thread Areg Melik-Adamyan (JIRA)
Areg Melik-Adamyan created ARROW-5196: - Summary: [CPP] Uniform usage of Google cpu_features library accross the codebase Key: ARROW-5196 URL: https://issues.apache.org/jira/browse/ARROW-5196 Proje

[jira] [Commented] (ARROW-4139) [Python] Cast Parquet column statistics to unicode if UTF8 ConvertedType is set

2019-04-22 Thread Michael Eaton (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823647#comment-16823647 ] Michael Eaton commented on ARROW-4139: -- I submitted a rough WIP PR, in order to faci

[jira] [Created] (ARROW-5197) [Java] Improving Arrow Vector Reading performance

2019-04-22 Thread Yurui Zhou (JIRA)
Yurui Zhou created ARROW-5197: - Summary: [Java] Improving Arrow Vector Reading performance Key: ARROW-5197 URL: https://issues.apache.org/jira/browse/ARROW-5197 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-5198) [Java] Add hasNull flag to Vectors

2019-04-22 Thread Yurui Zhou (JIRA)
Yurui Zhou created ARROW-5198: - Summary: [Java] Add hasNull flag to Vectors Key: ARROW-5198 URL: https://issues.apache.org/jira/browse/ARROW-5198 Project: Apache Arrow Issue Type: Sub-task

[jira] [Created] (ARROW-5199) [Java] Add unsafe access method to ArrowBuf

2019-04-22 Thread Yurui Zhou (JIRA)
Yurui Zhou created ARROW-5199: - Summary: [Java] Add unsafe access method to ArrowBuf Key: ARROW-5199 URL: https://issues.apache.org/jira/browse/ARROW-5199 Project: Apache Arrow Issue Type: Sub-ta

[jira] [Updated] (ARROW-5197) [Java] Improving Arrow Vector Reading performance

2019-04-22 Thread Yurui Zhou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yurui Zhou updated ARROW-5197: -- Description: Currently the read interface of Java Arrow Vector is quite slow because the access operat

[jira] [Commented] (ARROW-5197) [Java] Improving Arrow Vector Reading performance

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823711#comment-16823711 ] Wes McKinney commented on ARROW-5197: - Possibly duplicate of ARROW-1833, I added a li

[jira] [Commented] (ARROW-5197) [Java] Improving Arrow Vector Reading performance

2019-04-22 Thread Yurui Zhou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823713#comment-16823713 ] Yurui Zhou commented on ARROW-5197: --- oh yeah, you are right. There may some overlap on

[jira] [Commented] (ARROW-5165) [Python][Documentation] Build docs don't suggest assigning $ARROW_BUILD_TYPE

2019-04-22 Thread Rok Mihevc (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823734#comment-16823734 ] Rok Mihevc commented on ARROW-5165: --- I was only building for ARROW_BUILD_TYPE=release.