[jira] [Updated] (ARROW-10428) [FlightRPC][Java] Add support for HTTP cookies

2020-10-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10428: --- Labels: pull-request-available (was: ) > [FlightRPC][Java] Add support for HTTP cookies >

[jira] [Commented] (ARROW-10426) [C++] Arrow type large_string cannot be written to Parquet type column descriptor

2020-10-29 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17223386#comment-17223386 ] Micah Kornfield commented on ARROW-10426: - I think this is just an oversight. There is a limit

[jira] [Created] (ARROW-10428) [FlightRPC][Java] Add support for HTTP cookies

2020-10-29 Thread James Duong (Jira)
James Duong created ARROW-10428: --- Summary: [FlightRPC][Java] Add support for HTTP cookies Key: ARROW-10428 URL: https://issues.apache.org/jira/browse/ARROW-10428 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-10426) [C++] Arrow type large_string cannot be written to Parquet type column descriptor

2020-10-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17223354#comment-17223354 ] Neal Richardson commented on ARROW-10426: - Doing some digging in the source, the error message

[jira] [Updated] (ARROW-10426) Arrow type large_string cannot be written to Parquet type column descriptor

2020-10-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10426: Component/s: C++ > Arrow type large_string cannot be written to Parquet type column

[jira] [Updated] (ARROW-10426) Arrow type large_string cannot be written to Parquet type column descriptor

2020-10-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10426: Labels: parquet (was: ) > Arrow type large_string cannot be written to Parquet type

[jira] [Updated] (ARROW-10426) [C++] Arrow type large_string cannot be written to Parquet type column descriptor

2020-10-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10426: Summary: [C++] Arrow type large_string cannot be written to Parquet type column

[jira] [Resolved] (ARROW-10080) [R] Arrow does not release unused memory

2020-10-29 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-10080. -- Resolution: Fixed Issue resolved by pull request 8533

[jira] [Commented] (ARROW-10412) [C++] Cmake Build Fails with grpc 1.33.1, "GRPC_CPP_PLUGIN-NOTFOUND: program not found or is not executable"

2020-10-29 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17223293#comment-17223293 ] Kouhei Sutou commented on ARROW-10412: -- Could you show the log when you enable Flight, use

[jira] [Created] (ARROW-10427) Add optional session headers

2020-10-29 Thread Tiffany Lam (Jira)
Tiffany Lam created ARROW-10427: --- Summary: Add optional session headers Key: ARROW-10427 URL: https://issues.apache.org/jira/browse/ARROW-10427 Project: Apache Arrow Issue Type: New Feature

[jira] [Created] (ARROW-10426) Arrow type large_string cannot be written to Parquet type column descriptor

2020-10-29 Thread Gabriel Bassett (Jira)
Gabriel Bassett created ARROW-10426: --- Summary: Arrow type large_string cannot be written to Parquet type column descriptor Key: ARROW-10426 URL: https://issues.apache.org/jira/browse/ARROW-10426

[jira] [Commented] (ARROW-1614) [C++] Add a Tensor logical value type with constant dimensions, implemented using ExtensionType

2020-10-29 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17223211#comment-17223211 ] Rok Mihevc commented on ARROW-1614: --- [~bryanc] I've replied in that thread. I've also started working

[jira] [Commented] (ARROW-9676) [R] Error converting Table with nested structs

2020-10-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17223210#comment-17223210 ] Neal Richardson commented on ARROW-9676: [~Ndiquattro] can you see if this is fixed in 2.0.0? I

[jira] [Commented] (ARROW-8714) [C++] Add a Tensor logical value type with varying dimensions, implemented using ExtensionType

2020-10-29 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17223206#comment-17223206 ] Rok Mihevc commented on ARROW-8714: --- I like idea of batching into equally shaped Tensors. It would

[jira] [Commented] (ARROW-4970) [C++][Parquet] Implement parquet::FileMetaData::Equals

2020-10-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17223199#comment-17223199 ] Joris Van den Bossche commented on ARROW-4970: -- This is fixed in

[jira] [Resolved] (ARROW-4970) [C++][Parquet] Implement parquet::FileMetaData::Equals

2020-10-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-4970. -- Resolution: Fixed > [C++][Parquet] Implement parquet::FileMetaData::Equals >

[jira] [Assigned] (ARROW-10131) [C++][Dataset] Lazily parse parquet metadata / statistics in ParquetDatasetFactory and ParquetFileFragment

2020-10-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-10131: - Assignee: Ben Kietzman > [C++][Dataset] Lazily parse parquet metadata

[jira] [Resolved] (ARROW-10131) [C++][Dataset] Lazily parse parquet metadata / statistics in ParquetDatasetFactory and ParquetFileFragment

2020-10-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-10131. --- Resolution: Fixed Issue resolved by pull request 8507

[jira] [Commented] (ARROW-10425) [Python] Support reading (compressed) CSV file from remote file / binary blob

2020-10-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17223190#comment-17223190 ] Joris Van den Bossche commented on ARROW-10425: --- The general filesystem support is also

[jira] [Created] (ARROW-10425) [Python] Support reading (compressed) CSV file from remote file / binary blob

2020-10-29 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10425: - Summary: [Python] Support reading (compressed) CSV file from remote file / binary blob Key: ARROW-10425 URL: https://issues.apache.org/jira/browse/ARROW-10425

[jira] [Updated] (ARROW-10424) [Rust] Simplify code for impl PrimitiveArray

2020-10-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10424: --- Labels: pull-request-available (was: ) > [Rust] Simplify code for impl PrimitiveArray >

[jira] [Created] (ARROW-10424) [Rust] Simplify code for impl PrimitiveArray

2020-10-29 Thread Jira
Jorge Leitão created ARROW-10424: Summary: [Rust] Simplify code for impl PrimitiveArray Key: ARROW-10424 URL: https://issues.apache.org/jira/browse/ARROW-10424 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-10372) [C++][Dataset] Read compressed CSVs

2020-10-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10372: -- Fix Version/s: 3.0.0 > [C++][Dataset] Read compressed CSVs >

[jira] [Commented] (ARROW-10423) [C++] Filter compute function seems slow compared to numpy nonzero + take

2020-10-29 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17223186#comment-17223186 ] Joris Van den Bossche commented on ARROW-10423: --- And note I timed it with released

[jira] [Updated] (ARROW-10422) [Rust] Removed unused BinaryArrayBuilder

2020-10-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10422: --- Labels: pull-request-available (was: ) > [Rust] Removed unused BinaryArrayBuilder >

[jira] [Created] (ARROW-10422) [Rust] Removed unused BinaryArrayBuilder

2020-10-29 Thread Jira
Jorge Leitão created ARROW-10422: Summary: [Rust] Removed unused BinaryArrayBuilder Key: ARROW-10422 URL: https://issues.apache.org/jira/browse/ARROW-10422 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-10423) [C++] Filter compute function seems slow compared to numpy nonzero + take

2020-10-29 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10423: - Summary: [C++] Filter compute function seems slow compared to numpy nonzero + take Key: ARROW-10423 URL: https://issues.apache.org/jira/browse/ARROW-10423

[jira] [Created] (ARROW-10420) [C++] FileSystem::OpenInput{File,Stream} should accept a MemoryPool

2020-10-29 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-10420: Summary: [C++] FileSystem::OpenInput{File,Stream} should accept a MemoryPool Key: ARROW-10420 URL: https://issues.apache.org/jira/browse/ARROW-10420 Project: Apache

[jira] [Updated] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter and RecordBatchFileWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shouheng Yi updated ARROW-10417: Affects Version/s: 2.0.0 > [Python][C++] Possible Memory Leak in RecordBatchStreamWriter and >

[jira] [Resolved] (ARROW-10389) [Rust][DataFusion] Make the custom source implementation API more explicit

2020-10-29 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-10389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão resolved ARROW-10389. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 8527

[jira] [Updated] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shouheng Yi updated ARROW-10417: Description: There might be a memory leak in the {{RecordBatchStreamWriter}}. The memory

[jira] [Updated] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shouheng Yi updated ARROW-10417: Description: There might be a memory leak in the {{RecordBatchStreamWriter}}. The memory

[jira] [Updated] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shouheng Yi updated ARROW-10417: Description: There might be a memory leak in the {{RecordBatchStreamWriter}}. The memory

[jira] [Updated] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shouheng Yi updated ARROW-10417: Attachment: Screen Shot 2020-10-29 at 9.22.58 AM.png > [Python][C++] Possible Memory Leak in

[jira] [Comment Edited] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17222998#comment-17222998 ] Shouheng Yi edited comment on ARROW-10417 at 10/29/20, 4:22 PM: [~wesm]

[jira] [Commented] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17222998#comment-17222998 ] Shouheng Yi commented on ARROW-10417: - [~wesm]TLDR; {{pyarrow 2.0.0}} has the same memory profile. I

[jira] [Updated] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shouheng Yi updated ARROW-10417: Description: There might be a memory leak in the {{RecordBatchStreamWriter}}. The memory

[jira] [Updated] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shouheng Yi updated ARROW-10417: Description: There might be a memory leak in the {{RecordBatchStreamWriter}}. The memory

[jira] [Updated] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Shouheng Yi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shouheng Yi updated ARROW-10417: Description: There might be a memory leak in the {{RecordBatchStreamWriter}}. The memory

[jira] [Commented] (ARROW-9226) [Python] pyarrow.fs.HadoopFileSystem - retrieve options from core-site.xml or hdfs-site.xml if available

2020-10-29 Thread Duan Shiqiang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17222981#comment-17222981 ] Duan Shiqiang commented on ARROW-9226: -- Having the new FileSystem support connect to HDFS without

[jira] [Commented] (ARROW-10418) [C++] Arrow::HiveServer2 client returns No Data to read on openSession

2020-10-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17222973#comment-17222973 ] Wes McKinney commented on ARROW-10418: -- This code hasn't been maintained since I originally

[jira] [Updated] (ARROW-10418) [C++] Arrow::HiveServer2 client returns No Data to read on openSession

2020-10-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-10418: - Summary: [C++] Arrow::HiveServer2 client returns No Data to read on openSession (was:

[jira] [Commented] (ARROW-10417) [Python][C++] Possible Memory Leak in RecordBatchStreamWriter

2020-10-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17222969#comment-17222969 ] Wes McKinney commented on ARROW-10417: -- Is this issue still present in 2.0.0 or on master? If so,

[jira] [Created] (ARROW-10419) Add max_rows parameter to pyarrow.csv.ReadOptions

2020-10-29 Thread Marc Garcia (Jira)
Marc Garcia created ARROW-10419: --- Summary: Add max_rows parameter to pyarrow.csv.ReadOptions Key: ARROW-10419 URL: https://issues.apache.org/jira/browse/ARROW-10419 Project: Apache Arrow Issue

[jira] [Resolved] (ARROW-10413) [Rust] [Parquet] Unignore some roundtrip tests that are passing now

2020-10-29 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão resolved ARROW-10413. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 8546

[jira] [Updated] (ARROW-10386) [R] List column class attributes not preserved in roundtrip

2020-10-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10386: --- Labels: pull-request-available (was: ) > [R] List column class attributes not preserved in

[jira] [Commented] (ARROW-10412) [C++] Cmake Build Fails with grpc 1.33.1, "GRPC_CPP_PLUGIN-NOTFOUND: program not found or is not executable"

2020-10-29 Thread Steven Smith (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17222842#comment-17222842 ] Steven Smith commented on ARROW-10412: -- I'm able to get a successful build by editing in the binary

[jira] [Commented] (ARROW-10412) [C++] Cmake Build Fails with grpc 1.33.1, "GRPC_CPP_PLUGIN-NOTFOUND: program not found or is not executable"

2020-10-29 Thread Steven Smith (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17222821#comment-17222821 ] Steven Smith commented on ARROW-10412: -- Here's the grpc binaries and cmake files, all created with

[jira] [Updated] (ARROW-10418) Arrow::HiveServer2 client returns No Data to read on openSession

2020-10-29 Thread vivek kumar (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vivek kumar updated ARROW-10418: Summary: Arrow::HiveServer2 client returns No Data to read on openSession (was:

[jira] [Created] (ARROW-10418) Arrow::HiveServe2 client returns No Data to read on openSession

2020-10-29 Thread vivek kumar (Jira)
vivek kumar created ARROW-10418: --- Summary: Arrow::HiveServe2 client returns No Data to read on openSession Key: ARROW-10418 URL: https://issues.apache.org/jira/browse/ARROW-10418 Project: Apache Arrow