[jira] [Commented] (SPARK-25933) Fix pstats reference for spark.python.profile.dump in configuration.md

2018-11-03 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16674043#comment-16674043 ] Alex Hagerman commented on SPARK-25933: --- https://github.com/apache/spark/pull/22933 > Fix pstats

[jira] [Updated] (SPARK-25933) Fix pstats reference for spark.python.profile.dump in configuration.md

2018-11-03 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated SPARK-25933: -- Labels: documentation pull-request-available (was: documentation) > Fix pstats reference for

[jira] [Created] (SPARK-25933) Fix pstats reference for spark.python.profile.dump in configuration.md

2018-11-03 Thread Alex Hagerman (JIRA)
Alex Hagerman created SPARK-25933: - Summary: Fix pstats reference for spark.python.profile.dump in configuration.md Key: SPARK-25933 URL: https://issues.apache.org/jira/browse/SPARK-25933 Project:

[jira] [Assigned] (ARROW-2600) [Python] Add additional LocalFileSystem filesystem methods

2018-09-17 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman reassigned ARROW-2600: Assignee: (was: Alex Hagerman) > [Python] Add additional LocalFileSystem filesystem

[jira] [Updated] (ARROW-2760) [Python] Remove legacy property definition syntax from parquet module and test them

2018-07-12 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2760: - Component/s: Python > [Python] Remove legacy property definition syntax from parquet module and

[jira] [Commented] (ARROW-955) [Docs] Guide for building Python from source on Ubuntu 14.04 LTS without conda

2018-07-10 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16538419#comment-16538419 ] Alex Hagerman commented on ARROW-955: - Does this still need to happen with the updated dev docs? I

[jira] [Updated] (ARROW-2586) Make child builders of ListBuilder and StructBuilder shared_ptr's

2018-07-10 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2586: - Component/s: C++ > Make child builders of ListBuilder and StructBuilder shared_ptr's >

[jira] [Updated] (ARROW-2658) [Python] Serialize and Deserialize Table objects

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2658: - Summary: [Python] Serialize and Deserialize Table objects (was: Serialize and Deserialize Table

[jira] [Updated] (ARROW-2658) Serialize and Deserialize Table objects

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2658: - Component/s: Python > Serialize and Deserialize Table objects >

[jira] [Updated] (ARROW-2710) pyarrow.lib.ArrowIOError when running PyTorch DataLoader in multiprocessing

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2710: - Component/s: Python > pyarrow.lib.ArrowIOError when running PyTorch DataLoader in

[jira] [Updated] (ARROW-2710) [Python] pyarrow.lib.ArrowIOError when running PyTorch DataLoader in multiprocessing

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2710: - Summary: [Python] pyarrow.lib.ArrowIOError when running PyTorch DataLoader in multiprocessing

[jira] [Updated] (ARROW-2787) [Python] Memory Issue passing table from python to c++ via cython

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2787: - Summary: [Python] Memory Issue passing table from python to c++ via cython (was: Memory Issue

[jira] [Updated] (ARROW-2787) Memory Issue passing table from python to c++ via cython

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2787: - Component/s: Python > Memory Issue passing table from python to c++ via cython >

[jira] [Updated] (ARROW-2787) Memory Issue passing table from python to c++ via cython

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2787: - Labels: cython (was: ) > Memory Issue passing table from python to c++ via cython >

[jira] [Updated] (ARROW-2709) [Python] write_to_dataset poor performance when splitting

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2709: - Summary: [Python] write_to_dataset poor performance when splitting (was: write_to_dataset poor

[jira] [Updated] (ARROW-2274) [Python] ObjectID from string

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2274: - Summary: [Python] ObjectID from string (was: ObjectID from string) > [Python] ObjectID from

[jira] [Updated] (ARROW-2709) write_to_dataset poor performance when splitting

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2709: - Labels: parquet (was: ) > write_to_dataset poor performance when splitting >

[jira] [Updated] (ARROW-2709) write_to_dataset poor performance when splitting

2018-07-09 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman updated ARROW-2709: - Component/s: Python > write_to_dataset poor performance when splitting >

[jira] [Created] (ARROW-2601) [Python] MemoryPool bytes_allocated causes seg

2018-05-17 Thread Alex Hagerman (JIRA)
Alex Hagerman created ARROW-2601: Summary: [Python] MemoryPool bytes_allocated causes seg Key: ARROW-2601 URL: https://issues.apache.org/jira/browse/ARROW-2601 Project: Apache Arrow Issue

[jira] [Created] (ARROW-2601) [Python] MemoryPool bytes_allocated causes seg

2018-05-17 Thread Alex Hagerman (JIRA)
Alex Hagerman created ARROW-2601: Summary: [Python] MemoryPool bytes_allocated causes seg Key: ARROW-2601 URL: https://issues.apache.org/jira/browse/ARROW-2601 Project: Apache Arrow Issue

[jira] [Created] (ARROW-2600) [Python] Add additional LocalFileSystem filesystem methods

2018-05-17 Thread Alex Hagerman (JIRA)
Alex Hagerman created ARROW-2600: Summary: [Python] Add additional LocalFileSystem filesystem methods Key: ARROW-2600 URL: https://issues.apache.org/jira/browse/ARROW-2600 Project: Apache Arrow

[jira] [Created] (ARROW-2600) [Python] Add additional LocalFileSystem filesystem methods

2018-05-17 Thread Alex Hagerman (JIRA)
Alex Hagerman created ARROW-2600: Summary: [Python] Add additional LocalFileSystem filesystem methods Key: ARROW-2600 URL: https://issues.apache.org/jira/browse/ARROW-2600 Project: Apache Arrow

[jira] [Commented] (ARROW-2428) [Python] Support ExtensionArrays in to_pandas conversion

2018-05-12 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16473253#comment-16473253 ] Alex Hagerman commented on ARROW-2428: -- [~xhochy] I was reading through the meta issue and trying to

[jira] [Assigned] (ARROW-1964) [Python] Expose Builder classes

2018-05-06 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman reassigned ARROW-1964: Assignee: (was: Alex Hagerman) > [Python] Expose Builder classes >

[jira] [Assigned] (ARROW-1964) [Python] Expose Builder classes

2018-05-04 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman reassigned ARROW-1964: Assignee: Alex Hagerman > [Python] Expose Builder classes >

[jira] [Commented] (ARROW-2339) [Python] Add a fast path for int hashing

2018-04-13 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437747#comment-16437747 ] Alex Hagerman commented on ARROW-2339: -- Good to know. I'll look at the open tickets and priority to

[jira] [Commented] (ARROW-2339) [Python] Add a fast path for int hashing

2018-04-13 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437729#comment-16437729 ] Alex Hagerman commented on ARROW-2339: -- That will be interesting! Got it. Thank you for the

[jira] [Commented] (ARROW-2339) [Python] Add a fast path for int hashing

2018-04-13 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437664#comment-16437664 ] Alex Hagerman commented on ARROW-2339: -- [~pitrou] [~wesmckinn] sorry I've been absent on this work

[jira] [Created] (ARROW-2395) [Python] Correct flake8 errors outside of benchmarks

2018-04-04 Thread Alex Hagerman (JIRA)
Alex Hagerman created ARROW-2395: Summary: [Python] Correct flake8 errors outside of benchmarks Key: ARROW-2395 URL: https://issues.apache.org/jira/browse/ARROW-2395 Project: Apache Arrow

[jira] [Created] (ARROW-2394) [Python] Correct flake8 errors in benchmarks

2018-04-04 Thread Alex Hagerman (JIRA)
Alex Hagerman created ARROW-2394: Summary: [Python] Correct flake8 errors in benchmarks Key: ARROW-2394 URL: https://issues.apache.org/jira/browse/ARROW-2394 Project: Apache Arrow Issue

[jira] [Assigned] (ARROW-2325) [Python] Update setup.py to use Markdown project description

2018-03-29 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman reassigned ARROW-2325: Assignee: Alex Hagerman > [Python] Update setup.py to use Markdown project description >

[jira] [Created] (ARROW-2339) [Python] Add a fast path for int hashing

2018-03-21 Thread Alex Hagerman (JIRA)
Alex Hagerman created ARROW-2339: Summary: [Python] Add a fast path for int hashing Key: ARROW-2339 URL: https://issues.apache.org/jira/browse/ARROW-2339 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-2339) [Python] Add a fast path for int hashing

2018-03-21 Thread Alex Hagerman (JIRA)
Alex Hagerman created ARROW-2339: Summary: [Python] Add a fast path for int hashing Key: ARROW-2339 URL: https://issues.apache.org/jira/browse/ARROW-2339 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2018-03-18 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16404189#comment-16404189 ] Alex Hagerman commented on ARROW-640: - I've added the __hash__ for ints and opened a PR. __eq__ was

[jira] [Commented] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2018-03-14 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16399783#comment-16399783 ] Alex Hagerman commented on ARROW-640: - Sounds good. Just to verify Integer only or Number types in

[jira] [Commented] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2018-03-13 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16397909#comment-16397909 ] Alex Hagerman commented on ARROW-640: - Thanks [~pitrou] this was actually what I had implemented

[jira] [Comment Edited] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2018-03-11 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16394627#comment-16394627 ] Alex Hagerman edited comment on ARROW-640 at 3/11/18 9:02 PM: -- I think this

[jira] [Comment Edited] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2018-03-11 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16394627#comment-16394627 ] Alex Hagerman edited comment on ARROW-640 at 3/11/18 9:01 PM: -- I think this

[jira] [Comment Edited] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2018-03-11 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16394627#comment-16394627 ] Alex Hagerman edited comment on ARROW-640 at 3/11/18 8:16 PM: -- I think this

[jira] [Commented] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2018-03-11 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16394627#comment-16394627 ] Alex Hagerman commented on ARROW-640: - I think this has changed since the original ticket. The

[jira] [Comment Edited] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2018-03-11 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16394627#comment-16394627 ] Alex Hagerman edited comment on ARROW-640 at 3/11/18 8:13 PM: -- I think this

[jira] [Assigned] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2018-03-07 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Hagerman reassigned ARROW-640: --- Assignee: Alex Hagerman > [Python] Arrow scalar values should have a sensible __hash__ and

[jira] [Commented] (ARROW-1391) [Python] Benchmarks for python serialization

2018-03-01 Thread Alex Hagerman (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16383033#comment-16383033 ] Alex Hagerman commented on ARROW-1391: -- I see recent commits in the repo for the benchmarks. Is this