[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-12 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17283697#comment-17283697 ] Joris Van den Bossche commented on ARROW-11456: --- bq. Note that you may be able to do the

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-09 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17281909#comment-17281909 ] Antoine Pitrou commented on ARROW-11456: Yeah, well, the first question is at which layer the

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-09 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17281869#comment-17281869 ] Pac A. He commented on ARROW-11456: --- We have seen that there are one or more pyarrow limits at

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-09 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17281854#comment-17281854 ] Antoine Pitrou commented on ARROW-11456: Thanks for the reproducer [~apacman] . Unfortunately,

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279918#comment-17279918 ] Pac A. He commented on ARROW-11456: --- I see. I have now added code to reproduce the issue. Basically,

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-04 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279136#comment-17279136 ] Weston Pace commented on ARROW-11456: - The 31 bit limit you are referencing is not the 31 bit limit

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-04 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17278976#comment-17278976 ] Pac A. He commented on ARROW-11456: --- Unfortunately I have not been able to produce a reproducible

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277239#comment-17277239 ] Joris Van den Bossche commented on ARROW-11456: --- bq. If you still need code, I can write

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-02 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277234#comment-17277234 ] Pac A. He commented on ARROW-11456: --- For what it's worth, {{fastparquet}} v0.5.0 had no trouble at all

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-01 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17276517#comment-17276517 ] Antoine Pitrou commented on ARROW-11456: Was the Parquet file generated with Arrow? > [Python]

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-01 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17276501#comment-17276501 ] Pac A. He commented on ARROW-11456: --- [~jorisvandenbossche] This is very difficult in this case because

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17276480#comment-17276480 ] Joris Van den Bossche commented on ARROW-11456: --- [~apacman] would you be able to provide a

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-01 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17276462#comment-17276462 ] Antoine Pitrou commented on ARROW-11456: cc [~jorisvandenbossche] > [Python] Parquet reader