[jira] [Commented] (PARQUET-1968) FilterApi support In predicate

2021-02-01 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17276548#comment-17276548 ] Ryan Blue commented on PARQUET-1968: Thank you! I'm not sure why it was no longer on my calendar. I

[jira] [Commented] (PARQUET-1968) FilterApi support In predicate

2021-02-01 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17276526#comment-17276526 ] Ryan Blue commented on PARQUET-1968: I would really like to see a new Parquet API that can support

[jira] [Commented] (PARQUET-1901) Add filter null check for ColumnIndex

2020-08-24 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17183481#comment-17183481 ] Ryan Blue commented on PARQUET-1901: It isn't clear to me how a filter implementation would handle

[jira] [Commented] (PARQUET-1809) Add new APIs for nested predicate pushdown

2020-03-04 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17051585#comment-17051585 ] Ryan Blue commented on PARQUET-1809: I think it should be fine to allow this. While there may be

[jira] [Commented] (PARQUET-1681) Avro's isElementType() change breaks the reading of some parquet(1.8.1) files

2019-11-07 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16969493#comment-16969493 ] Ryan Blue commented on PARQUET-1681: Looks like it might be 

[jira] [Commented] (PARQUET-1681) Avro's isElementType() change breaks the reading of some parquet(1.8.1) files

2019-11-07 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16969491#comment-16969491 ] Ryan Blue commented on PARQUET-1681: I think we should be able to work around this instead of

[jira] [Commented] (PARQUET-1681) Avro's isElementType() change breaks the reading of some parquet(1.8.1) files

2019-11-07 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16969489#comment-16969489 ] Ryan Blue commented on PARQUET-1681: The Avro check should ignore record names if the record is the

[jira] [Commented] (PARQUET-1685) Truncate the stored min and max for String statistics to reduce the footer size

2019-10-28 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16961187#comment-16961187 ] Ryan Blue commented on PARQUET-1685: Looks like Gabor is right. The stats fields used for each

[jira] [Commented] (PARQUET-722) Building with JDK 8 fails over a maven bug

2019-08-20 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911797#comment-16911797 ] Ryan Blue commented on PARQUET-722: --- Looks like this was fixed when cascading3 support updated the

[jira] [Comment Edited] (PARQUET-722) Building with JDK 8 fails over a maven bug

2019-08-20 Thread Ryan Blue (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911797#comment-16911797 ] Ryan Blue edited comment on PARQUET-722 at 8/20/19 10:59 PM: - Looks like

[jira] [Commented] (PARQUET-1434) Release parquet-mr 1.11.0

2019-07-23 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16891462#comment-16891462 ] Ryan Blue commented on PARQUET-1434: My concern is that it has not been reviewed well enough to be

[jira] [Commented] (PARQUET-1488) UserDefinedPredicate throw NullPointerException

2019-07-12 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16884204#comment-16884204 ] Ryan Blue commented on PARQUET-1488: We discussed this on SPARK-28371. Previously, Parquet did not 

[jira] [Assigned] (PARQUET-1488) UserDefinedPredicate throw NullPointerException

2019-07-12 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-1488: -- Assignee: Yuming Wang (was: Gabor Szadovszky) > UserDefinedPredicate throw

[jira] [Reopened] (PARQUET-1488) UserDefinedPredicate throw NullPointerException

2019-07-12 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reopened PARQUET-1488: > UserDefinedPredicate throw NullPointerException > ---

[jira] [Created] (PARQUET-1624) ParquetFileReader.open ignores Hadoop configuration options

2019-07-11 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-1624: -- Summary: ParquetFileReader.open ignores Hadoop configuration options Key: PARQUET-1624 URL: https://issues.apache.org/jira/browse/PARQUET-1624 Project: Parquet

[jira] [Commented] (PARQUET-1142) Avoid leaking Hadoop API to downstream libraries

2019-02-22 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16775448#comment-16775448 ] Ryan Blue commented on PARQUET-1142: The next steps for this are to get compression working without

[jira] [Resolved] (PARQUET-1281) Jackson dependency

2019-02-18 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1281. Resolution: Not A Problem > Jackson dependency > -- > > Key:

[jira] [Resolved] (PARQUET-1512) Release Parquet Java 1.10.1

2019-02-04 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1512. Resolution: Fixed > Release Parquet Java 1.10.1 > --- > >

[jira] [Assigned] (PARQUET-138) Parquet should allow a merge between required and optional schemas

2019-02-01 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-138: - Assignee: Nicolas Trinquier (was: Ryan Blue) > Parquet should allow a merge between required

[jira] [Assigned] (PARQUET-138) Parquet should allow a merge between required and optional schemas

2019-02-01 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-138: - Assignee: Nicolas Trinquier (was: Nicolas Trinquier) > Parquet should allow a merge between

[jira] [Assigned] (PARQUET-138) Parquet should allow a merge between required and optional schemas

2019-02-01 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-138: - Assignee: Ryan Blue > Parquet should allow a merge between required and optional schemas >

[jira] [Commented] (PARQUET-1520) Update README to use correct build and version info

2019-01-31 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757679#comment-16757679 ] Ryan Blue commented on PARQUET-1520: Thanks for contributing! > Update README to use correct build

[jira] [Assigned] (PARQUET-1520) Update README to use correct build and version info

2019-01-31 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-1520: -- Assignee: Dongjoon Hyun > Update README to use correct build and version info >

[jira] [Resolved] (PARQUET-1520) Update README to use correct build and version info

2019-01-31 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1520. Resolution: Fixed Fix Version/s: 1.10.2 > Update README to use correct build and version

[jira] [Resolved] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-28 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1510. Resolution: Fixed > Dictionary filter skips null values when evaluating not-equals. >

[jira] [Resolved] (PARQUET-1509) Update Docs for Hive Deprecation

2019-01-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1509. Resolution: Fixed > Update Docs for Hive Deprecation > > >

[jira] [Assigned] (PARQUET-1509) Update Docs for Hive Deprecation

2019-01-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-1509: -- Assignee: BELUGA BEHR > Update Docs for Hive Deprecation >

[jira] [Resolved] (PARQUET-1513) HiddenFileFilter Streamline

2019-01-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1513. Resolution: Fixed Fix Version/s: 1.12.0 > HiddenFileFilter Streamline >

[jira] [Assigned] (PARQUET-1513) HiddenFileFilter Streamline

2019-01-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-1513: -- Assignee: BELUGA BEHR > HiddenFileFilter Streamline > --- > >

[jira] [Assigned] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-1510: -- Assignee: Ryan Blue > Dictionary filter skips null values when evaluating not-equals. >

[jira] [Updated] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1510: --- Issue Type: Bug (was: Improvement) > Dictionary filter skips null values when evaluating

[jira] [Updated] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1510: --- Affects Version/s: 1.9.1 1.9.0 1.10.0 > Dictionary

[jira] [Commented] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16752641#comment-16752641 ] Ryan Blue commented on PARQUET-1510: Fixed metadata. > Dictionary filter skips null values when

[jira] [Updated] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1510: --- Labels: correctness pull-request-available (was: pull-request-available) > Dictionary filter

[jira] [Created] (PARQUET-1512) Release Parquet Java 1.10.1

2019-01-25 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-1512: -- Summary: Release Parquet Java 1.10.1 Key: PARQUET-1512 URL: https://issues.apache.org/jira/browse/PARQUET-1512 Project: Parquet Issue Type: Task

[jira] [Updated] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1510: --- Priority: Blocker (was: Major) > Dictionary filter skips null values when evaluating not-equals.

[jira] [Updated] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1510: --- Fix Version/s: 1.10.1 > Dictionary filter skips null values when evaluating not-equals. >

[jira] [Updated] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1510: --- Fix Version/s: 1.11.0 > Dictionary filter skips null values when evaluating not-equals. >

[jira] [Updated] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1510: --- Component/s: parquet-mr > Dictionary filter skips null values when evaluating not-equals. >

[jira] [Created] (PARQUET-1510) Dictionary filter skips null values when evaluating not-equals.

2019-01-25 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-1510: -- Summary: Dictionary filter skips null values when evaluating not-equals. Key: PARQUET-1510 URL: https://issues.apache.org/jira/browse/PARQUET-1510 Project: Parquet

[jira] [Commented] (PARQUET-1447) MapredParquetOutputFormat - Save Some Array Allocations

2019-01-08 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737333#comment-16737333 ] Ryan Blue commented on PARQUET-1447: I'd be happy to merge a PR! > MapredParquetOutputFormat -

[jira] [Assigned] (PARQUET-1447) MapredParquetOutputFormat - Save Some Array Allocations

2019-01-07 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-1447: -- Resolution: Won't Fix Assignee: Ryan Blue I'm closing this because these classes are

[jira] [Resolved] (PARQUET-1465) CLONE - Add a way to append encoded blocks in ParquetFileWriter

2018-11-29 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1465. Resolution: Fixed See PARQUET-382. > CLONE - Add a way to append encoded blocks in

[jira] [Resolved] (PARQUET-1407) Data loss on duplicate values with AvroParquetWriter/Reader

2018-11-19 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1407. Resolution: Fixed Assignee: Nandor Kollar > Data loss on duplicate values with

[jira] [Commented] (PARQUET-1407) Data loss on duplicate values with AvroParquetWriter/Reader

2018-11-15 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16688796#comment-16688796 ] Ryan Blue commented on PARQUET-1407: [~scottcarey], [~jackytan], sorry for the delay. I didn't see

[jira] [Updated] (PARQUET-1407) Data loss on duplicate values with AvroParquetWriter/Reader

2018-11-15 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1407: --- Affects Version/s: 1.10.0 > Data loss on duplicate values with AvroParquetWriter/Reader >

[jira] [Commented] (PARQUET-1457) Data set integrity tool

2018-11-12 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16684275#comment-16684275 ] Ryan Blue commented on PARQUET-1457: [~gershinsky], this sounds like a reasonable extension to a

[jira] [Commented] (PARQUET-1414) Limit page size based on maximum row count

2018-10-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653995#comment-16653995 ] Ryan Blue commented on PARQUET-1414: [~gszadovszky], can you add a link to your benchmarks to this

[jira] [Commented] (PARQUET-1432) ACID support

2018-10-01 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16634298#comment-16634298 ] Ryan Blue commented on PARQUET-1432: [~yumwang], ACID guarantees are a feature of the table layout,

[jira] [Commented] (PARQUET-1201) Column indexes

2018-09-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16631130#comment-16631130 ] Ryan Blue commented on PARQUET-1201: [~gszadovszky], where is the branch for page skipping? Is it

[jira] [Commented] (PARQUET-632) Parquet file in invalid state while writing to S3 from EMR

2018-08-15 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581275#comment-16581275 ] Ryan Blue commented on PARQUET-632: --- [~pkgajulapalli], can you go ahead and post the stack trace? I

[jira] [Commented] (PARQUET-632) Parquet file in invalid state while writing to S3 from EMR

2018-08-14 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16579966#comment-16579966 ] Ryan Blue commented on PARQUET-632: --- [~pkgajulapalli], there isn't enough information here to know

[jira] [Created] (PARQUET-1341) Null count is suppressed when columns have no min or max and use unsigned sort order

2018-06-28 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-1341: -- Summary: Null count is suppressed when columns have no min or max and use unsigned sort order Key: PARQUET-1341 URL: https://issues.apache.org/jira/browse/PARQUET-1341

[jira] [Updated] (PARQUET-381) It should be possible to merge summary files, and control which files are generated

2018-05-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-381: -- Fix Version/s: (was: 2.0.0) 1.9.0 > It should be possible to merge summary

[jira] [Commented] (PARQUET-381) It should be possible to merge summary files, and control which files are generated

2018-05-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16491350#comment-16491350 ] Ryan Blue commented on PARQUET-381: --- Fixed. Thanks for pointing this out. > It should be possible to

[jira] [Updated] (PARQUET-1309) Parquet Java uses incorrect stats and dictionary filter properties

2018-05-24 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1309: --- Description: In SPARK-24251, we found that the changes to use HadoopReadOptions accidentally

[jira] [Created] (PARQUET-1309) Parquet Java uses incorrect stats and dictionary filter properties

2018-05-24 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-1309: -- Summary: Parquet Java uses incorrect stats and dictionary filter properties Key: PARQUET-1309 URL: https://issues.apache.org/jira/browse/PARQUET-1309 Project: Parquet

[jira] [Commented] (PARQUET-1295) Parquet libraries do not follow proper semantic versioning

2018-05-21 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16483153#comment-16483153 ] Ryan Blue commented on PARQUET-1295: Since there is not a well-defined public API, I understand how

[jira] [Resolved] (PARQUET-1189) Release Parquet Java 1.10

2018-04-20 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1189. Resolution: Fixed > Release Parquet Java 1.10 > - > >

[jira] [Resolved] (PARQUET-1264) Update Javadoc for Java 1.8

2018-04-05 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1264. Resolution: Fixed > Update Javadoc for Java 1.8 > --- > >

[jira] [Commented] (PARQUET-1253) Support for new logical type representation

2018-04-04 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16425994#comment-16425994 ] Ryan Blue commented on PARQUET-1253: Not including the UUID logical type in that union is probably

[jira] [Created] (PARQUET-1264) Update Javadoc for Java 1.8

2018-03-30 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-1264: -- Summary: Update Javadoc for Java 1.8 Key: PARQUET-1264 URL: https://issues.apache.org/jira/browse/PARQUET-1264 Project: Parquet Issue Type: Improvement

[jira] [Resolved] (PARQUET-1263) ParquetReader's builder should use Configuration from the InputFile

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1263. Resolution: Fixed Assignee: Ryan Blue Merged #464. > ParquetReader's builder should use

[jira] [Resolved] (PARQUET-1183) AvroParquetWriter needs OutputFile based Builder

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1183. Resolution: Fixed Assignee: Ryan Blue Merged #460. Thanks [~zi] for reviewing! >

[jira] [Created] (PARQUET-1263) ParquetReader's builder should use Configuration from the InputFile

2018-03-30 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-1263: -- Summary: ParquetReader's builder should use Configuration from the InputFile Key: PARQUET-1263 URL: https://issues.apache.org/jira/browse/PARQUET-1263 Project: Parquet

[jira] [Updated] (PARQUET-1263) ParquetReader's builder should use Configuration from the InputFile

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1263: --- Fix Version/s: 1.10.0 > ParquetReader's builder should use Configuration from the InputFile >

[jira] [Resolved] (PARQUET-1184) Make DelegatingPositionOutputStream a concrete class

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1184. Resolution: Won't Fix Fix Version/s: (was: 1.10.0) > Make

[jira] [Commented] (PARQUET-1184) Make DelegatingPositionOutputStream a concrete class

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16420982#comment-16420982 ] Ryan Blue commented on PARQUET-1184: The reason why this is an abstract class is so that you can use

[jira] [Updated] (PARQUET-1028) [JAVA] When reading old Spark-generated files with INT96, stats are reported as valid when they aren't

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1028: --- Fix Version/s: 1.10.0 > [JAVA] When reading old Spark-generated files with INT96, stats are

[jira] [Resolved] (PARQUET-1028) [JAVA] When reading old Spark-generated files with INT96, stats are reported as valid when they aren't

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1028. Resolution: Fixed Assignee: Zoltan Ivanfi > [JAVA] When reading old Spark-generated files

[jira] [Commented] (PARQUET-1028) [JAVA] When reading old Spark-generated files with INT96, stats are reported as valid when they aren't

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16420962#comment-16420962 ] Ryan Blue commented on PARQUET-1028: This was fixed by PARQUET-1065. The expected sort order for

[jira] [Updated] (PARQUET-1055) Improve the creation of ExecutorService when reading footers

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1055: --- Fix Version/s: (was: 1.9.1) > Improve the creation of ExecutorService when reading footers >

[jira] [Updated] (PARQUET-1028) [JAVA] When reading old Spark-generated files with INT96, stats are reported as valid when they aren't

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1028: --- Fix Version/s: (was: 1.9.1) > [JAVA] When reading old Spark-generated files with INT96, stats

[jira] [Updated] (PARQUET-1174) Concurrent read micro benchmarks

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1174: --- Fix Version/s: (was: 1.9.1) > Concurrent read micro benchmarks >

[jira] [Updated] (PARQUET-796) Delta Encoding is not used when dictionary enabled

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-796: -- Fix Version/s: (was: 1.9.1) > Delta Encoding is not used when dictionary enabled >

[jira] [Updated] (PARQUET-1153) Parquet-thrift doesn't compile with Thrift 0.10.0

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1153: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Parquet-thrift doesn't compile with

[jira] [Updated] (PARQUET-1135) upgrade thrift and protobuf dependencies

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1135: --- Fix Version/s: (was: 1.9.1) 1.10.0 > upgrade thrift and protobuf

[jira] [Resolved] (PARQUET-777) Add new Parquet CLI tools

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-777. --- Resolution: Fixed > Add new Parquet CLI tools > - > > Key:

[jira] [Updated] (PARQUET-1152) Parquet-thrift doesn't compile with Thrift 0.9.3

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1152: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Parquet-thrift doesn't compile with

[jira] [Updated] (PARQUET-777) Add new Parquet CLI tools

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-777: -- Fix Version/s: (was: 1.9.1) 1.10.0 > Add new Parquet CLI tools >

[jira] [Updated] (PARQUET-1115) Warn users when misusing parquet-tools merge

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1115: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Warn users when misusing parquet-tools

[jira] [Updated] (PARQUET-1149) Upgrade Avro dependancy to 1.8.2

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1149: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Upgrade Avro dependancy to 1.8.2 >

[jira] [Updated] (PARQUET-1141) IDs are dropped in metadata conversion

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1141: --- Fix Version/s: (was: 1.9.1) 1.10.0 > IDs are dropped in metadata conversion

[jira] [Updated] (PARQUET-1025) Support new min-max statistics in parquet-mr

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1025: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Support new min-max statistics in

[jira] [Updated] (PARQUET-1077) [MR] Switch to long key ids in KEYs file

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1077: --- Fix Version/s: (was: 1.9.1) > [MR] Switch to long key ids in KEYs file >

[jira] [Updated] (PARQUET-791) Predicate pushing down on missing columns should work on UserDefinedPredicate too

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-791: -- Fix Version/s: (was: 1.9.1) 1.10.0 > Predicate pushing down on missing columns

[jira] [Updated] (PARQUET-1024) allow for case insensitive parquet-xxx prefix in PR title

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1024: --- Fix Version/s: (was: 1.9.1) 1.10.0 > allow for case insensitive parquet-xxx

[jira] [Updated] (PARQUET-1005) Fix DumpCommand parsing to allow column projection

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1005: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Fix DumpCommand parsing to allow column

[jira] [Updated] (PARQUET-801) Allow UserDefinedPredicates in DictionaryFilter

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-801: -- Fix Version/s: (was: 1.9.1) 1.10.0 > Allow UserDefinedPredicates in

[jira] [Updated] (PARQUET-321) Set the HDFS padding default to 8MB

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-321: -- Fix Version/s: (was: 1.9.1) 1.10.0 > Set the HDFS padding default to 8MB >

[jira] [Commented] (PARQUET-1222) Definition of float and double sort order is ambiguous

2018-03-23 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16412154#comment-16412154 ] Ryan Blue commented on PARQUET-1222: I think Jim is right. IEEE-754 numbers are ordered correctly if

[jira] [Commented] (PARQUET-1241) Use LZ4 frame format

2018-03-12 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16395711#comment-16395711 ] Ryan Blue commented on PARQUET-1241: Does anyone know what the Hadoop compression codec produces?

[jira] [Commented] (PARQUET-1238) Invalid links found in parquet site document page

2018-02-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16379687#comment-16379687 ] Ryan Blue commented on PARQUET-1238: I didn't realize the patch was for the SVN site. Thanks, I'll

[jira] [Commented] (PARQUET-1238) Invalid links found in parquet site document page

2018-02-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16378969#comment-16378969 ] Ryan Blue commented on PARQUET-1238: [~xuchuanyin], thanks for fixing this. Could you post your

[jira] [Comment Edited] (PARQUET-796) Delta Encoding is not used when dictionary enabled

2018-02-26 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16377382#comment-16377382 ] Ryan Blue edited comment on PARQUET-796 at 2/26/18 7:06 PM: I don't recommend

[jira] [Commented] (PARQUET-1234) Release Parquet format 2.5.0

2018-02-21 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16371749#comment-16371749 ] Ryan Blue commented on PARQUET-1234: Are we going to release a 2.4.1 with the changes for column

[jira] [Resolved] (PARQUET-787) Add a size limit for heap allocations when reading

2018-02-21 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-787. --- Resolution: Fixed Fix Version/s: 1.10.0 Merged #390. > Add a size limit for heap allocations

[jira] [Commented] (PARQUET-860) ParquetWriter.getDataSize NullPointerException after closed

2018-02-20 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370458#comment-16370458 ] Ryan Blue commented on PARQUET-860: --- The S3 file system implementation should retry and recover if it

[jira] [Commented] (PARQUET-860) ParquetWriter.getDataSize NullPointerException after closed

2018-02-20 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370317#comment-16370317 ] Ryan Blue commented on PARQUET-860: --- [~e.birukov], this issue is not related to the problem you're

[jira] [Resolved] (PARQUET-1215) Add accessor for footer after a file is closed

2018-02-15 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1215. Resolution: Fixed Merged #457. Thanks to [~zi] and [~gszadovszky] for the reviews! > Add

  1   2   3   4   >