[GitHub] [arrow-testing] pitrou opened a new pull request #54: ARROW-11162: Add Parquet fuzz regression file

2021-01-07 Thread GitBox
pitrou opened a new pull request #54: URL: https://github.com/apache/arrow-testing/pull/54 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow-testing] pitrou merged pull request #54: ARROW-11162: Add Parquet fuzz regression file

2021-01-07 Thread GitBox
pitrou merged pull request #54: URL: https://github.com/apache/arrow-testing/pull/54 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow-testing] mrkn opened a new pull request #55: ARROW-11470: Add IPC tensor stream fuzz regression files

2021-02-02 Thread GitBox
mrkn opened a new pull request #55: URL: https://github.com/apache/arrow-testing/pull/55 Add test case files downloaded from issues below: - https://oss-fuzz.com/testcase-detail/6583463383793664 - https://oss-fuzz.com/testcase-detail/623225726408 -

[GitHub] [arrow-testing] mrkn commented on pull request #55: ARROW-11470: Add IPC tensor stream fuzz regression files

2021-02-02 Thread GitBox
mrkn commented on pull request #55: URL: https://github.com/apache/arrow-testing/pull/55#issuecomment-77764 @pitrou I'll merge this for working on apache/arrow#9395. Please tell me if there is any problem. This is an au

[GitHub] [arrow-testing] mrkn merged pull request #55: ARROW-11470: Add IPC tensor stream fuzz regression files

2021-02-02 Thread GitBox
mrkn merged pull request #55: URL: https://github.com/apache/arrow-testing/pull/55 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to t

[GitHub] [arrow-testing] pitrou opened a new pull request #56: ARROW-11417: Add integration files for buffer compression

2021-02-03 Thread GitBox
pitrou opened a new pull request #56: URL: https://github.com/apache/arrow-testing/pull/56 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow-testing] pitrou commented on pull request #56: ARROW-11417: Add integration files for buffer compression

2021-02-03 Thread GitBox
pitrou commented on pull request #56: URL: https://github.com/apache/arrow-testing/pull/56#issuecomment-772592633 I'm not sure what the naming scheme should be. I chose "1.0.0" for the format version, but is it the Arrow library version that generated the files instead? (in which case it s

[GitHub] [arrow-testing] pitrou commented on pull request #56: ARROW-11417: Add integration files for buffer compression

2021-02-03 Thread GitBox
pitrou commented on pull request #56: URL: https://github.com/apache/arrow-testing/pull/56#issuecomment-772601749 The JSON files are refused by the Java Arrow integration test: ``` Incompatible files Expected END_OBJECT but got FIELD_NAME 16:30:12.557 [main] ERROR org.apache.arro

[GitHub] [arrow-testing] pitrou commented on pull request #56: ARROW-11417: Add integration files for buffer compression

2021-02-03 Thread GitBox
pitrou commented on pull request #56: URL: https://github.com/apache/arrow-testing/pull/56#issuecomment-772617773 Ok, I edited the JSON files by hand to remove the offending field and now Java manages to read them (**cringe**), though it fails to instantiate the compression: ``` java

[GitHub] [arrow-testing] pitrou edited a comment on pull request #56: ARROW-11417: Add integration files for buffer compression

2021-02-03 Thread GitBox
pitrou edited a comment on pull request #56: URL: https://github.com/apache/arrow-testing/pull/56#issuecomment-772592633 I'm not sure what the naming scheme should be. I chose "1.0.0" for the format version, but is it the Arrow library version that generated the files instead? (in which ca

[GitHub] [arrow-testing] emkornfield commented on pull request #56: ARROW-11417: Add integration files for buffer compression

2021-02-03 Thread GitBox
emkornfield commented on pull request #56: URL: https://github.com/apache/arrow-testing/pull/56#issuecomment-772638909 Strange, I wonder why main integration tests aren't broken for the JSON issue. Regarding versioning, I used the library version that generated them. It seems like a bug

[GitHub] [arrow-testing] pitrou commented on pull request #56: ARROW-11417: Add integration files for buffer compression

2021-02-03 Thread GitBox
pitrou commented on pull request #56: URL: https://github.com/apache/arrow-testing/pull/56#issuecomment-772679092 > Strange, I wonder why main integration tests aren't broken for the JSON issue. Because the JSON files that are used in the integration tests are generated by the Pyth

[GitHub] [arrow-testing] pitrou commented on pull request #56: ARROW-11417: Add integration files for buffer compression

2021-02-03 Thread GitBox
pitrou commented on pull request #56: URL: https://github.com/apache/arrow-testing/pull/56#issuecomment-772679327 I renamed to "2.0.0-compression", will merge. This is an automated message from the Apache Git Service. To resp

[GitHub] [arrow-testing] pitrou merged pull request #56: ARROW-11417: Add integration files for buffer compression

2021-02-03 Thread GitBox
pitrou merged pull request #56: URL: https://github.com/apache/arrow-testing/pull/56 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow-testing] pitrou opened a new pull request #57: ARROW-11559: Add fuzz regression file

2021-02-10 Thread GitBox
pitrou opened a new pull request #57: URL: https://github.com/apache/arrow-testing/pull/57 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow-testing] pitrou merged pull request #57: ARROW-11559: Add fuzz regression file

2021-02-10 Thread GitBox
pitrou merged pull request #57: URL: https://github.com/apache/arrow-testing/pull/57 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow-testing] kiszk opened a new pull request #58: ARROW-8797: Add golden files for decimal256

2021-02-21 Thread GitBox
kiszk opened a new pull request #58: URL: https://github.com/apache/arrow-testing/pull/58 This PR adds golden files for decimal256 in big and little endians. This is an automated message from the Apache Git Service. To respon

[GitHub] [arrow-testing] kiszk commented on pull request #58: ARROW-11666: Add golden files for decimal256

2021-02-21 Thread GitBox
kiszk commented on pull request #58: URL: https://github.com/apache/arrow-testing/pull/58#issuecomment-782888672 @pitrou Could you please review this? It works well on my local machine. This is an automated message from the A

[GitHub] [arrow-testing] kiszk edited a comment on pull request #58: ARROW-11666: Add golden files for decimal256

2021-02-21 Thread GitBox
kiszk edited a comment on pull request #58: URL: https://github.com/apache/arrow-testing/pull/58#issuecomment-782888672 @pitrou Could you please review this? It works well on my local big-endian machine. This is an automated

[GitHub] [arrow-testing] pitrou commented on pull request #58: ARROW-11666: Add golden files for decimal256

2021-02-22 Thread GitBox
pitrou commented on pull request #58: URL: https://github.com/apache/arrow-testing/pull/58#issuecomment-783354606 Thanks. Can you also remove the "large batch" files in the endian directories? They are heavy to run and don't bring anything IMHO. ---

[GitHub] [arrow-testing] kiszk commented on pull request #58: ARROW-11666: Add golden files for decimal256

2021-02-24 Thread GitBox
kiszk commented on pull request #58: URL: https://github.com/apache/arrow-testing/pull/58#issuecomment-785697113 Sure, done This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow-testing] pitrou merged pull request #58: ARROW-11666: Add golden files for decimal256

2021-02-25 Thread GitBox
pitrou merged pull request #58: URL: https://github.com/apache/arrow-testing/pull/58 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow-testing] pitrou commented on pull request #58: ARROW-11666: Add golden files for decimal256

2021-02-25 Thread GitBox
pitrou commented on pull request #58: URL: https://github.com/apache/arrow-testing/pull/58#issuecomment-785888734 Now you need to update the testing submodule in the Arrow main repo :-) This is an automated message from the A

[GitHub] [arrow-testing] jmgpeeters opened a new pull request #59: ARROW-11838: files for testing IPC reads with shared dictionaries.

2021-03-04 Thread GitBox
jmgpeeters opened a new pull request #59: URL: https://github.com/apache/arrow-testing/pull/59 ARROW-11838 aims to add C++ read capability for IPC data with shared dictionaries. Write support isn't available yet either, and out of scope here, so these files - allowing testing of th

[GitHub] [arrow-testing] pitrou commented on pull request #59: ARROW-11838: files for testing IPC reads with shared dictionaries.

2021-03-11 Thread GitBox
pitrou commented on pull request #59: URL: https://github.com/apache/arrow-testing/pull/59#issuecomment-796719583 @jmgpeeters It seems these should go into the "golden files" used for integration testing, see https://github.com/apache/arrow-testing/tree/master/data/arrow-ipc-stream/integra

[GitHub] [arrow-testing] jmgpeeters commented on pull request #59: ARROW-11838: files for testing IPC reads with shared dictionaries.

2021-03-11 Thread GitBox
jmgpeeters commented on pull request #59: URL: https://github.com/apache/arrow-testing/pull/59#issuecomment-796737078 Ah, thanks, I wasn't aware of the Archery integration suite. Had a quick glance, and seems to make sense. Was a bit worried it would require support in all languages for sh

[GitHub] [arrow-testing] pitrou commented on pull request #59: ARROW-11838: files for testing IPC reads with shared dictionaries.

2021-03-11 Thread GitBox
pitrou commented on pull request #59: URL: https://github.com/apache/arrow-testing/pull/59#issuecomment-796768922 Indeed, the JSON format doesn't support it, so that will be a problem if we want to do roundtripping tests with the integration machinery. However, I think we can still use t

[GitHub] [arrow-testing] jmgpeeters commented on pull request #59: ARROW-11838: files for testing IPC reads with shared dictionaries.

2021-03-11 Thread GitBox
jmgpeeters commented on pull request #59: URL: https://github.com/apache/arrow-testing/pull/59#issuecomment-796777534 Agreed. I'll make the changes and get back to you. This is an automated message from the Apache Git Service

[GitHub] [arrow-testing] jmgpeeters commented on pull request #59: ARROW-11838: files for testing IPC reads with shared dictionaries.

2021-03-17 Thread GitBox
jmgpeeters commented on pull request #59: URL: https://github.com/apache/arrow-testing/pull/59#issuecomment-801014168 @pitrou I've moved the golden files into 4.0.0-shareddicts and created a json.gz to validate against. Is 4.0.0 right? Not quite sure what the convention is, but - assumin

[GitHub] [arrow-testing] pitrou commented on pull request #59: ARROW-11838: files for testing IPC reads with shared dictionaries.

2021-03-17 Thread GitBox
pitrou commented on pull request #59: URL: https://github.com/apache/arrow-testing/pull/59#issuecomment-801043407 @jmgpeeters That sounds right. AFAIU, the version number is the Arrow release (at least C++ or Java) that these files are supposed to be compatible with.

[GitHub] [arrow-testing] pitrou merged pull request #59: ARROW-11838: files for testing IPC reads with shared dictionaries.

2021-03-17 Thread GitBox
pitrou merged pull request #59: URL: https://github.com/apache/arrow-testing/pull/59 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow-testing] jmgpeeters opened a new pull request #60: ARROW-11838: fix offset buffer in golden file.

2021-03-18 Thread GitBox
jmgpeeters opened a new pull request #60: URL: https://github.com/apache/arrow-testing/pull/60 Chasing the Java integration errors in https://github.com/apache/arrow/pull/9629 (although C++ passes), it appears I misunderstood the correct layout for the offset buffer (for ["foo", "bar", "b

[GitHub] [arrow-testing] pitrou commented on pull request #60: ARROW-11838: fix offset buffer in golden file.

2021-03-18 Thread GitBox
pitrou commented on pull request #60: URL: https://github.com/apache/arrow-testing/pull/60#issuecomment-801922086 Hmm, C++ shouldn't have accepted the non-conformant buffer. Can you try to find where the missing checks need to be added?

[GitHub] [arrow-testing] jmgpeeters commented on pull request #60: ARROW-11838: fix offset buffer in golden file.

2021-03-18 Thread GitBox
jmgpeeters commented on pull request #60: URL: https://github.com/apache/arrow-testing/pull/60#issuecomment-801926268 Yep, will do. This is an automated message from the Apache Git Service. To respond to the message, please l

[GitHub] [arrow-testing] jmgpeeters commented on pull request #60: ARROW-11838: fix offset buffer in golden file.

2021-03-19 Thread GitBox
jmgpeeters commented on pull request #60: URL: https://github.com/apache/arrow-testing/pull/60#issuecomment-802687306 @pitrou I've implemented some additional checks in the Arrow sister PR. The C++ integration check (https://github.com/apache/arrow/pull/9629/checks?check_run_id=2142504660)

[GitHub] [arrow-testing] pitrou merged pull request #60: ARROW-11838: fix offset buffer in golden file.

2021-03-23 Thread GitBox
pitrou merged pull request #60: URL: https://github.com/apache/arrow-testing/pull/60 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, pleas

[GitHub] [arrow-testing] pitrou opened a new pull request #61: ARROW-12956: Add Parquet fuzz regression file

2021-06-03 Thread GitBox
pitrou opened a new pull request #61: URL: https://github.com/apache/arrow-testing/pull/61 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[GitHub] [arrow-testing] pitrou merged pull request #61: ARROW-12956: Add Parquet fuzz regression file

2021-06-03 Thread GitBox
pitrou merged pull request #61: URL: https://github.com/apache/arrow-testing/pull/61 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, pleas

[GitHub] [arrow-testing] pitrou merged pull request #61: ARROW-12956: Add Parquet fuzz regression file

2021-06-04 Thread GitBox
pitrou merged pull request #61: URL: https://github.com/apache/arrow-testing/pull/61 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, pleas

[GitHub] [arrow-testing] Igosuki opened a new pull request #62: These are translated copies of the parquet testing files to avro.

2021-08-26 Thread GitBox
Igosuki opened a new pull request #62: URL: https://github.com/apache/arrow-testing/pull/62 N.B. : I used spark for the translation so there is some additional metadata in the files, but they can be removed. -- This is an automated message from the Apache Git Service. To respond to the m

[GitHub] [arrow-testing] pitrou commented on pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-08-29 Thread GitBox
pitrou commented on pull request #62: URL: https://github.com/apache/arrow-testing/pull/62#issuecomment-907779226 This seems fine to me, but can you add a README explaining what these files are and how they were obtained? -- This is an automated message from the Apache Git Service. To re

[GitHub] [arrow-testing] kiszk commented on pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-08-29 Thread GitBox
kiszk commented on pull request #62: URL: https://github.com/apache/arrow-testing/pull/62#issuecomment-907837124 Looks good to me -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

[GitHub] [arrow-testing] alamb commented on pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-08-30 Thread GitBox
alamb commented on pull request #62: URL: https://github.com/apache/arrow-testing/pull/62#issuecomment-908261976 @Igosuki -- I added a basic README in https://github.com/apache/arrow-testing/pull/62/commits/8d306efa213b859645c68f60ee1f6db0b1997b43 -- can you provide the command you used t

[GitHub] [arrow-testing] Igosuki commented on pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-08-31 Thread GitBox
Igosuki commented on pull request #62: URL: https://github.com/apache/arrow-testing/pull/62#issuecomment-909371276 @alamb https://gist.github.com/Igosuki/324b011f40185269d3fc552350d21744 -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [arrow-testing] Igosuki commented on pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-08-31 Thread GitBox
Igosuki commented on pull request #62: URL: https://github.com/apache/arrow-testing/pull/62#issuecomment-909372257 It would be possible to use arrow-python and fastavro to achieve the same, I just have a lot of Spark experience and I prefer typed so I went that way. -- This is an automat

[GitHub] [arrow-testing] Igosuki commented on pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-09-01 Thread GitBox
Igosuki commented on pull request #62: URL: https://github.com/apache/arrow-testing/pull/62#issuecomment-909371276 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubs

[GitHub] [arrow-testing] pitrou merged pull request #63: ARROW-13846: Add fuzz testing regression files

2021-09-01 Thread GitBox
pitrou merged pull request #63: URL: https://github.com/apache/arrow-testing/pull/63 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] pitrou merged pull request #64: ARROW-13850: Add Parquet fuzz regression files

2021-09-01 Thread GitBox
pitrou merged pull request #64: URL: https://github.com/apache/arrow-testing/pull/64 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] alamb commented on pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-09-09 Thread GitBox
alamb commented on pull request #62: URL: https://github.com/apache/arrow-testing/pull/62#issuecomment-916343370 Thanks @Igosuki ! I am sorry for the delayed response -- I am catching up from being on vacation and hope to help push your contributions over the line real soon now -- This

[GitHub] [arrow-testing] alamb commented on a change in pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-09-09 Thread GitBox
alamb commented on a change in pull request #62: URL: https://github.com/apache/arrow-testing/pull/62#discussion_r705603662 ## File path: data/avro/README.md ## @@ -0,0 +1,7 @@ +This directory contains AVRO files corresponding to the parquet testing files at https://github.com

[GitHub] [arrow-testing] alamb merged pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-09-09 Thread GitBox
alamb merged pull request #62: URL: https://github.com/apache/arrow-testing/pull/62 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubsc

[GitHub] [arrow-testing] Igosuki commented on pull request #62: Add basic AVRO files (translated copies of the parquet testing files to avro)

2021-09-09 Thread GitBox
Igosuki commented on pull request #62: URL: https://github.com/apache/arrow-testing/pull/62#issuecomment-916663346 All good :+1: -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

[GitHub] [arrow-testing] pitrou merged pull request #65: ARROW-14285: Add IPC fuzz regression files

2021-10-11 Thread GitBox
pitrou merged pull request #65: URL: https://github.com/apache/arrow-testing/pull/65 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] pitrou merged pull request #66: ARROW-14593: Add fuzz regression files

2021-11-04 Thread GitBox
pitrou merged pull request #66: URL: https://github.com/apache/arrow-testing/pull/66 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] pitrou merged pull request #66: ARROW-14593: Add fuzz regression files

2021-11-05 Thread GitBox
pitrou merged pull request #66: URL: https://github.com/apache/arrow-testing/pull/66 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] pitrou merged pull request #67: ARROW-14664: Add Parquet fuzz regression files

2021-11-10 Thread GitBox
pitrou merged pull request #67: URL: https://github.com/apache/arrow-testing/pull/67 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] pitrou merged pull request #68: ARROW-14721: Add Parquet fuzz regression files

2021-11-16 Thread GitBox
pitrou merged pull request #68: URL: https://github.com/apache/arrow-testing/pull/68 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] pitrou merged pull request #69: ARROW-14791: Add fuzz regression file

2021-11-22 Thread GitBox
pitrou merged pull request #69: URL: https://github.com/apache/arrow-testing/pull/69 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] pitrou merged pull request #70: ARROW-14792: Add Parquet fuzz regression files

2021-11-22 Thread GitBox
pitrou merged pull request #70: URL: https://github.com/apache/arrow-testing/pull/70 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] pitrou merged pull request #71: ARROW-15031: Add Parquet fuzz regression file

2021-12-08 Thread GitBox
pitrou merged pull request #71: URL: https://github.com/apache/arrow-testing/pull/71 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] chadbrewbaker opened a new pull request #72: Adding emptylist regression

2021-12-20 Thread GitBox
chadbrewbaker opened a new pull request #72: URL: https://github.com/apache/arrow-testing/pull/72 See https://github.com/apache/arrow-rs/issues/1036 Serialization of {"emptylist":[]} is causing a panic in arrow-rs. -- This is an automated message from the Apache Git Service. To res

[GitHub] [arrow-julia] kou opened a new issue #273: Add GitHub Actions used in this repository to allow list

2021-12-29 Thread GitBox
kou opened a new issue #273: URL: https://github.com/apache/arrow-julia/issues/273 `apache/*` repositories can use only allowed GitHub Actions in `.github/workflows/`. We need to ask INFRA to add the following GitHub Actions to allow list: * `julia-actions/julia-buildpkg@v1.2`

[GitHub] [arrow-julia] kou commented on issue #273: Add GitHub Actions used in this repository to allow list

2021-12-29 Thread GitBox
kou commented on issue #273: URL: https://github.com/apache/arrow-julia/issues/273#issuecomment-1002772335 I've opened INFRA issue for this: https://issues.apache.org/jira/browse/INFRA-22676 -- This is an automated message from the Apache Git Service. To respond to the message, please lo

[GitHub] [arrow-julia] kou opened a new issue #271: Configure repository metadata

2021-12-29 Thread GitBox
kou opened a new issue #271: URL: https://github.com/apache/arrow-julia/issues/271 We can configure this repository's metadata by `.asf.yaml`. See also: https://cwiki.apache.org/confluence/display/INFRA/Git+-+.asf.yaml+features We should change at least notification configurations. A

[GitHub] [arrow-julia] kou commented on issue #271: Configure repository metadata

2021-12-29 Thread GitBox
kou commented on issue #271: URL: https://github.com/apache/arrow-julia/issues/271#issuecomment-1002774494 We want to send only "issue create" notifications to `issues@arrow.apache.org`. The current configuration sends all issue related notifications to `issues@arrow.apache.org`. --

[GitHub] [arrow-julia] jakkosdev opened a new issue #275: Error with CSV.Chunks and InlineStrings

2021-12-30 Thread GitBox
jakkosdev opened a new issue #275: URL: https://github.com/apache/arrow-julia/issues/275 `Arrow.write("testcsv.arrow", CSV.Chunks("data.csv"), ntasks=2)` with data.csv: ``` heading1,heading2,heading3 test1, test2, 3 test1, test2, 3 test1, test2, 3 test1, test2, 3

[GitHub] [arrow-julia] quinnj closed issue #265: Transfer repository to apache organization

2021-12-30 Thread GitBox
quinnj closed issue #265: URL: https://github.com/apache/arrow-julia/issues/265 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...

[GitHub] [arrow-julia] quinnj commented on issue #265: Transfer repository to apache organization

2021-12-30 Thread GitBox
quinnj commented on issue #265: URL: https://github.com/apache/arrow-julia/issues/265#issuecomment-1003086025 Repo has been transferred -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specif

[GitHub] [arrow-julia] kou closed issue #273: Add GitHub Actions used in this repository to allow list

2022-01-21 Thread GitBox
kou closed issue #273: URL: https://github.com/apache/arrow-julia/issues/273 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@ar

[GitHub] [arrow-julia] kou closed issue #253: error when metadata is present but empty?

2022-01-21 Thread GitBox
kou closed issue #253: URL: https://github.com/apache/arrow-julia/issues/253 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@ar

[GitHub] [arrow-julia] quinnj opened a new issue #279: New package release process

2022-01-25 Thread GitBox
quinnj opened a new issue #279: URL: https://github.com/apache/arrow-julia/issues/279 In #278, there was some brief discussion around what the new release process looks like now that the repo has been transferred. I'd like to propose the following and we can see if this will work in terms

[GitHub] [arrow-julia] sl-solution opened a new issue #280: Allow missing type without converting to vector

2022-01-26 Thread GitBox
sl-solution opened a new issue #280: URL: https://github.com/apache/arrow-julia/issues/280 Not sure if it makes sense, but, would it be possible to allow missing type without copying the underlining arrow vector? As far as I understand, allowing missing only changes the `Type` of arrow ve

[GitHub] [arrow-testing] westonpace opened a new pull request #74: ARROW-15425: [Integration] Add delta dictionaries in file format to integration tests

2022-01-28 Thread GitBox
westonpace opened a new pull request #74: URL: https://github.com/apache/arrow-testing/pull/74 This adds an example IPC file containing a delta dictionary for both the file and the streaming IPC format. It requires a small change to the integration programs (https://github.com/apache/arro

[GitHub] [arrow-testing] pitrou merged pull request #73: PARQUET-2115: [C++] Add parquet regression files.

2022-01-31 Thread GitBox
pitrou merged pull request #73: URL: https://github.com/apache/arrow-testing/pull/73 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-julia] Poncito opened a new issue #281: NTuple with custom type and compression

2022-01-31 Thread GitBox
Poncito opened a new issue #281: URL: https://github.com/apache/arrow-julia/issues/281 Hello, I have a custom type defined this way: ```julia struct Char8 <: AbstractChar x::UInt8 end Char8(x::Integer) = Char8(UInt8(x)) Char8(x::AbstractChar) = Char8(UInt8(x)

[GitHub] [arrow-testing] pitrou merged pull request #75: PARQUET-2119: Add testcase from fuzzer.

2022-02-08 Thread GitBox
pitrou merged pull request #75: URL: https://github.com/apache/arrow-testing/pull/75 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-julia] NHDaly opened a new issue #282: Allow constructing an Arrow stream/file from columnar data with no column names

2022-02-08 Thread GitBox
NHDaly opened a new issue #282: URL: https://github.com/apache/arrow-julia/issues/282 We have a data source (Relations from our database engine at RelationalAI) that have _columnar data_, but without column names. (We represent a Relation as a Set of Tuples, e.g. `movie_title` relates movi

[GitHub] [arrow-julia] baumgold opened a new issue #283: Unexpected table schema

2022-02-12 Thread GitBox
baumgold opened a new issue #283: URL: https://github.com/apache/arrow-julia/issues/283 It appears that Arrow.jl somehow incorrectly handles columns with nullable types (Union with Nothing) as the schema of the resulting table containing such a column doesn't match what's reported by `Tabl

[GitHub] [arrow-julia] quinnj opened a new issue #284: Future of the Julia arrow implementation

2022-02-15 Thread GitBox
quinnj opened a new issue #284: URL: https://github.com/apache/arrow-julia/issues/284 Hey all, I'm opening this issue to facilitate some discussion around the current state of and future direction of the Julia arrow implementation (that lives in this repo). As the original primary c

[GitHub] [arrow-julia] baumgold opened a new issue #285: Dense Union incompatible between Julia/Python

2022-02-16 Thread GitBox
baumgold opened a new issue #285: URL: https://github.com/apache/arrow-julia/issues/285 When writing a table with Arrow.jl that contains a nullable column, the Arrow data cannot be read by Pyarrow. Note that when using Missing instead of Nothing Pyarrow can read the data written by Arrow.

[GitHub] [arrow-julia] kou opened a new issue #286: Check license automatically

2022-02-22 Thread GitBox
kou opened a new issue #286: URL: https://github.com/apache/arrow-julia/issues/286 We must satisfy the Apache-2.0 license and not include code the "Category X" https://apache.org/legal/resolved.html#category-x . There is a tool to check this: https://creadur.apache.org/rat/ We

[GitHub] [arrow-julia] kou opened a new issue #287: Implement release script

2022-02-22 Thread GitBox
kou opened a new issue #287: URL: https://github.com/apache/arrow-julia/issues/287 Most of release process can be automated like https://github.com/apache/arrow-rs/blob/master/dev/release/release-tarball.sh . We should have a release script to decrease maintenance cost. -- This is

[GitHub] [arrow-julia] kou opened a new issue #288: Implement release verification script

2022-02-22 Thread GitBox
kou opened a new issue #288: URL: https://github.com/apache/arrow-julia/issues/288 There aren't PMC members who are familiar with Julia. It's helpful to provide a verification script for them. They can't vote without verification. See also: https://github.com/apache/arrow-rs/blob

[GitHub] [arrow-julia] quinnj closed issue #286: Check license automatically

2022-02-22 Thread GitBox
quinnj closed issue #286: URL: https://github.com/apache/arrow-julia/issues/286 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...

[GitHub] [arrow-julia] baumgold opened a new issue #291: Add support for reading partitioned datasets

2022-02-24 Thread GitBox
baumgold opened a new issue #291: URL: https://github.com/apache/arrow-julia/issues/291 It would be very useful to support reading (and potentially writing) partitioned datasets. The implementation could be similar to JuliaIO/Parquet.jl#138/JuliaIO/Parquet.jl#142 and would bring Arrow.jl

[GitHub] [arrow-julia] quinnj closed issue #287: Implement release script

2022-02-25 Thread GitBox
quinnj closed issue #287: URL: https://github.com/apache/arrow-julia/issues/287 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...

[GitHub] [arrow-julia] kou closed issue #288: Implement release verification script

2022-02-25 Thread GitBox
kou closed issue #288: URL: https://github.com/apache/arrow-julia/issues/288 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@ar

[GitHub] [arrow-testing] pitrou merged pull request #79: ARROW-15815: [C++][Parquet] Add fuzz regression file

2022-03-01 Thread GitBox
pitrou merged pull request #79: URL: https://github.com/apache/arrow-testing/pull/79 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-testing] pitrou commented on pull request #79: ARROW-15815: [C++][Parquet] Add fuzz regression file

2022-03-01 Thread GitBox
pitrou commented on pull request #79: URL: https://github.com/apache/arrow-testing/pull/79#issuecomment-1055532430 FYI @tachyonwill -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [arrow-testing] pitrou commented on pull request #78: PARQUET-2131: Add fuzzer found crashers.

2022-03-01 Thread GitBox
pitrou commented on pull request #78: URL: https://github.com/apache/arrow-testing/pull/78#issuecomment-1055533136 @tachyonwill Sorry for not noticing this. Nobody is actively watching the PRs for this repository, so you should ping a core developer when you open a PR. -- This is an auto

[GitHub] [arrow-testing] pitrou merged pull request #78: PARQUET-2131: Add fuzzer found crashers.

2022-03-01 Thread GitBox
pitrou merged pull request #78: URL: https://github.com/apache/arrow-testing/pull/78 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubs

[GitHub] [arrow-julia] complyue opened a new issue #293: Support Tables.partitions when reading "arrow file" format in addition to "stream" format

2022-03-02 Thread GitBox
complyue opened a new issue #293: URL: https://github.com/apache/arrow-julia/issues/293 I see #23 closed but only *stream* format got supported. How about files in "arrow file" format, each with multiple record batches inside? With released version I get a single `Table` as t

[GitHub] [arrow-julia] complyue opened a new issue #294: How to adapt `Arrow.Table` columns (naturally per record batch basis) into `CuArray`s for GPU processing?

2022-03-02 Thread GitBox
complyue opened a new issue #294: URL: https://github.com/apache/arrow-julia/issues/294 I'm figuring out ways to have table columns processed by GPU, those coming from "arrow file" format files `mmap`ed for zero-copy. The full series can not fit into GRAM, while each batch record can

[GitHub] [arrow-julia] complyue opened a new issue #295: Order of record batches from "arrow file" format files (i.e. `Arrow.Table`) not preserved

2022-03-04 Thread GitBox
complyue opened a new issue #295: URL: https://github.com/apache/arrow-julia/issues/295 https://github.com/apache/arrow-julia/blob/614fce0a5d7db8fee078be32690c5220848538e2/src/table.jl#L276-L293 I see from above that record batches will be parsed (esp. decompression could be rather

[GitHub] [arrow-testing] emkornfield merged pull request #77: PARQUET-2130: Add fuzzer found DCHECKing file.

2022-03-04 Thread GitBox
emkornfield merged pull request #77: URL: https://github.com/apache/arrow-testing/pull/77 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-u

[GitHub] [arrow-testing] emkornfield merged pull request #76: PARQUET-2124: Add fuzzer testcase

2022-03-04 Thread GitBox
emkornfield merged pull request #76: URL: https://github.com/apache/arrow-testing/pull/76 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-u

[GitHub] [arrow-julia] quinnj closed issue #284: Future of the Julia arrow implementation

2022-03-05 Thread GitBox
quinnj closed issue #284: URL: https://github.com/apache/arrow-julia/issues/284 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...

[GitHub] [arrow-julia] nilshg opened a new issue #300: Possible bug in `Any` concretization routine

2022-03-07 Thread GitBox
nilshg opened a new issue #300: URL: https://github.com/apache/arrow-julia/issues/300 As discussed on Slack: ``` julia> using Arrow, DataFrames julia> Arrow.write("test.arrow", (a = [1, 2], b = Any[3, 4.5])) "test.arrow" julia> DataFrame(Arrow.Table("test.arrow"))

[GitHub] [arrow-julia] kou opened a new issue #301: Release script publishes the artifacts to wrong URL

2022-03-07 Thread GitBox
kou opened a new issue #301: URL: https://github.com/apache/arrow-julia/issues/301 It publishes to https://dist.apache.org/repos/dist/release/arrow/apache-arrow-julia-X.Y.Z but we should remove "apache-" prefix because other release doesn't have "apache-" prefix. -- This is an automate

[GitHub] [arrow-julia] kou closed issue #301: Release script publishes the artifacts to wrong URL

2022-03-07 Thread GitBox
kou closed issue #301: URL: https://github.com/apache/arrow-julia/issues/301 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@ar

  1   2   3   4   5   6   7   8   9   10   >