asfimport opened a new issue, #87:
URL: https://github.com/apache/arrow-js/issues/87
Steps:
1. Generate an Arrow file bigger than 2 GB:
```python
import pyarrow as pa

nums1 = [42]
nums2 = [42.42]
mil = 1000000
for n in range(1, 140 * mil):
    nums1.append(n)
    nums2.append(1 / n)

arr1 = pa.array(nums1)
arr2 = pa.array(nums2)

schema = pa.schema([
    pa.field('nums1', arr1.type),
    pa.field('nums2', arr2.type),
])

with pa.OSFile('arraydata.arrow', 'wb') as sink:
    with pa.ipc.new_file(sink, schema=schema) as writer:
        batch = pa.record_batch([arr1, arr2], schema=schema)
        writer.write(batch)
```
2. Try to read it via the JS SDK:
```javascript
const fs = require("fs");
const { tableFromIPC, RecordBatchReader } = require("apache-arrow");

const filePath = "./arraydata.arrow";
const stream = fs.createReadStream(filePath);
const reader = RecordBatchReader.from(stream);

(async function () {
  const table = await tableFromIPC(reader);
  console.log("numRows", table.numRows);
  console.log("first row", table.get(0).toArray());
})();
```
The code above prints:
```text
numRows 140000000
first row [ undefined, undefined ]
```
`numRows` is correct, but the values are coming out as `undefined`.
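For reference, here is a minimal diagnostic sketch (not part of the original report) that streams the record batches directly instead of assembling a `Table`, to help narrow down whether the `undefined` values come from table assembly or from decoding the batches themselves. It assumes the same `arraydata.arrow` file produced in step 1.
```javascript
// Hypothetical diagnostic: inspect the first value of each column per batch,
// bypassing tableFromIPC. Assumes the file written in step 1.
const fs = require("fs");
const { RecordBatchReader } = require("apache-arrow");

(async function () {
  const reader = await RecordBatchReader.from(fs.createReadStream("./arraydata.arrow"));
  let totalRows = 0;
  for await (const batch of reader) {
    if (totalRows === 0) {
      // First batch: print the first value of every column.
      for (const field of batch.schema.fields) {
        console.log(field.name, batch.getChild(field.name)?.get(0));
      }
    }
    totalRows += batch.numRows;
  }
  console.log("total rows", totalRows);
})();
```
If the per-batch values also come back as `undefined`, the problem likely lies in reading buffers past the 2 GiB boundary rather than in `Table` assembly.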
**Reporter**: [Denis Gursky](https://issues.apache.org/jira/browse/ARROW-18007)
<sub>**Note**: *This issue was originally created as [ARROW-18007](https://issues.apache.org/jira/browse/ARROW-18007). Please see the [migration documentation](https://github.com/apache/arrow/issues/14542) for further details.*</sub>