eitsupi commented on issue #34291:
URL: https://github.com/apache/arrow/issues/34291#issuecomment-1441794146

   > So far, I could not save the data into another Arrow format to facilitate 
further analyses. I tried to write the dataset into a parquet or feather 
objects but RStudio always crashes because of memory issue. I have 8 core 
computer and 32 GB ram. Is there a way to write the data into e.g. feather 
format in a more efficient way without crashing?
   
   Since you seem to be able to read all the data from CSV as a data frame, how 
about setting `as_data_frame = FALSE` to read it as an Arrow Table instead?
   I think that will need less memory.
   
   For example, we can convert to an Arrow IPC file (Feather V2) dataset 
without going through a data frame as follows.
   
   ```r
   arrow::read_delim_arrow(
     "https://github.com/apache/arrow/files/10804095/Arrow_parse_Example4.txt",
     delim = "\t",
     quote = "",
     as_data_frame = FALSE
   ) |>
     arrow::write_dataset("test", format = "arrow")
   ```
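
   Once the dataset is written, it can probably be queried lazily rather than 
loaded whole. The following is only a minimal sketch under some assumptions: 
the small tab-delimited file, the `out_dir` path, and the `id`, `group`, and 
`value` columns are all made up for illustration, and the dplyr package is 
assumed to be installed. Only the rows that survive the filter and 
aggregation should end up in R memory.

   ```r
   library(arrow)
   library(dplyr)

   # Hypothetical stand-in for the real tab-delimited input file.
   src <- tempfile(fileext = ".txt")
   write.table(
     data.frame(
       id = 1:6,
       group = rep(c("a", "b"), 3),
       value = c(1.5, 2, 3.5, 4, 5.5, 6)
     ),
     src,
     sep = "\t", quote = FALSE, row.names = FALSE
   )

   # Hypothetical output directory for the Arrow IPC (Feather V2) dataset.
   out_dir <- file.path(tempdir(), "test_dataset")

   # Read as an Arrow Table (not a data frame) and write it out as a dataset.
   read_delim_arrow(src, delim = "\t", as_data_frame = FALSE) |>
     write_dataset(out_dir, format = "arrow")

   # Open the dataset lazily; no data is pulled into R at this point.
   ds <- open_dataset(out_dir, format = "arrow")

   # Build the query with dplyr verbs and materialize only the result.
   result <- ds |>
     filter(value > 2) |>
     group_by(group) |>
     summarise(total = sum(value)) |>
     collect()
   ```

   Here `collect()` is the step that turns the result into a regular data 
frame, so keeping it at the end of the pipeline is what should keep memory 
use low.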


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
