On Mon, 27 Apr 2020, Thorsten Behrens wrote:
Dear all,
my problem is that I want to read a big geotiff raster dataset into R and
convert it to a matrix, which does not work.
The file is big but there is sufficient memory. I need all the data in the
memory at the same time.
The error occurs under R 3.6.3 as well as 4.0.0 using Ubuntu 20.04 LTS with
the latest version of the packages (see session info below) and 256GB RAM
installed.
When loading the raster dataset using rgdal (via readGDAL or raster::readAll)
I get the follwoing error in R 4.0.0:
```
Error in rgdal::getRasterData(con, offset = offs, region.dim = reg, band =
object@data@band) :
long vectors not supported yet: memory.c:3782
```
On a 16GB Fedora platform:
library(raster) # 3.1-5
rDemTest = raster(nrow = 48000, ncol = 72000, ext = extent(c(0, 72000,
0,
+ 48000))) # all fine
rDemTest
class : RasterLayer
dimensions : 48000, 72000, 3.456e+09 (nrow, ncol, ncell)
resolution : 1, 1 (x, y)
extent : 0, 72000, 0, 48000 (xmin, xmax, ymin, ymax)
crs : NA
values(rDemTest) = 1
Error: cannot allocate vector of size 25.7 Gb
So you are deceiving yourself into thinking that all is fine at this
point. Please try to instantiate an example that can be reproduced on a
machine with 8GB RAM.
Further note that rgdal::readGDAL() is not how you handle very large
objects in files, and never has been. raster can handle blocks of data
from bands in file; stars and gdalcubes can use proxy=TRUE for the same
purpose. Why did you choose rgdal::readGDAL() when this is not its
purpose?
You did not say how much RAM is on your platform.
Roger
In R 3.6.3 is is "... memory.c:3717"
However, I can load the same file with the tiff package and a file of the
same size in the native raster package format (*.grd) with the raster package
but again not with the rgdal package.
gdalinfo (gdalUtils) does not complain (see below). Hence, Even Rouault
assumes the problem is related to rgdal and not gdal
(https://github.com/OSGeo/gdal/issues/2442).
Below you find reproducible code, which generates a raster file, saves the
two formats (.tiff and .grd) and tries to read them with the different
packages.
Is this a known limitation? Any help is greatly appreciated!
Thanks a lot in advance!
Best wishes and stay healthy,
Thorsten
### Steps to reproduce the problem.
R code:
```
library(rgdal) # 1.4-8
library(raster) # 3.1-5
library(tiff) # 0.1-5
## generate and manipulate a big raster dataset
# - generate
rDemTest = raster(nrow = 48000, ncol = 72000, ext = extent(c(0, 72000, 0,
48000))) # all fine
# - manipulate
values(rDemTest) = 1 # all fine
# - convert
mDemTest = raster::as.matrix(rDemTest) # all fine
str(mDemTest)
## save a big dataset
# - as raster/gdal
sFileNameTiff = "BigData.tif"
writeRaster(rDemTest, sFileNameTiff, "GTiff", overwrite = TRUE, NAflag =
-9999) # all fine
# - as raster native
sFileNameNative = "BigData.grd"
writeRaster(rDemTest, sFileNameNative, "raster", overwrite = TRUE, NAflag =
-9999) # all fine
## load the big raster datasets with different packages and options
# - load the tiff data with the gdal package via the raster package
rDem = raster(sFileNameTiff) # all fine
extent(rDem) # all fine
mDem = raster::as.matrix(rDem) # error
rDem = readAll(rDem) # error
# - load the native raster data with the raster package
rDem = raster(sFileNameNative) # all fine
extent(rDem) # all fine
mDem = raster::as.matrix(rDem) # all fine
str(mDem)
# - load the tiff data with the tiff package
mDem = readTIFF(sFileNameTiff) # all fine
str(mDem)
# - load the tiff data with the gdal package
sfDem = readGDAL(sFileNameTiff) # error
# - load the native raster data with the gdal package
sfDem = readGDAL(sFileNameNative) # error
```
### Startup messages when rgdal is attached (requested by Roger Bivand)
library(rgdal)
rgdal: version: 1.4-8, (SVN revision 845)
Geospatial Data Abstraction Library extensions to R successfully loaded
Loaded GDAL runtime: GDAL 3.0.4, released 2020/01/28
Path to GDAL shared files:
GDAL binary built with GEOS: TRUE
Loaded PROJ.4 runtime: Rel. 6.3.1, February 10th, 2020, [PJ_VERSION: 631]
Path to PROJ.4 shared files: (autodetected)
Linking to sp version: 1.4-1
### Session info
sessionInfo()
R version 4.0.0 (2020-04-24)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 20.04 LTS
Matrix products: default
BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/liblapack.so.3
locale:
[1] LC_CTYPE=de_DE.UTF-8 LC_NUMERIC=C LC_TIME=de_DE.UTF-8
[4] LC_COLLATE=de_DE.UTF-8 LC_MONETARY=de_DE.UTF-8
LC_MESSAGES=de_DE.UTF-8
[7] LC_PAPER=de_DE.UTF-8 LC_NAME=C LC_ADDRESS=C
[10] LC_TELEPHONE=C LC_MEASUREMENT=de_DE.UTF-8
LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] gdalUtils_2.0.3.2 rgdal_1.4-8 tiff_0.1-5 raster_3.1-5 sp_1.4-1
loaded via a namespace (and not attached):
[1] compiler_4.0.0 tools_4.0.0 Rcpp_1.0.4.6 R.methodsS3_1.8.0
codetools_0.2-16
[6] grid_4.0.0 iterators_1.0.12 foreach_1.5.0 R.utils_2.9.2
R.oo_1.23.0
[11] lattice_0.20-41
### gdalInfo
gdalinfo(sFileNameTiff)
[1] "Driver: GTiff/GeoTIFF"
[2] "Files: BigData.tif"
[3] "Size is 72000, 48000"
[4] "Origin = (0.000000000000000,48000.000000000000000)"
[5] "Pixel Size = (1.000000000000000,-1.000000000000000)"
[6] "Image Structure Metadata:"
[7] " COMPRESSION=LZW"
[8] " INTERLEAVE=BAND"
[9] "Corner Coordinates:"
[10] "Upper Left ( 0.000, 48000.000) "
[11] "Lower Left ( 0.0000000, 0.0000000) "
[12] "Upper Right ( 72000.000, 48000.000) "
[13] "Lower Right ( 72000.000, 0.000) "
[14] "Center ( 36000.000, 24000.000) "
[15] "Band 1 Block=72000x1 Type=Float32, ColorInterp=Gray"
[16] " Min=1.000 Max=1.000 "
[17] " Minimum=1.000, Maximum=1.000, Mean=nan, StdDev=nan"
[18] " NoData Value=-9999"
[19] " Metadata:"
[20] " STATISTICS_MAXIMUM=1"
[21] " STATISTICS_MEAN=nan"
[22] " STATISTICS_MINIMUM=1"
[23] " STATISTICS_STDDEV=nan"
_______________________________________________
R-sig-Geo mailing list
R-sig-Geo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-geo
--
Roger Bivand
Department of Economics, Norwegian School of Economics,
Helleveien 30, N-5045 Bergen, Norway.
voice: +47 55 95 93 55; e-mail: roger.biv...@nhh.no
https://orcid.org/0000-0003-2392-6140
https://scholar.google.no/citations?user=AWeghB0AAAAJ&hl=en
_______________________________________________
R-sig-Geo mailing list
R-sig-Geo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-geo