Re: [R-sig-Geo] Tiled processing...

Jonathan Greenberg Thu, 30 Oct 2008 12:18:34 -0700

Ok, I'm nearly there using only RGDAL and R-base commands, if I can geta bit of feedback I might have a decent tiled (row-by-row) processingstructure worked up. Couple of things -- first, I was mistakenlythinking all raster formats can even really support line-by-linewriting, which is not neccessarily true (consider the various compressedimage formats). Let's assume that the user just wants a flat-binarytype file, either ENVI or ESRI format. We can use writebin and one ofthe writeGDAL(...,drivername='EHdr',...) or similar "header-only" writecommands:


elev is a DEM, but it can be any raster format GDAL can read.


***

library(rgdal)
infile='elev'
outfile_base='testout'
outfile_ext='.bil'
outfile=paste(outfile_base,outfile_ext,sep='')
outcon <- file(outfile, "wb")

infile_info=GDALinfo(infile)
nl=infile_info[[1]]
ns=infile_info[[2]]

for (row in 1:nl) {
   templine <- readGDAL(infile,region.dim=c(1,ns),offset=c(row-1,0))
   writeBin(templine[[1]], outcon,size=4)
}
close(outcon)
writeGDAL(templine,outfile_base,drivername='EHdr',type="Float32")

***

Right now, this ALMOST works except, as you can see from the final linethat the output header will incorrectly set the number of lines in theoutput to 1 (because I'm only reading one line at a time). I'm usingtempline purely as a way to carry over the header info, it won't writeany actual data out, as far as I know (or will it?) How do I modify the"metadata" of the templine to reflect the correct "header" info (e.g.set the number of rows back to the total number of rows in the image,and reset the geographic position correctly.


--j

Alexander Brenning wrote:

Hi,
maybe the RSAGA package has the solution to your problem; there areactually two ways of applying functions to grids in RSAGA, (1) byrow-by-row processing (special form of tiles), or (2) using the SAGAbinaries.
1) local.function
local.function and focal.function are very flexible tools, but they areslow because they are written in R. They work with ASCII grids (see e.g.write.ascii.grid in RSAGA, or the appropriate GDAL export functions thatwork with your GeoTIFFs).
In your case:

local.function("ingrid", varnames = "outgrid",
     fun = function(x) x + 1000)
(or use focal.function with radius = 0). This will of course work withmuch more general R functions (even with predict methods if you look atgrid.predict and multi.focal.function).
The RSAGA package technically depends on Windows (because most of itsfunctions use SAGA GIS Windows binaries) but the local.function andfocal.function are (supposed to be) platform-independent; I can provideyou with the source code if you work on a non-Windows system and can'tget it from CRAN.
2) rsaga.grid.calculus
Another approach, which involves SAGA itself and therefore (currently)depends on Windows, uses SAGA's grid calculator. SAGA is able to processlarge grids efficiently. E.g.:
# first convert the ASCII grid to a SAGA grid (.sgrd):
rsaga.esri.to.sgrd("ingrid")
rsaga.grid.calculus("ingrid", "outgrid", formula = "a+1000")
  # 'ingrid' is treated as 'a' in the formula
rsaga.sgrd.to.esri("outgrid", prec = 2)

I hope this helps...

Cheers
 Alex


Jonathan Greenberg wrote:
I've recently got back into using R to perform spatial analyses, andI'm trying to figure out how to perform "true" tiled processing, e.g.controlled reading of subsets of an input file, performing a functionon this subset, and writing the output, subset by subset, to an outputfile and, finally, setting up the appropriate "header" info (themetadata).
Using suggestions Tim Keitt and Roger Bivand gave me some years back,I wrote this simple code (all it should do is take an elevationgeotiff and add 1000 to the elevations, writing the output):
***

library(rgdal)
ds1 <- GDAL.open("elev.tif")
driver <- new('GDALDriver', 'GTiff')
ds2 <- new('GDALTransientDataset', driver, nrow(ds1), ncol(ds1), type= "Float32")
 for (row in nrow(ds1)) {
   row
x <- getRasterData(ds1, offset = c(row - 1, 0), region = c(1,ncol(ds1)))
   elevnew <- x+1000
   putRasterData(ds2, elevnew, offset = c(row - 1, 0))

 }
saveDataset(ds2, "out.tif")
closeDataset(ds2)
closeDataset(ds1)

***

Couple of issues/questions:
1) The code doesn't actually seem to work -- I get a tiny, unreadableTIFF out of the back of this algorithm -- what is wrong?2) I'm actually unclear about exactly what this is doing -- am Ireally just creating a large in-memory matrix (ds2) and then writingthe entire output image out to disk, or is this somehow writingline-by-line? If the former is true, how do I modify this to writewithin the loop, rather than all-at-once?
Thanks!

--j


--

Jonathan A. Greenberg, PhD
Postdoctoral Scholar
Center for Spatial Technologies and Remote Sensing (CSTARS)
University of California, Davis
One Shields Avenue
The Barn, Room 250N
Davis, CA 95616
Cell: 415-794-5043
AIM: jgrn307, MSN: [EMAIL PROTECTED], Gchat: jgrn307

_______________________________________________
R-sig-Geo mailing list
R-sig-Geo@stat.math.ethz.ch
https://stat.ethz.ch/mailman/listinfo/r-sig-geo

Re: [R-sig-Geo] Tiled processing...

Reply via email to