Re: [UMN_MAPSERVER-USERS] data proliferation or the data that ate the disk space

Stephen Woodbridge Sat, 25 Mar 2006 06:17:20 -0800

Richard,

Data management is a common problem. The best practices for me have beento separate physical storage and logical storage. This is easiest to doon Linux systems with symbolic links. For physical storage, I like tokeep datasets self contained, especially if I have to update them at anyfrequency. Because these are self contained (ie. in a single directorytree) it is easy to create a parallel tree with new data and just swapout the old data for the new data by changing the symlink to the newdata. This also allow any data tree to reside on any partition.

For logical storage, I think in terms of maps or applications and Ibuild a single directory for each. Into this directory, I link in thephysical datasets in need and I create all the tileindexes relative tothat directory. Then in the mapfile I set DATAPATH to point to thatdirectory. So for example, I have tiger data directories for theseparate tiger releases with physical names like:


/u/data/tiger2004fe/
/u/data/tiger2004se/
/u2/data/tiger2005fe/

In my application directory I have something like:

/u/application/tiger -> /u/data/tiger2005fe/

I call the tiger data by "tiger" regardless of the version I am showing.That way I can change the underlying data without the application caringand I don't need to rebuild the tileindexes.

If I want to move the application to another server, I move the physicaldatasets I need and the application directory and fix up the symlinks topoint to the respective new locations. In 99% of the time I do not needto rebuild the tileindexes.


Hope this helps,
  -Steve W.

Richard Taylor wrote:

Hello LIST
this is not just a MapServer question, but perhaps some of you fartherdown the path have insights that you are willing to pass on.
As my learning curve progresses i find that local data volume isincreasing rapidly. It started of course with local apps, then expandedwith my introduction to MapServer, in my case ms4w, for getting thebasics, then has continued on to local directories to send up to remoteunix system instances.
While the mapfiles allow one to give a full path to your data, meaninglocally you can get at it wherever it is, that structure does not holdwell with or all with remote instances. the end result is multiplecopies of many files, some of which are quite large, one for local apps,one for ms4w, and one for each remote mapserver.
One solution is to keep getting large storage space but feeling thismight a common problem wonder if any of the long term users or thosewith large data volumes have come to a 'best practises' solution to thisissue.
thanks in advance

richard taylor

Re: [UMN_MAPSERVER-USERS] data proliferation or the data that ate the disk space

Reply via email to