Peter wrote:
> as space is running short on the
> ISO we need to reconsider sizings. In
> today's OSGel Live chat I was
> tasked to initiate this discussion.
> Current disk footprint of each
> application is available from
> https://docs.google.com/spreadsheet/ccc?key=0Al9zh8DjmU_RdGIzd0VLLTBpQVJuNVlHMlBWSDhKLXc#gid=13
>
> Top riders currently:
> - mapguide: 550mb
(where are you getting your numbers from? we disabled that
when we ran out of disc space about 3-4 releases ago)
> - marble: 300 mb (includes disk caching!)
> - grass >250mb
(again, is that up to date? we've since dropped the large NC
sample dataset in favour of a stripped down one and the GeoTiffs
and SHP versions to be used by all)
> One way of saving space is to
> enlarge it (ie, go to 8 GB) which is
> unclear due to some unresolved
> questions; another one is to deflate
> unused datasets, a third one to
> share data among applications.
>
> Can I ask all those with 3-digit
> disk hunger to chime into this
> discussion.
Hi,
(fwiw & I'm not sure how the numbers put in the spreadsheet were calculated so
I may speak out of turn)
In general the disc space numbers as provided by the "before and after" disk
space free in the (non-chroot) build logs are wildly inaccurate and deceiving.
What ultimately matters is installed compressed size. Raster satellite images
and video tutorials are the worst here unfortunately since they are the least
compressible.
In the case of Marble, you can ignore disc caching as it is empty by default
and isn't stored in the ISO. "apt-cache show" has the marble-data package at
Installed-Size: 20736. Probably looking at the filesize of the the .deb
packages are a good indication of the on-disc compressed size. I suspect there
are other KDE apps on the disc and Marble is just taking the rap as the first
one installed.
GRASS is in a similar situation. By chance and history it gets installed one
script ahead of QGIS, and so brings in all the many dependencies used by both.
Switch the order and QGIS looks huge*. Also perhaps GRASS's biggest package is
its docs, man pages and html-- huge on raw space bug compress extremely well.
We have 67mb total sample data (compressed) which I admit is a big bite; but
I'd note that we got rid of the large 135mb North Carolina dataset some months
ago after installing the system-wide geotiff and shapefiles versions.
[*] probably in large part the dependency on gdal-dev, since it depends on lots
and lots of other -dev packages. But that's one of the most important group of
packages on the disc and commonly used so I'd be loathe to think about cutting
it.
Inline and at the end of main.sh (no longer used in the mainstream build) are a
number of tests on the completed file system for disc space hogs. It listed
before and after disk space free, and the top 75 biggest packages installed:
echo "Show top 75 packages hogging the most space on the disc:"
dpkg-query --show --showformat='${Package;-50}\t${Installed-Size}\t${Status}\n'
\
| sort -k 2 -n | grep -v deinstall | tac | head -n 75 | \
awk '{printf "%.3f MB \t %s\n", $2/(1024), $1}'
But that is deceptive too, again with the "take a guess how well it compressed"
uncompressed size, but also it misses out on all(?) of the Java apps which are
not in .deb pkgs and often hundreds of MB each (see tarball downloads in the
build log). But there too.. they often share common files (e.g. each their own
copy of tomcat and JAI) and as long as that's at the same version the `fslint`
step in build_iso.sh (that's still part of the build process, right?) hardlinks
all the duplicates together making them much more efficient.
A great tool for exploring disk use is `filelight`, it's already installed on
the Live DVD.
So in summary, many grains of salt are needed. The perennial low hanging fruit
from my perspective was getting the Java apps to used shared system libs and
tools, but unfortunately that's not really how Java apps expect to work.
We have the basics for supporting online datasets & tutorials from the main
desktop icons, we could and should make better use of that.
As a target, I'd like to see >= 500mb space free on a 4.3gb DVD for installers
and/or from the 3.8gb(?) for vFAT formatted live 4gb USB stick (which means we
are about 200mb over the soft goal but still viable). The USB stick chews
through free space very quickly as every change to the base system gets stored
as a binary diff applied at boot time- the original image remains compressed
and untouched. "apt-get upgrade" is likely to fill it all. Our best hope I
think is to wait it out, for 8gb to be the cheapest USB sticks that conferences
can buy.
I had to hack the USB creation tool's code on the disc to allow us to write to
a bootable 4gb usb stick, the default was to grey out the device if there would
be less than a gig of free space on it.
hope it helps,
Hamish
_______________________________________________
Live-demo mailing list
[email protected]
http://lists.osgeo.org/mailman/listinfo/live-demo
http://live.osgeo.org
http://wiki.osgeo.org/wiki/Live_GIS_Disc