On Oct 4, 2006, at 2:25 PM, Kyle Kienapfel wrote:

I just used a DBF Viewer 2000 from this site to cut a 300mb dbf file
down to about 50.

I played with this a bit using the shapelib tools on some files (not the TIGER shape files, but can't remember which ones). For the file I was looking at it didn't make much difference. I wanted to look at the TIGER shapefiles because I suspected it could make some difference there, but never got around to it.

if I knew which one was used by the shp
file, I could cut a few more MB there.

Hmm, let's look at a dbfawk for clues. Here's a section of the dbfawk for tgr2shp.dbfawk:

BEGIN {
# dbfinfo is the "signature" of the dbf file listing the column names in order. # dbfinfo should match the dbf file that we say this dbfawk file goes with. dbfinfo="MODULE:TLID:SIDE1:SOURCE:FEDIRP:FENAME:FETYPE:FEDIRS:CFCC:FRADD L:TOADDL:FRADDR:TOADDR:FRIADDL:TOIADDL:FRIADDR:TOIADDR:ZIPL:ZIPR:AIANHHF PL:AIANHHFPR:AIHHTLIL:AIHHTLIR:CENSUS1:CENSUS2:STATEL:STATER:COUNTYL:COU NTYR:COUSUBL:COUSUBR:SUBMCDL:SUBMCDR:PLACEL:PLACER:TRACTL:TRACTR:BLOCKL: BLOCKR";
#dbffields is which of the above fields we actually want to look at.
# No point reading dbffields that are not looked at further.
dbffields="TLID:FEDIRP:FENAME:FETYPE:FEDIRS:CFCC";
}


"dbfinfo" tells us that a shapefile with the fields indicated in dbfinfo (those names in that order) will be a match for this dbfawk.

"dbffields" tells us that we're only going to look at these fields; all others will be ignored in the dbfawk.

My (limited) understanding of dbfawks leads me to think we can eliminate all the fields that are not listed in dbffields, adjust dbfinfo accordingly, and we'll have a smaller file.

Why don't you try that and let us know how it goes?  (:

-Jason
kg4wsv


_______________________________________________
Xastir mailing list
Xastir@xastir.org
http://lists.xastir.org/cgi-bin/mailman/listinfo/xastir

Reply via email to