Re: [OSM-dev] [OSM-talk] Initial population of data in OSM

Paul Norman Mon, 04 Jun 2012 21:00:35 -0700

I've cc'ed dev@ on this since it's more of an appropriate list.


What seems the easiest way to me would be to convert the IDs to positive and
add version and changeset information. You might be able to do this within
the tool you use to convert FileGDB to OSM, or post-process. I have intended
to write a tool that will take a .osm file and output the same .osm file,
just with all IDs multiplied by -1. You could then load the data with
osmosis and update the counter for the node ID if you need to do any
subsequent changes.

 

Because it's an initial load there's no existing data to conflict with so
osmosis is likely the fastest way.

 

The bad news: I've converted FileGDB data to .osm XML and found it takes up
about 10 times the room on disk, or about 400 GB for your data. I just
completed an import into an apidb of the planet which is under half the size
of your data and it took nearly a full month on a RAID0 array of four 7200
RPM drives. The completed database was about 1 TB. Unless you have lots of
extremely fast drives or slower drives and lots of time, I suggest you
consider if there might be a way to do what you intend to do without loading
the data into an apidb. Dealing with a dataset the size of the planet is bad
enough. Yours is larger and will take even longer.

 

From: Benoit Pronovost [mailto:bpronov...@gmail.com] 
Sent: Monday, June 04, 2012 8:15 AM
To: t...@openstreetmap.org
Subject: [OSM-talk] Initial population of data in OSM

 

I've installed a local OpenStreetMap architecture for a project I'm
currently working on. We need to load a very large amount of data into our
OpenStreetMap database.
The original data is stored in a FileGDB that is around 40Gb. It has been
transfered into the OSM structure (with negative ids and no version,
changeset,etc tags).

My question is what would be the ideal way to procede to load this initial
set of data?

I'v tried bulk_upload.py which takes about 1 hour to process 1/30000 of the
data. Either my local API 0.6 needs to be tuned or it is not made to handle
such important data updates.
Osmosis requires that their be changeset and version tags which do not exist
in the data I currently have. Writing directly to the database without
passing through the API would possibly speed up the process, but I've not
found a way to do this.

Thank you in advance for your advice.

_______________________________________________
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev

Re: [OSM-dev] [OSM-talk] Initial population of data in OSM

Reply via email to