Re: [osmosis-dev] Cutting PBF file into 1° tiles

Sylvain Melin Wed, 20 Apr 2016 06:50:01 -0700

On 18/04/2016 15:01, Sylvain Melin wrote:

On 18/04/2016 12:23, Jochen Topf wrote:
On Mo, Apr 18, 2016 at 11:52:24 +0200, Sylvain Melin wrote:
On 18/04/2016 11:00, Jochen Topf wrote:
On Mo, Apr 18, 2016 at 10:10:06 +0200, Sylvain Melin wrote:
My plan is to :
- exploit a planet sized pbf file
- cut it into 1° tiles using osmosis
- filter and extract the data from these tiles as shapefiles usinglibosmium
If you are writing your own program anyway to create thoseshapefiles, whydon't you do the splitting in this step *after* creating thegeometries andbefore writing them into shapefiles? That is probably much easierto do than
based on the PBF due to the structure of the OSM data files.
Jochen
Maybe I'm wrong but because I don't want to parse the fullplanet.osm.pbf
every time I want to extract a small set of data.
The processing time seems to grow exponentially with the size ofsource file
The time of what processing exactly? I don't see anything in what youare doingthat should scale worse then linearly. Of course if you don't haveenough memory
you'll run into problems.
so having an intermediate level with 1° sized pbf containing everything
seems very practical to me.
In theory yes, but, as you noticed, you'll have to handle all objectsspecially
that straddle tile boundaries.
Also, my osmium program loops over the target tile and parse theappropriate
pbf :

/for each j in [-90,89]//
//{//
//        for each i in [-180,179]//
//        {//
//                create osmium::handler//
//                parse i_j.pbf with osmium::io::Reader//
//                extract data to single handler with osmium::apply//
//        }//
////}/
Do you think it would be more efficient to have a single big PBF andextract
data to several handlers ?
It will probably be most efficient to just do everything in one go.And only atthe moment where you are writing out the finished feature into theshapefile,decide in which shapefile it should belong. You'll only have onehandler, but
180*360 output shapefiles.
Is it even possible without filling the RAM ?
Depends on how much RAM you have. You'll need 32GB RAM for the nodelocationstore. And you'll need same RAM to buffer the output, because youcan't writeto 180*360 files at the same time efficiently. Maybe fewer fileswould bebetter? (Also you'll have not only one shape file for each tile, butprobablydozens for all the different layers of data, which makes this problemworse.)
So if you don't have this kind of memory, you have a problem.

You can also have a look at
https://github.com/joto/osm-history-splitter
which should be more efficient at splitting a planet into smallerfiles thanOsmosis. But people have reported some issues with this software. Itis on my
TODO list to look at this and fix them, but that will take a while.

Jochen
Ok I got it ! Unfortunately, I don't have enough RAM for this method.
I did not thought about it before but given the small amount of data Ineed, I wonder if using xapi to request data per degree isn't the mostobvious way to get the data I need, unless xapi has the same kind ofproblem with the borders.
I'll also take a look at osm-history-splitter.

Thank you very much !

Sylvain


I finally found a proper method to do this.

I wrote a bash script that uses overpass api to request and filter thedata, and convert the resulting osm.xml file to shapefile with my osmiumprogram.


Overpass api does not clip data on the edges of the bounding box.

Also, I only have the data I need on my hard drive and I'm sure it's upto date.


Thank you for your help.
I hope it will help people facing the same issue.

Regards,
Sylvain



_______________________________________________
osmosis-dev mailing list
[email protected]
https://lists.openstreetmap.org/listinfo/osmosis-dev

Re: [osmosis-dev] Cutting PBF file into 1° tiles

Reply via email to