Shapely

Yevgen Antymyrov Fri, 26 Mar 2010 10:05:30 -0700

Guys,

I've found this old conversation about the Python vs C comparison and I have a question about it. If you don't mind, I'll continue the old thread.

My current task is to go through a big file and then, for some particular features (F1), find all features (F2) in another file which intersect F1. For that I used shape to speed up the search of F2 by using layer.SetSpatialFilterRect().
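For illustration, the search described above can be sketched roughly as follows. This is not the actual script: the function name find_intersecting is hypothetical, and both arguments are assumed to be OGR objects (an ogr.Geometry for F1 and an ogr.Layer holding the F2 features). The rectangle filter only narrows candidates by bounding box, so the sketch confirms each one with an exact test.

```python
def find_intersecting(f1_geom, f2_layer):
    """Return the FIDs of features in f2_layer whose geometry intersects f1_geom.

    Hypothetical helper: f1_geom is assumed to be an ogr.Geometry and
    f2_layer an ogr.Layer; only OGR binding methods are called on them.
    """
    # OGR returns the envelope as (minx, maxx, miny, maxy) ...
    minx, maxx, miny, maxy = f1_geom.GetEnvelope()
    # ... while the rectangle filter expects (minx, miny, maxx, maxy).
    f2_layer.SetSpatialFilterRect(minx, miny, maxx, maxy)
    f2_layer.ResetReading()

    hits = []
    feature = f2_layer.GetNextFeature()
    while feature is not None:
        geom = feature.GetGeometryRef()
        # The filter compares bounding boxes only, so confirm each
        # candidate with an exact geometry test.
        if geom is not None and geom.Intersects(f1_geom):
            hits.append(feature.GetFID())
        feature = f2_layer.GetNextFeature()

    # Clear the filter so later reads see the whole layer again.
    f2_layer.SetSpatialFilter(None)
    return hits
```

Only the feature IDs are collected here, so no F2 feature object is kept alive after its iteration ends.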

My question does not have anything to do with Shapely, only with the OGR/Python bindings. But as you mentioned in this email, "The Python script above also uses a great deal of memory since it never frees memory allocated by OGR's GetNextFeature()". So it seems you have experience with this. I understand that you meant not the "list.append()" problem, but something in OGR's internal workings. Do you know how this could be avoided? My files are gigabytes of data and won't fit into my 32-bit RAM.

What I see is that my script, which basically does "for feature in get_features_generator():", just grows in memory even though I never store any data, only doing 'printf' when an intersection is found.
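For reference, a streaming loop of that kind might look like the sketch below. The names iter_features and count_features are hypothetical, and the layer is assumed to be an ogr.Layer. The sketch only shows that the Python side keeps no reference to earlier features; whether that is enough to keep memory flat depends on the version of the OGR bindings, which it does not address.

```python
def iter_features(layer):
    """Yield features from an OGR-style layer one at a time.

    Hypothetical helper: layer is assumed to be an ogr.Layer.
    """
    layer.ResetReading()
    feature = layer.GetNextFeature()
    while feature is not None:
        yield feature
        # Drop the generator's own reference before fetching the next
        # feature, so nothing on the Python side keeps the old one alive.
        feature = None
        feature = layer.GetNextFeature()


def count_features(layer):
    """Walk the whole layer without storing any feature."""
    count = 0
    for _feature in iter_features(layer):
        count += 1
    return count
```

If memory still grows with a loop like this, the growth is not caused by Python-level references held by the script.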

> #=======================================
> import os,sys,osgeo.ogr
> from shapely.wkb import loads
> from shapely.geometry import Polygon
>
> source = osgeo.ogr.Open(sys.argv[1])
> layer = source.GetLayer()
> objet = layer.GetNextFeature()
> liste = []
> while objet:
>     liste.append(loads(objet.GetGeometryRef().ExportToWkb()))
>     objet = layer.GetNextFeature()
> print len(liste)
>
> for i in range(len(liste)-1):
>     ref = liste[i]
>     if i % 100 == 0:
>         print  str(i)
>         os.system("date")
>     for j in range(i+1,len(liste)):
>         sec = liste[j]
>         if ref.disjoint(sec) == False:
>             print "intersection entre " + str(i) + ' et ' + str(j)

Hi Pascal,

A Python solution shouldn't be 100 times slower than C, especially since
both OGR and Shapely use the same GEOS library to compute relationships
between geometries. Is it possible that your other solution uses
"intersects" instead of "not disjoint" (could be faster), or is more
approximate?

The Python script above also uses a great deal of memory since it never
frees memory allocated by OGR's GetNextFeature(), and then copies the
feature to a Shapely geom (and a GEOS geom). If your computer's free
memory is low, performance can be poor. Is this a possibility?



--
Yevgen

_______________________________________________
Community mailing list
[email protected]
http://lists.gispython.org/mailman/listinfo/community

Re: [Community] Run time comparisons between C/GEOS and Python/Shapely
