Re: [Geowanking] real time tracking

J. Andrew Rogers Tue, 30 Oct 2007 12:28:38 -0800


On Oct 17, 2007, at 11:57 AM, Dave Rafkind wrote:

One benefit of Erlang that I can see is the ease with which it supports "parallel" (multicore or distributed) computation. If tracking servers and associated data structures can be "sharded" (partitioned) along some kind of optimized geographical boundaries then won't that be a win for most simple tracking purposes? Admittedly you won't get much obvious help for global queries but for simple things like geofencing and so on it seems like a nice solution.

What I meant was that the limitations on parallelization and distribution in this particular environment are not really a function of the language; it may be a good solution to A problem but does not really address THE problem(s). You are not suggesting anything that many people have not already implemented without Erlang. Call it a premature optimization, though I can see where it could have some utility.

Using other Erlang apps as an example (Yaws, Ejabberd, RabbitMQ), 1000 updates per server per second seems entirely reasonable, and maybe 10k/server/sec is an upper bound. Is that within the performance bounds of current real-world tracking systems?

Update rates are a function of the lock graphs for a given data structure and the kind of guarantees one needs to make. You cannot make meaningful inferences about update scalability based on the characteristics of unrelated data structures; it is a limitation of algorithms and data structures, not the programming language.
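
To make the locking point concrete, here is a minimal sketch in Python. It is one reading of "lock graph", not anything taken from a real tracking system: the class name, the per-cell locking scheme, and the grid-cell keys are all assumptions for illustration. An update that moves an object between two cells has to hold both cell locks, and taking them in a fixed order is what keeps two concurrent moves from deadlocking. The cost of an update is set by which locks the structure forces it to take, whatever language the code is written in.

```python
import threading


class ShardedPositions:
    """Hypothetical point store with one lock per grid cell."""

    def __init__(self):
        self._meta = threading.Lock()   # guards creation of new cells
        self._slots = {}                # cell -> (lock, {obj_id: position})

    def _slot(self, cell):
        # Create the cell's lock and bucket on first use.
        with self._meta:
            if cell not in self._slots:
                self._slots[cell] = (threading.Lock(), {})
            return self._slots[cell]

    def move(self, obj_id, old_cell, new_cell, pos):
        """Move obj_id from old_cell to new_cell (they may be the same)."""
        # Acquire cell locks in sorted order so concurrent moves cannot deadlock.
        cells = sorted({old_cell, new_cell})
        slots = {c: self._slot(c) for c in cells}
        for c in cells:
            slots[c][0].acquire()
        try:
            slots[old_cell][1].pop(obj_id, None)
            slots[new_cell][1][obj_id] = pos
        finally:
            for c in reversed(cells):
                slots[c][0].release()

    def get(self, cell):
        """Return a copy of the objects currently stored in a cell."""
        lock, bucket = self._slot(cell)
        with lock:
            return dict(bucket)
```

An update that stays inside one cell takes a single lock; one that crosses a boundary takes two. A structure whose updates must lock a shared root or a path of internal nodes would serialize far more of its writers.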

Also, I'm not sure I see the difficulty in tracking points vs lines vs polygons. Certainly time-swept data is tricky (i.e., keep track of the last 3 positions), but if you live in the present, things like augmented R-trees ( http://www.cs.purdue.edu/research/technical_reports/2005/TR%2005-020.pdf ) seem adequate (although I have not tested them myself).

If you restrict yourself to tracking points, you can make assumptions that allow significant data structure optimization. Polygons are much more difficult to generalize in a scalable way because there is no guarantee that a natural boundary exists with which to nicely partition any arbitrary set of polygons. Handling this case well is difficult.
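
A small Python sketch of that asymmetry, under assumed details (the uniform grid, the cell size, and the function names are illustrative only): a point always falls in exactly one cell of a fixed grid, so it can be handed to a single partition, while a polygon's bounding box can overlap any number of cells, so no fixed set of boundaries is guaranteed to keep it in one partition.

```python
import math

CELL_SIZE = 1.0  # hypothetical cell width, in the same units as the coordinates


def cell_of_point(x, y, cell=CELL_SIZE):
    """Return the single grid cell that contains the point."""
    return (math.floor(x / cell), math.floor(y / cell))


def cells_of_polygon(vertices, cell=CELL_SIZE):
    """Return every grid cell overlapped by the polygon's bounding box."""
    xs = [x for x, _ in vertices]
    ys = [y for _, y in vertices]
    x0, y0 = cell_of_point(min(xs), min(ys), cell)
    x1, y1 = cell_of_point(max(xs), max(ys), cell)
    return [(i, j) for i in range(x0, x1 + 1) for j in range(y0, y1 + 1)]
```

With this grid, a point is always owned by one cell, but a rectangle from (0.5, 0.5) to (2.5, 1.5) touches six cells and would have to be duplicated, split, or pushed up to a coarser level of whatever index is in use.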

The numerous R-tree variants, like the one above, do not substantially improve R-trees in the general case. Usually it is a case of trading one of the numerous pathological cases for another -- useful if you can tailor it to a specific app -- or making one pathological case less pathological on average. For every R-tree variant, you can find a real-world data set that essentially breaks it and several others that have mediocre performance on it. Again, one of the values of having so many R-tree variants is that you can select the one that best matches the characteristics of your data set.


Cheers,

J. Andrew Rogers

_______________________________________________
Geowanking mailing list
[email protected]
http://lists.burri.to/mailman/listinfo/geowanking
