Pete wrote:
Interesting, and it kinda makes sense.  For elements, there's no
positioning required like with lines/words/items, just a case of cycling
through the keys - which is what "repeat for each line <x> in the keys of
<array>" does, I suppose.
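
For reference, that pattern spelled out - a minimal sketch, with tArray
and tTotal as hypothetical names:

   -- Cycle through the keys; no positioning needed:
   put 0 into tTotal
   repeat for each line tKey in the keys of tArray
      add tArray[tKey] to tTotal
   end repeat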

As with most things in computing, the truly optimal solution comes with a lot of "depends": total data size, size of elements, distance from the start of a chunk to the value being obtained, how deeply nested the array keys are - all those and more play a role in total performance, which can sometimes yield unexpected results.

One challenge with arrays is their use in CGIs, where total throughput is unusually critical since the app is born, lives, and dies all in the space of satisfying a single request from the user.

The problem with arrays in that context is that they don't exist when the routine begins: the engine itself needs to be loaded, and the array built in memory, before any lookup can happen.

Arrays offer blinding speed for random access, but they can do this only because they rely on in-memory hash structures, leaving us with the question: how do we load the array from a cold start?

One can use custom properties, or arrayEncode/arrayDecode, or split/combine, but all of them are only slightly optimized versions of what you'd need to do if you had to script it yourself using "repeat for each line..." and stuffing the array elements sequentially.
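
For example, a cold-start load might look something like this - a sketch
only, with hypothetical file paths:

   -- Restore an array saved earlier with arrayEncode;
   -- binfile: is needed since the encoded data is binary:
   put URL "binfile:/var/www/data/cache.lcarray" into tBlob
   put arrayDecode(tBlob) into tArray

   -- Or rebuild it from a tab-delimited text file; split
   -- does in one statement what the manual loop would do:
   put URL "file:/var/www/data/cache.txt" into tData
   split tData by return and tab
   -- tData is now an array keyed by the first item of each line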

So oddly enough, if the context of use requires that you take into account the loading of the array, total throughput will often be substantially slower than scooping up a delimited file and using chunk expressions on it.
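
For comparison, the chunk-expression version skips the array entirely - a
sketch, with a hypothetical file path and lookup key:

   -- Scoop up the file and scan it with chunk expressions;
   -- no decode or array-building step required:
   put URL "file:/var/www/data/cache.txt" into tData
   set the itemDelimiter to tab
   repeat for each line tLine in tData
      if item 1 of tLine is "targetKey" then
         put item 2 of tLine into tValue
         exit repeat
      end if
   end repeat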

Even outside of a total-throughput context, I've seen other cases where arrays can be slower than "repeat for each", such as deeply-nested arrays (say, four levels deep). In such cases, while each traversal of the hash used to identify the location of the element value is pretty darn fast, you'll have to do four traversals of each hash to get at each element, and that can add up.

Moreover, arrays can impact memory in ways that chunks don't, because in a world where we don't yet have structs (see <http://quality.runrev.com/show_bug.cgi?id=8304>), key labels are stored again for every element. With a tab-delimited list the non-data overhead is one char per field, but with arrays it's the length of the key for every field, which can double the size of the data in memory if the keys are as long as the data.
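
A back-of-the-envelope example with made-up sizes shows the scale of it:

   10,000 records x 10 fields, 10-char values:
     tab-delimited:  10,000 x 10 x (10 + 1)  = ~1.1 MB
     10-char keys:   10,000 x 10 x (10 + 10) = ~2.0 MB (plus hash overhead)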

So alas, as you folks have done here, often the only way to know for sure which solution is optimal is to test it.

If you find yourself doing this sort of thing often, I've put together a few tips on benchmarking performance in this LiveCode Journal article:

<http://livecodejournal.com/tutorials/benchmarking-revtalk.html>
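
If you just want the core pattern, a minimal sketch using the
milliseconds looks like this:

   -- Repeat the code under test enough times for the
   -- difference to rise above timer resolution:
   put 10000 into tIterations
   put the milliseconds into tStart
   repeat tIterations times
      -- ...code under test goes here...
   end repeat
   put the milliseconds - tStart into tElapsed
   put tElapsed / tIterations into tPerCall  -- avg ms per call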

--
 Richard Gaskin
 Fourth World
 LiveCode training and consulting: http://www.fourthworld.com
 Webzine for LiveCode developers: http://www.LiveCodeJournal.com
 LiveCode Journal blog: http://LiveCodejournal.com/blog.irv
