Re: [pkg-discuss] Private reply Re: CR for 6516, 5662, 6227, 5866, 6226, partial fix for 6365

Brock Pytlik Wed, 11 Feb 2009 16:48:07 -0800

[email protected] wrote:

 - split the data models per authority/repository. This will improve
performance of refilter/search functions and switching between
categories; it also simplifies the filter functions a little. The
problem starts when users add the blastwave, pending, dev, sunfreeware
and contrib repos: our data model then contains all packages and the
GUI search functionality is bad.
I agree that the complete-as-you-type search can't scale as currently
created. I don't agree that this solution is anything more than a patch
that will tide us over for a little while, and I think making a
significant shift in how you organize the data for a short term fix is a
questionable decision. Release currently has 20k packages, dev has 25k,
and pending has 11k. For those repos, the problem has been roughly
reduced by a factor of three. (really more like factors of 2, 2, and 5).
Anyway, I'm not sure that that feature is worth reworking the
organizational structure of your code. Those repos will grow, so unless
I've missed something, we've just pushed the day of reckoning off for a
few months (or years depending on their growth factors). It's quite
possible that using the filters from gtk.ListStore isn't going to scale
going forward. I'm not familiar with the code backing that function, nor
am I certain that that's where the slowdown is, but I think I remember
it being implemented.
The problem is that the size of the dataset is unbounded, to paraphrase
Brock's response.  The complete-as-you-type search doesn't really make
sense in this kind of a situation.  If you need a complete-as-you-type
feature because life won't be complete without one, consider caching
previous search results and performing an autocomplete for historical
searches.  This idea would be similar to how your web browser tries to
complete websites as you type in the URL.
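
A minimal sketch of the history-based autocomplete described above,
assuming past searches are kept in a sorted in-memory list. The class
and method names (SearchHistory, record, complete) are hypothetical and
are not part of the packagemanager code.

```python
import bisect


class SearchHistory:
    """Remembers past search strings and completes a typed prefix."""

    def __init__(self):
        self._queries = []  # kept sorted and free of duplicates

    def record(self, query):
        """Store a search the user actually ran."""
        i = bisect.bisect_left(self._queries, query)
        if i == len(self._queries) or self._queries[i] != query:
            self._queries.insert(i, query)

    def complete(self, prefix, limit=5):
        """Return up to `limit` past searches that start with `prefix`."""
        i = bisect.bisect_left(self._queries, prefix)
        matches = []
        while (i < len(self._queries) and len(matches) < limit
               and self._queries[i].startswith(prefix)):
            matches.append(self._queries[i])
            i += 1
        return matches
```

The cost of a completion depends on the number of searches the user has
run, not on the number of packages in the repositories.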

I really like that idea, j, but I do think the complete-as-you-type might
be tractable with the right data structure. Since the search is
currently only happening over package names, imagine a tree for package
names where each layer is a letter in the name. So, if a user types in
abc, I take the a branch, then the b, then the c branch. Once I get
there, I determine the number of answers that can be shown on the screen
at one time, and traverse the tree alphabetically to find the first N
answers under abc. Other than holding that tree in memory, I think that
design scales. And my guess (and it's only a guess) is that by using
lists cleverly, we can probably store a fairly large number of packages
efficiently in that structure w/out a huge memory hit.
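
A rough sketch of such a tree in Python, to make the lookup concrete. It
uses a plain dict per node rather than the list-based layout guessed at
above, and the class and method names (PackageNameTrie, add,
first_matches) are made up for illustration.

```python
import itertools


class PackageNameTrie:
    """Prefix tree over package names, one level per character."""

    def __init__(self):
        self.children = {}    # character -> child PackageNameTrie
        self.is_name = False  # True if a package name ends at this node

    def add(self, name):
        node = self
        for ch in name:
            node = node.children.setdefault(ch, PackageNameTrie())
        node.is_name = True

    def _walk(self, prefix):
        """Lazily yield every name under this node in alphabetical order."""
        if self.is_name:
            yield prefix
        for ch in sorted(self.children):
            yield from self.children[ch]._walk(prefix + ch)

    def first_matches(self, prefix, limit):
        """Follow `prefix` down the tree, then return the first `limit` names."""
        node = self
        for ch in prefix:
            node = node.children.get(ch)
            if node is None:
                return []
        return list(itertools.islice(node._walk(prefix), limit))
```

Because the walk is a generator, only as many nodes are visited as are
needed to fill the screen; the total number of packages matters only for
memory.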

When the user hits return, we basically produce a generator which will
spit out the next package if/as the user scrolls. The result is popped
onto the screen, and the result is stored in a list (or we auto populate
the list as fast as we can from the generator as we make a spinner spin
or bump a UI counter).
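
One way the generator-plus-list idea could look, as a sketch only. The
names LazyResults, rows and search are hypothetical, and search is a
stand-in substring match rather than the real search call.

```python
import itertools


class LazyResults:
    """Pulls results from a generator on demand and remembers them."""

    def __init__(self, source):
        self._source = iter(source)
        self._seen = []  # results already handed to the UI

    def rows(self, start, count):
        """Return up to `count` results beginning at index `start`."""
        needed = start + count - len(self._seen)
        if needed > 0:
            # Advance the generator only as far as the view requires.
            self._seen.extend(itertools.islice(self._source, needed))
        return self._seen[start:start + count]


def search(names, text):
    """Stand-in search: lazily yields the names containing `text`."""
    return (name for name in names if text in name)
```

Scrolling down asks for rows further along and advances the generator;
scrolling back up is served from the stored list.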

At least that was the idea I had kicking around in my head when I
mentioned it in the first place.


Brock

-j


_______________________________________________
pkg-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/pkg-discuss
