On Thu, Aug 27, 2009 at 09:19:08PM -0500, Mike Gerdts wrote:
> http://mail.opensolaris.org/pipermail/pkg-discuss/2009-August/032980.html
>
> Here's my starting point with profiling "pkg search -l ls":
> ...
>
>    667          for line in self._file_handle:
>    668              res = line.split(" ")
>    669              token = self.__unquote(res[0])
>    670              offset = int(res[1])
>    671              self._dict[token] = offset
>
> In other words, each line of code between 668 and 671 takes an average
> of 4.5 seconds to execute.  It is parsing this file:
As I said, LISP^WPython programmers know the value of every expression...

> # wc -l /var/pkg/index/token_byte_offset.v1
> 691111 /var/pkg/index/token_byte_offset.v1
>
> Thinking that it could be an inefficient hash algorithm, I split that
> off into a separate function so it could be profiled individually.

One thing I find odd about IPS is that it has its own indexing scheme,
and it doesn't strike me as a very good idea to home-grow your own when
you can use any number of off-the-shelf ones...

> Time in read_dict_file improved by over 20% (17.9 sec -> 14.2 sec) by
> inlining one tiny function.  Overall execution time improved by 13%
> (33.9 sec -> 29.6 sec).

Hot loops need to execute fast.  This is a hot loop.  Reducing the
number of calls to read_dict_file should help even more.

> It seems as though there are two things to consider out of this:
>
> 1) It is not just recursive calls that are horribly expensive.

Yeah, hot loops matter.

Nico
--
_______________________________________________
pkg-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/pkg-discuss
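The inlining win discussed in the thread can be reproduced with a minimal sketch. The helper name `unquote` and its `%xx`-style body are assumptions for illustration only; the real pkg(5) `__unquote` differs, but the point is the per-line Python function-call overhead in a hot loop over hundreds of thousands of lines:

```python
# Sketch (assumed names/logic, not the actual pkg code): parsing a
# "token offset" dictionary file, once via a per-line helper call and
# once with the helper's body inlined into the loop.
def unquote(token):
    # Hypothetical stand-in for self.__unquote().
    return token.replace("%20", " ")

def read_dict_calling(lines):
    d = {}
    for line in lines:
        res = line.split(" ")
        d[unquote(res[0])] = int(res[1])  # one helper call per line
    return d

def read_dict_inlined(lines):
    d = {}
    for line in lines:
        res = line.split(" ")
        # Same work, helper body inlined: no per-line call overhead.
        d[res[0].replace("%20", " ")] = int(res[1])
    return d

if __name__ == "__main__":
    import timeit
    lines = ["token%d %d" % (i, i * 10) for i in range(100000)]
    assert read_dict_calling(lines) == read_dict_inlined(lines)
    for fn in (read_dict_calling, read_dict_inlined):
        print(fn.__name__, timeit.timeit(lambda: fn(lines), number=10))
```

On CPython the inlined loop typically runs measurably faster, in the same spirit as the ~20% improvement reported above, since each Python-level call carries frame-setup cost that a 691111-iteration loop pays every time.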
