Re: How to limit length of PrettyPrinter

dn via Python-list Thu, 23 Jul 2020 14:14:25 -0700

Redirected from Digest (see below)


On 23/07/2020 11:59, Stavros Macrakis wrote:
> Mousedancer, thanks!

Yes, I even look like a (younger) Kevin Costner!
(you believe me - right!?)

> As a finger exercise, I thought I'd try implementing print-level and print-length as an object-to-object transformer (rather than a prettyprinter). I know that has a bunch of limitations, but I thought I might learn something by trying.

> Here's a simple function that will copy nested lists while limiting their depth and length. When it encounters a non-iterable object, it treats it as atomic:

>
>     scalartypes = list(map(type,(1,1.0,1j,True,'x',b'x',None)))
>
>     def limit(obj,length=-2,depth=-2):
>          if type(obj) in scalartypes:
>              return obj
>          if depth==0:
>              return 'XXX'
>          lencnt = length
>          try:
>              new = type(obj).__new__(type(obj))  # empty object of same type
>              for i in obj:
>                  lencnt = lencnt - 1
>                  if lencnt == -1:
>                      new.append('...')          # too long
>                      break
>                  else:
>                      new.append(limit(i,length,depth-1))
>              return new
>          except:                                # which exceptions?
>              return obj                         # not iterable/appendable
>
>     limit( [1,2,[31,[321,[3221, 3222],323,324],33],4,5,6], 3,3)
>
>             => [1, 2, [31, [321, 'XXX', 323, '...'], 33], '...']
>
>
>

> This works fine for lists, but not for tuples (because they're immutable, so no *append*) or dictionaries (must use *for/in* *obj.items*, and there's no *append*). There must be some way to handle this generically so I don't have to special-case tuples (which are immutable, so don't have *append*) and dictionaries (where you have to iterate over *obj.items()*... and there's no *append*), but I'm stuck. Should I accumulate results in a list and then make the list into a tuple or dictionary or whatever at the end? But how do I do that?

> It's not clear how I could handle /arbitrary/ objects... but let's start with the standard ones.


This looks like fun!

BTW why are we doing it: is it some sort of 'homework assignment' or are you a dev 'scratching an itch'?

May I suggest a review of the first few pages/chapters in the PSL docs (Python Standard Library): Built-in Functions, -Constants, -Types, and -Exceptions. Also, try typing into the REPL:


    from pprint import pprint as pp
    pp( __builtins__.__dict__ )

(you will recognise the dict keys from the docs). These may give you a more authoritative basis for "scalartypes", etc.

If you're not already familiar with isinstance() and type() then these are (also) most definitely useful tools, and thus worth a read...
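To make that concrete, here is a minimal sketch (mine, not tested beyond the built-in containers) of how isinstance() and type() could answer the "accumulate in a list, then convert at the end" question. The names SCALARS and limit_copy are made up for illustration, and I'm assuming that a negative length or depth means "no limit", as your defaults seem to intend:

```python
# Hypothetical names; a sketch for plain built-in containers only.
SCALARS = (int, float, complex, bool, str, bytes, type(None))

def limit_copy(obj, length=-1, depth=-1):
    """Copy obj, keeping at most `length` items per container
    and at most `depth` levels of nesting (negative = no limit)."""
    if isinstance(obj, SCALARS):
        return obj
    if depth == 0:
        return 'XXX'
    if isinstance(obj, dict):
        pairs = []                       # accumulate (key, value) pairs
        for count, (key, value) in enumerate(obj.items()):
            if count == length:
                pairs.append(('...', '...'))      # too long
                break
            pairs.append((key, limit_copy(value, length, depth - 1)))
        return type(obj)(pairs)          # rebuild a dict from the pairs
    if isinstance(obj, (list, tuple, set, frozenset)):
        items = []                       # accumulate in a plain list
        for count, item in enumerate(obj):
            if count == length:
                items.append('...')               # too long
                break
            items.append(limit_copy(item, length, depth - 1))
        return type(obj)(items)          # convert back to the original type
    return obj                           # anything else is treated as atomic

result = limit_copy([1, 2, [31, [321, [3221, 3222], 323, 324], 33], 4, 5, 6], 3, 3)
# result == [1, 2, [31, [321, 'XXX', 323, '...'], 33], '...']
```

The trick is that list, tuple, set and frozenset all accept an iterable in their constructor, and dict accepts an iterable of pairs, so type(obj)(items) rebuilds the original kind of container. Sub-classes with a different constructor signature (namedtuple, defaultdict) would need their own branch, and which items survive from a set is arbitrary because sets are unordered.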

With bottom-up prototyping it is wise to start with the 'standard' cases! (and to 'extend' one bite at a time)

Rather than handling objects (today's expansion on the previous), might I refer you back to the objective, which (I assume) requires the output of a 'screen-ready' string. Accordingly, as the data-structure/network is parsed/walked, each recognised component could be recorded as a string, rather than kept/maintained/reproduced in its native form.


Thus:
- find a scalar, stringify it
- find a list, the string is "["
- find a dict, the string is "{"
- find a tuple, the string is "("
etc

The result then, is a series of strings.

a) These could be accumulated, ready for output as a single string. This would make it easy to have a further control which limits the number of output characters.


b) If the accumulator is a list, then

    accumulator.append( stringified_element )

works happily. Plus, the return statement can use a str.join() to turn the accumulator-list into a single string. (trouble is, if the values should be comma-separated, you don't want to separate a bracket (eg a list's open/close) from the list-contents with a comma!) So, maybe that should be done at each layer of nesting?
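By way of illustration only, the "stringify as you walk, join at each layer" idea might be sketched like this. The names BRACKETS and render are invented for the example, and (as above) a negative length or depth is assumed to mean "no limit":

```python
# Hypothetical sketch: build the output string directly while walking.
BRACKETS = {list: "[]", tuple: "()", set: "{}", dict: "{}"}

def render(obj, length=-1, depth=-1):
    """Return a display string for obj, limited in length and depth."""
    brackets = BRACKETS.get(type(obj))
    if brackets is None:
        return repr(obj)                 # scalar (or unknown type): stringify it
    opener, closer = brackets
    if depth == 0:
        return opener + "..." + closer   # too deep: show an empty-looking shell
    parts = []                           # the accumulator for this layer only
    is_dict = isinstance(obj, dict)
    entries = obj.items() if is_dict else obj
    for count, entry in enumerate(entries):
        if count == length:
            parts.append("...")          # too long
            break
        if is_dict:
            key, value = entry
            parts.append(render(key, length, depth - 1) + ": "
                         + render(value, length, depth - 1))
        else:
            parts.append(render(entry, length, depth - 1))
    # join inside the brackets, so no comma lands next to an open/close
    return opener + ", ".join(parts) + closer

text = render([1, 2, [31, [321, [3221, 3222], 323, 324], 33], 4, 5, 6], 3, 3)
# text == "[1, 2, [31, [321, [...], 323, ...], 33], ...]"
```

Because the join happens once per layer, the brackets never pick up a stray comma, and a character limit becomes a simple slice of the result, eg render(data)[:40]. Known rough edges of this sketch: a one-element tuple comes out as "(1)" rather than "(1,)", and an empty set as "{}" rather than "set()".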


Can you spell FSM?
(Finite State Machine)

Next set of thoughts: I'm wondering if you mightn't glean a few ideas from reviewing the pprint source-code? (on my (Fedora-Linux) machine it is stored as /usr/lib64/python3.7/pprint.py)

Indeed, with imperial ambitions of 'embrace and extend', might you be able to sub-class pprint's PrettyPrinter class and bend it to your will?
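A rough sketch of what such a sub-class might look like, offered as one possibility rather than a recommendation. PrettyPrinter already accepts a depth argument; the max_length argument, the trim() method and the _More marker class below are all my own inventions, and only plain lists and tuples are handled:

```python
import pprint

data = [1, 2, [31, [321, [3221, 3222], 323, 324], 33], 4, 5, 6]

# Built in already: 'depth' replaces anything nested too deeply with '...'
shallow = pprint.pformat(data, depth=2)


class _More:
    """Hypothetical marker whose repr is a bare '...' (no quotes)."""
    def __repr__(self):
        return "..."


class LimitedPrinter(pprint.PrettyPrinter):
    """Hypothetical sub-class: trims long lists/tuples before printing."""

    def __init__(self, *args, max_length=None, **kwargs):
        super().__init__(*args, **kwargs)
        self.max_length = max_length     # None means 'no limit'

    def trim(self, obj):
        # Only plain lists and tuples are trimmed; the rest passes through.
        if type(obj) not in (list, tuple):
            return obj
        kept = [self.trim(item) for item in obj[:self.max_length]]
        if self.max_length is not None and len(obj) > self.max_length:
            kept.append(_More())         # show that items were dropped
        return type(obj)(kept)

    def pformat(self, object):
        return super().pformat(self.trim(object))

    def pprint(self, object):
        super().pprint(self.trim(object))


printer = LimitedPrinter(max_length=3)
trimmed = printer.pformat(data)
# shallow == '[1, 2, [31, [...], 33], 4, 5, 6]'
# trimmed == '[1, 2, [31, [321, [3221, 3222], 323, ...], 33], ...]'
```

This version only overrides the two public methods and trims a copy of the data before handing it to the parent class, so it does not depend on pprint's private internals. Passing depth=2 as well combines both limits.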

Lastly, (and contrasting with your next comment) I became a little intrigued, so yesterday, whilst waiting for an on-line meeting's (rather rude, IMHO) aside to finish (and thus move on to topics which involved me!), I had a little 'play' with the idea of a post-processor (per previous msg).

What I built gives the impression that "quick and dirty" is a thoroughly-considered and well-designed methodology, but the prototype successfully shortens pprint-output to a requisite number of elements. Thus:


    source_data = [1,2,[31,[321,[3221, 3222],323,324],33],4,5,6]
    limit( source_data, 3 )

where the second argument (3) is the element-count/-limit; results in:

    [1,2,[31

ie the first three elements extracted from nested lists (tuples, sets, scalars, etc).
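For anyone wanting to experiment with the post-processor idea, something along the following lines might serve as a starting point. To be clear, this is a fresh sketch and not the prototype described above: limit_output and ELEMENT are made-up names, an "element" is assumed to be any run of characters that is not a bracket, comma or whitespace, and pformat's usual ", " spacing is kept:

```python
import pprint
import re

# Assumption: an element is a run of non-bracket, non-comma, non-space text.
ELEMENT = re.compile(r"[^\[\](){},\s]+")

def limit_output(data, max_elements):
    """Cut pprint's output immediately after the Nth element."""
    text = pprint.pformat(data)
    count = 0
    for match in ELEMENT.finditer(text):
        count += 1
        if count == max_elements:
            return text[:match.end()]    # everything up to the Nth element
    return text                          # fewer elements than the limit

source_data = [1, 2, [31, [321, [3221, 3222], 323, 324], 33], 4, 5, 6]
short = limit_output(source_data, 3)
# short == '[1, 2, [31'
```

Being "quick and dirty" too, it will miscount strings that contain spaces, commas or brackets, and a dict's keys are counted as elements alongside its values.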

(recall my earlier query about what constitutes an "element"?)


> Sorry for the very basic questions!

No such thing - what is "basic" to you, might seem 'advanced' to someone else, and v-v. Plus, you never know how many 'lurkers' (see below) might be quietly benefiting from their observation of any discussion!



PS on which subject, List Etiquette:

There are many people who 'lurk' on the list - which is fine. Presumably they are able to read contributions and learn from what seems interesting. This behavior is (to me) a major justification for the digest service - not being 'bombarded' by many email msgs is how some voice their concerns/preference.

However, once one asks a question, one's involvement is no longer passive ('lurking'). Hence:


>     When replying, please edit your Subject line so it is more specific
>     than "Re: Contents of Python-list digest..."
...

>        16. Re: How to limit *length* of PrettyPrinter (dn)
...

Further, many of us manage our email 'bombardment' through 'organisation' rather than 'limitation' (or 'condensation'?); and thus "threading" is important - most competent mail-clients offer this, as do GMail and many web-mail services. From a list perspective, this collects and maintains all parts of a conversation - your contributions and mine, in the 'same place'. Sadly, switching between the list-digest and single-messages breaks threading! Also, no-one (including the archiving software) looking at the archive (or the digest) would be able to detect any link between an earlier conversation called "How to limit *length* of PrettyPrinter" and one entitled "...Digest..."!

--
Regards =dn

--
https://mail.python.org/mailman/listinfo/python-list
