Re: [Tutor] Query: lists

Cameron Simpson Tue, 14 Aug 2018 14:40:58 -0700

On 14Aug2018 18:11, Deepti K <kdeepti2...@gmail.com> wrote:

when I pass ['bbb', 'ccc', 'axx', 'xzz', 'xaa'] as words to the below
function, it picks up only 'xzz' and not 'xaa'


def front_x(words):
 # +++your code here+++
 a = []
 b = []
 for z in words:
   if z.startswith('x'):
     words.remove(z)
     b.append(z)
     print 'z is', z
 print 'original', sorted(words)
 print 'new', sorted(b)
 print sorted(b) + sorted(words)

That is because you are making a common mistake which applies to almost any data structure, but is particularly easy with lists and loops: you are modifying the list _while_ iterating over it.


After you go:

 words.remove(z)

all the elements _after_ z (i.e. those after 'xzz', i.e. ['xaa']) are moved down the list by one position.

In your particular case, that means that 'xaa' is now at index 3, and the next iteration of the loop would have picked up position 4. Therefore the loop doesn't get to see the value 'xaa'.

A "for" loop and almost anything that "iterates" over a data structure does not work by taking a copy of that structure ahead of time, and looping over the values. This is normal, because a data structure may be of any size - you do not want to "make a copy of all the values" by default - that can be arbitrarily expensive.

Instead, a for loop obtains an "iterator" of what you ask it to loop over. The iterator for a list effectively has a reference to the list (in order to obtain the values) and a notion of where in the list it is up to (i.e. a list index, a counter starting at 0 for the first element and incrementing until it reaches the length of the list).
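
A small sketch of that point, using the builtin iter() and next() to drive the iterator by hand. The list contents here are made up for illustration. Because the iterator holds a reference rather than a copy, a change made to the list after the iterator is created is still seen by it:

```python
# Illustrative data only; not from the original question.
words = ['a', 'b']

it = iter(words)       # what a "for" loop obtains behind the scenes
first = next(it)       # the iterator hands back words[0]

words.append('c')      # modify the list while the iterator is still live

rest = list(it)        # the iterator carries on from index 1 of the *current* list
print(first)
print(rest)
```

If the iterator had taken a copy up front, the appended 'c' would not show up in "rest".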

So when you run "for z in words", the iterator is up to index 3 when you reach "xzz". So words[3] == "xzz". After you remove "xzz", words[3] == "xaa" and in this case there is no longer a words[4] at all because the list is shortened. So the next loop iteration never inspects that value. Even if the list had more values, the loop would still skip the "xaa" value.
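
To watch this happen, here is a cut-down version of your loop that only records which values the loop actually visits. The "visited" list is my own addition for tracing and is not part of your function:

```python
words = ['bbb', 'ccc', 'axx', 'xzz', 'xaa']
visited = []   # hypothetical helper list: every value the loop hands us

for z in words:
    visited.append(z)
    if z.startswith('x'):
        # Removing shifts every later element down one index,
        # but the iterator's counter still moves forward by one.
        words.remove(z)

print(visited)
print(words)
```

The value 'xaa' never appears in "visited", and it is still sitting in "words" afterwards.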


You should perhaps ask yourself: why am I removing values from "words"?

If you're just trying to obtain the values starting with "x" you do not need to modify words because you're already collecting the values you want in "b".

If you're trying to partition words into values starting with "x" and values not starting with "x", you're better off making a separate collection for the "not starting with x" values. And that has me wondering what the list "b" in your code was for originally.
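
One possible shape for the partitioning approach, as a sketch only. I am assuming you want the "x" words sorted first, followed by the rest sorted, as your final print suggests. The names "x_words" and "other_words" are mine, and returning the result instead of printing it is also my choice:

```python
def front_x(words):
    x_words = []       # values starting with 'x'
    other_words = []   # everything else
    for z in words:
        # Only read from "words"; never modify it inside the loop.
        if z.startswith('x'):
            x_words.append(z)
        else:
            other_words.append(z)
    return sorted(x_words) + sorted(other_words)

my_words = ['bbb', 'ccc', 'axx', 'xzz', 'xaa']
result = front_x(my_words)
print(result)
```

Since nothing is removed from "words" during the loop, every value gets inspected, and the caller's list is left alone.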

As a matter of principle, functions that "compute a value" (in your case, a list of the values starting with "x") should try not to modify what they are given as parameters. When you pass values to Python functions, you are passing a reference, not a new copy. If a function modifies that reference's _content_, as you do when you go "words.remove(z)", you're modifying the original.


Try running this code:

 my_words = ['bbb', 'ccc', 'axx', 'xzz', 'xaa']
 print 'words before =', my_words
 front_x(my_words)
 print 'words after =', my_words

You will find that "my_words" has been modified. This is called a "side effect", where calling a function affects something outside it. It is usually undesirable.
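
For contrast, a tiny sketch of two functions, one with a side effect and one without. Both function names are invented for this example and the data is illustrative:

```python
def drop_first_in_place(items):
    # Side effect: this changes the caller's list, because "items"
    # refers to the very same list object the caller passed in.
    items.pop(0)

def without_first(items):
    # No side effect: slicing builds a new list and leaves "items" alone.
    return items[1:]

original = ['bbb', 'ccc', 'axx']

shorter = without_first(original)
print(original)   # unchanged by without_first

drop_first_in_place(original)
print(original)   # now one element shorter
```

The second style is usually easier to reason about, because the caller decides what happens to its own data.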


Cheers,
Cameron Simpson <c...@cskk.id.au>
