On Sun, May 1, 2011 at 11:54, darklow <dark...@gmail.com> wrote:
> Finally! removing portion by portion from actors.list i found the exact line
> that creates error.

Very good, thanks!
And now, the long awaited news: this bug is fixed! :D

The problem was fired by this role:
  [(1972) Le rescapé]

The imdbpy2sql.py script did nothing wrong, stripping the first and last
square brackets; after that, the string is considered to be a character
name (remember that we're parsing a role) and parsed using the
imdb.utils.analyze_name function.
This function parses also people names, and in some circumstances
the name contains a reference to the dates of birth and death.
Unfortunately, stripping these notes, I made some wrong assumption (like
the fact that a name can't begin with a parenthesis), and this lead to the
last char of the name stripped - whichever it was.
In our case, the name started with an open parenthesis and ended with
a char that in utf8 occupies two bytes... the rest of the story is known. :-)

The fix is already in the mercurial repository; thank you very much for
the extensive tests and debug!

-- 
Davide Alberani <davide.alber...@gmail.com>  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/

------------------------------------------------------------------------------
WhatsUp Gold - Download Free Network Management Software
The most intuitive, comprehensive, and cost-effective network 
management toolset available today.  Delivers lowest initial 
acquisition cost and overall TCO of any competing solution.
http://p.sf.net/sfu/whatsupgold-sd
_______________________________________________
Imdbpy-help mailing list
Imdbpy-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help

Reply via email to