On Sun, May 1, 2011 at 11:54, darklow <dark...@gmail.com> wrote: > Finally! removing portion by portion from actors.list i found the exact line > that creates error.
Very good, thanks! And now, the long awaited news: this bug is fixed! :D The problem was fired by this role: [(1972) Le rescapé] The imdbpy2sql.py script did nothing wrong, stripping the first and last square brackets; after that, the string is considered to be a character name (remember that we're parsing a role) and parsed using the imdb.utils.analyze_name function. This function parses also people names, and in some circumstances the name contains a reference to the dates of birth and death. Unfortunately, stripping these notes, I made some wrong assumption (like the fact that a name can't begin with a parenthesis), and this lead to the last char of the name stripped - whichever it was. In our case, the name started with an open parenthesis and ended with a char that in utf8 occupies two bytes... the rest of the story is known. :-) The fix is already in the mercurial repository; thank you very much for the extensive tests and debug! -- Davide Alberani <davide.alber...@gmail.com> [PGP KeyID: 0x465BFD47] http://www.mimante.net/ ------------------------------------------------------------------------------ WhatsUp Gold - Download Free Network Management Software The most intuitive, comprehensive, and cost-effective network management toolset available today. Delivers lowest initial acquisition cost and overall TCO of any competing solution. http://p.sf.net/sfu/whatsupgold-sd _______________________________________________ Imdbpy-help mailing list Imdbpy-help@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/imdbpy-help