After looking at the pyparsing results, I think I see the problem with 
your original code.  You are selecting only the characters after the 
rightmost "-" character, but you really want to select everything to 
the right of "- -".  In some of the titles, the encoded Chinese 
includes a "-" character, so you are chopping off everything before 
that.

Try changing your code to:
    title=full_title.split("- -")[1]

I think then your original program will work.

-- Paul

-- 
http://mail.python.org/mailman/listinfo/python-list

Reply via email to