Re: [Pywikipedia-l] Using a true MediaWiki parser (mwparserfromhell) instead of textlib methods

2013-07-14 Thread Ben Kurtovic
On Jul 14, 2013, at 6:48 AM, Dr. Trigon wrote: > One important question for me here is "How is the handling/behaviour > for malform(at)ed wiki syntax [...] > > I am despirately seeking a parser that has the same error behaviour > and gives the same results like the original mw parser also in case

Re: [Pywikipedia-l] Using a true MediaWiki parser (mwparserfromhell) instead of textlib methods

2013-07-14 Thread Dr. Trigon
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 One important question for me here is "How is the handling/behaviour for malform(at)ed wiki syntax, like e.g. a text body: ### ### ### ### ### ### ### ### ### ### ### ### ### ### ### ### ### ### === bad header 1 == A text containing mathematical equ

Re: [Pywikipedia-l] Script halted, textlib.py?

2013-07-14 Thread info
Hi Binaris, I was investigating in that issue again and found a wrong order for the re.escape() sequence. This caused some errors in past with blanks and underlines in the section headers. I fixed it in r11755/r11756. Could you please validate it for your script. Thanks and sorry for trouble