Re: [pywikibot] weblinchecker data migration from compat to core

2015-08-07 Thread masti

On 08/07/2015 11:50 PM, Merlijn van Deen wrote:

Hi masti,

On 7 August 2015 at 23:11, masti mast...@gmail.com wrote:

The weblinkchecker.py working file deadlinks-wikipedia-pl.dat has a
binary format in core. In compat it was a text file.


This confuses me. As far as I can see, both compat and core use 'pickle'
to write the working file (and as far as I can see, compat always has).
There are, however, different versions of the pickle format, which could
explain the binary/text difference. Could you post an excerpt of the old
file?



(dp0
Vhttp://www.european-athletics.org/european-athletics-awards-night/baldini-says-goodbye-at-the-giro-al-sas.html
p1
(lp2
(VStefano Baldini
p3
F1404150662.121833
S'404 Not Found'
p4
tp5
a(V2010 w lekkoatletyce
p6
F1404569856.34401
S'404 Not Found'
p7
tp8
a(VStefano Baldini
p9
F1404729661.197063
S'404 Not Found'
p10
tp11
a(V2010 w lekkoatletyce
p12
F1404761790.987255
S'404 Not Found'
p13
tp14
a(VStefano Baldini
p15
F1413308613.299339
S'404 Not Found'
p16
tp17
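
For reference, the excerpt above looks like pickle protocol 0 (the ASCII-based format), and it appears to decode to a dict that maps each URL to a list of (page title, timestamp, error) tuples. Below is a minimal Python 3 sketch, assuming that reading is right. It reuses only the first record of the excerpt, and the closing "as." opcodes are added here so that the fragment is a complete pickle (the posted excerpt is cut off before them).

```python
import pickle

# First record of the posted excerpt. The trailing b"as." is an assumption
# added for this sketch: "a" appends the tuple to the list, "s" stores the
# list under the URL key, and "." ends the pickle.
sample = (
    b"(dp0\n"
    b"Vhttp://www.european-athletics.org/european-athletics-awards-night/"
    b"baldini-says-goodbye-at-the-giro-al-sas.html\n"
    b"p1\n"
    b"(lp2\n"
    b"(VStefano Baldini\n"
    b"p3\n"
    b"F1404150662.121833\n"
    b"S'404 Not Found'\n"
    b"p4\n"
    b"tp5\n"
    b"as."
)

history = pickle.loads(sample)

# Walk the structure: one URL, then every report recorded for it.
for url, reports in history.items():
    for page_title, timestamp, error in reports:
        print(url, page_title, timestamp, error)
```

If the full file loads the same way, the S'...' entries (Python 2 byte strings) come back as ordinary str objects under Python 3, as long as they are plain ASCII like the ones shown here.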





The script works after copying it to the new directory, but it looks
like it does not recognize information about previously reported links.

Is there an error message posted, is the file just being overwritten, or
is there some other kind of issue?


No, there is no error, and it does not overwrite the file. I am still
testing whether it adds the records.
But it tries to report dead links that were previously reported, so this
makes me think the script does not properly recognize the old records.





Best regards,
Merlijn


___
pywikibot mailing list
pywikibot@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/pywikibot






Re: [pywikibot] weblinchecker data migration from compat to core

2015-08-07 Thread Merlijn van Deen

Hi masti,

On 7 August 2015 at 23:11, masti mast...@gmail.com wrote:

 The weblinkchecker.py working file deadlinks-wikipedia-pl.dat has a binary
 format in core. In compat it was a text file.


This confuses me. As far as I can see, both compat and core use 'pickle' to
write the working file (and as far as I can see, compat always has). There
are, however, different versions of the pickle format, which could explain
the binary/text difference. Could you post an excerpt of the old file?
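
To illustrate the point about pickle format versions, here is a small Python 3 sketch. It shows that the same object is written as readable text under protocol 0 and as binary data under protocol 2, and that pickle.loads reads both. The sample record is made up for illustration, and the thread does not state which protocol either branch actually writes.

```python
import pickle

# Hypothetical record, shaped as URL -> list of (page, timestamp, error).
record = {
    "http://example.org/dead": [("Some page", 1404150662.121833, "404 Not Found")]
}

text_like = pickle.dumps(record, protocol=0)  # ASCII-based protocol
binary = pickle.dumps(record, protocol=2)     # binary protocol

# Protocol 0 output is readable in a text editor for ASCII-only data.
print(text_like.decode("ascii"))

# Protocol 2 output starts with the PROTO opcode and the version byte.
print(binary[:2])

# The unpickler detects the protocol itself, so both load the same way.
same = pickle.loads(text_like) == pickle.loads(binary) == record
print(same)
```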


 The script works after copying it to the new directory, but it looks like it
 does not recognize information about previously reported links.

Is there an error message posted, is the file just being overwritten, or is
there some other kind of issue?

Best regards,
Merlijn


[pywikibot] weblinchecker data migration from compat to core

2015-08-07 Thread masti

The weblinkchecker.py working file deadlinks-wikipedia-pl.dat has a binary
format in core. In compat it was a text file.
The script works after copying it to the new directory, but it looks like it
does not recognize information about previously reported links.
As I have been running it on pl.wiki for some years, I need to preserve this
info so that those links are not reported again.


Is there a way to migrate the data to the new format, or a description of
both formats so I can do a file conversion?
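
If both branches do use pickle, as suggested in the replies, one way to change the on-disk format is to load the old file and dump it again with a different protocol. The following is a minimal Python 3 sketch under that assumption. The helper name reencode_pickle, the file names, and the UTF-8 encoding for old byte strings are all hypothetical, and the thread does not establish that re-encoding alone makes core recognize the old records.

```python
import os
import pickle
import tempfile


def reencode_pickle(src_path, dst_path, protocol=2, encoding="utf-8"):
    """Load a pickle file and write it back using another protocol.

    `encoding` only affects Python 2 byte strings (the S'...' entries);
    UTF-8 is an assumption, not something stated in the thread.
    """
    with open(src_path, "rb") as src:
        data = pickle.load(src, encoding=encoding)
    with open(dst_path, "wb") as dst:
        pickle.dump(data, dst, protocol=protocol)
    return data


# Self-contained demonstration on throwaway files, not on real bot data.
workdir = tempfile.mkdtemp()
old_path = os.path.join(workdir, "old.dat")
new_path = os.path.join(workdir, "new.dat")

sample = {
    "http://example.org/dead": [("Some page", 1404150662.121833, "404 Not Found")]
}
with open(old_path, "wb") as f:
    pickle.dump(sample, f, protocol=0)  # text-like file, as in compat

reencode_pickle(old_path, new_path, protocol=2)

with open(new_path, "rb") as f:
    converted = pickle.load(f)
```

Writing to a separate destination file keeps the original working file untouched, which seems prudent for data collected over several years.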


masti
