Hello,
At Saturday 16 March 2013 01:41:47 DaB. wrote:
> Hi all,
>
> I have a script running collecting data in multiple wikipedia(s), I started
> to notice that revision table in lbwiki_p has some incorrect data.
>
> Here is an example:
> mysql> select rev_id, rev_user, rev_page, rev_deleted, rev_len,
> rev_timestamp from revision where rev_id = 185751;
> +--------+----------+----------+-------------+---------+----------------+
> | rev_id | rev_user | rev_page | rev_deleted | rev_len | rev_timestamp  |
> +--------+----------+----------+-------------+---------+----------------+
> | 185751 |      580 |    83446 |           0 |    NULL | 20061203231418 |
> +--------+----------+----------+-------------+---------+----------------+

The result is correct.

> According to my understanding if a record exist rev_len shouldn't be NULL,
> if the revision deleted then rev_deleted should get flag but rev_length
> should remain as it is.
>
> Hope someone can look into this, because people who are doing analysis
> might end up getting wrong results.

rev_len will remain as it is. The problem is that rev_len was not there from the very beginning and was never (AFAIK) back-populated, so very old rows have no length and are NULL.

> Best;
> --
> Anuradha Uduwage (Anu)

Sincerely,
DaB.

-- 
Userpage: [[:w:de:User:DaB.]] — PGP: 0x2d3ee2d42b255885
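
For anyone doing analysis on revision sizes, here is a minimal sketch of how the NULL rows can be kept apart from real lengths. It is only an illustration: an in-memory SQLite database stands in for the MySQL replica, the table has just a few of the columns from the example above, and the two rows with rev_id 900001 and 900002 are made up so the aggregates have something to work on. Only the row with rev_id 185751 comes from this thread.

```python
import sqlite3

# Assumption: in-memory SQLite stands in for the MySQL replica.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE revision ("
    " rev_id INTEGER PRIMARY KEY,"
    " rev_page INTEGER,"
    " rev_len INTEGER,"  # NULL for rows written before the column existed
    " rev_timestamp TEXT)"
)
conn.executemany(
    "INSERT INTO revision VALUES (?, ?, ?, ?)",
    [
        (185751, 83446, None, "20061203231418"),  # row from this thread
        (900001, 83446, 1200, "20120101000000"),  # hypothetical row
        (900002, 83446, 1800, "20130101000000"),  # hypothetical row
    ],
)

# Rows without a stored length must be matched with IS NULL.
missing = conn.execute(
    "SELECT COUNT(*) FROM revision WHERE rev_len IS NULL"
).fetchone()[0]

# A comparison with = NULL never matches any row.
wrong_filter = conn.execute(
    "SELECT COUNT(*) FROM revision WHERE rev_len = NULL"
).fetchone()[0]

# COUNT(column) and AVG(column) skip NULLs, so they describe only the
# rows that actually have a length.
known, avg_len = conn.execute(
    "SELECT COUNT(rev_len), AVG(rev_len) FROM revision"
).fetchone()

print(missing, wrong_filter, known, avg_len)
```

The point of the sketch is that a NULL rev_len means "length not recorded", not "length zero", so it is usually safer to report the number of NULL rows next to any average or sum rather than to replace them with 0.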
_______________________________________________ Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/toolserver-l Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette