Re: [Wikitech-l] Raw page view no longer showing language links for some pages?
On 03/09/2013 10:00 PM, Brian Cassidy wrote: Hello, I'm the co-author of the WWW::Wikipedia Perl module ( https://metacpan.org/release/WWW-Wikipedia). It programmatically parses the raw source of a Wikipedia page. Of late, a few changes in behaviour have been reported to me -- all related to the language functionality. As it turns out a number of pages are no longer returning the language links in the raw source code like they used to. The canonical test for us was to load Russia in English, then grab the Russian link. As you can see, the page for Russia no longer has those links (which can normally be seen down the left-hand side of the real page): http://en.wikipedia.org/w/index.php?title=Russiaaction=raw A shorter example is the page for Rotation http://en.wikipedia.org/w/index.php?title=Rotationaction=raw I did find that some pages still have language links. See this one for Babushka http://en.wikipedia.org/w/index.php?title=Babushkaaction=raw Has there been some change that no longer outputs those links in some instances, or is this an actual bug? I apologize that this wasn't sent to some official bug tracker, but I couldn't find that info off-hand from the Wikipedia site. Thanks in advance, Brian, thanks for your note. Here's our recent blog post on how to file a bug report or feature request in our Bugzilla installation: https://blog.wikimedia.org/2013/03/18/how-to-create-a-good-first-bug-report/ If you find that the API doesn't give you some of what you need for WWW::Wikipedia, please do file a bug. Thanks! -- Sumana Harihareswara Engineering Community Manager Wikimedia Foundation ___ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Re: [Wikitech-l] Raw page view no longer showing language links for some pages?
This is all due to the introduction of Wikidata http://wikidata.org. On Thu, Mar 21, 2013 at 12:32 PM, Sumana Harihareswara suma...@wikimedia.org wrote: On 03/09/2013 10:00 PM, Brian Cassidy wrote: Hello, I'm the co-author of the WWW::Wikipedia Perl module ( https://metacpan.org/release/WWW-Wikipedia). It programmatically parses the raw source of a Wikipedia page. Of late, a few changes in behaviour have been reported to me -- all related to the language functionality. As it turns out a number of pages are no longer returning the language links in the raw source code like they used to. The canonical test for us was to load Russia in English, then grab the Russian link. As you can see, the page for Russia no longer has those links (which can normally be seen down the left-hand side of the real page): http://en.wikipedia.org/w/index.php?title=Russiaaction=raw A shorter example is the page for Rotation http://en.wikipedia.org/w/index.php?title=Rotationaction=raw I did find that some pages still have language links. See this one for Babushka http://en.wikipedia.org/w/index.php?title=Babushkaaction=raw Has there been some change that no longer outputs those links in some instances, or is this an actual bug? I apologize that this wasn't sent to some official bug tracker, but I couldn't find that info off-hand from the Wikipedia site. Thanks in advance, Brian, thanks for your note. Here's our recent blog post on how to file a bug report or feature request in our Bugzilla installation: https://blog.wikimedia.org/2013/03/18/how-to-create-a-good-first-bug-report/ If you find that the API doesn't give you some of what you need for WWW::Wikipedia, please do file a bug. Thanks! -- Sumana Harihareswara Engineering Community Manager Wikimedia Foundation ___ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l ___ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Re: [Wikitech-l] Raw page view no longer showing language links for some pages?
Brian, due to the recent introduction of wikidata.org, most language links are now stored there. Regardless, you should try to avoid getting langlinks from the raw source, because it wastes a lot of bandwidth. Please consider using mediawiki APIhttps://www.mediawiki.org/wiki/APIto get just the data you need. For example, for langlinks, use this query: http://en.wikipedia.org/w/api.php?action=queryprop=langlinkstitles=Russialllimit=maxformat=jsonfm (change jsonfm to json in your code) Using this method you can request langlinks for multiple titles in one request. On Sat, Mar 9, 2013 at 10:00 PM, Brian Cassidy brian.cass...@gmail.comwrote: Hello, I'm the co-author of the WWW::Wikipedia Perl module ( https://metacpan.org/release/WWW-Wikipedia). It programmatically parses the raw source of a Wikipedia page. Of late, a few changes in behaviour have been reported to me -- all related to the language functionality. As it turns out a number of pages are no longer returning the language links in the raw source code like they used to. The canonical test for us was to load Russia in English, then grab the Russian link. As you can see, the page for Russia no longer has those links (which can normally be seen down the left-hand side of the real page): http://en.wikipedia.org/w/index.php?title=Russiaaction=raw A shorter example is the page for Rotation http://en.wikipedia.org/w/index.php?title=Rotationaction=raw I did find that some pages still have language links. See this one for Babushka http://en.wikipedia.org/w/index.php?title=Babushkaaction=raw Has there been some change that no longer outputs those links in some instances, or is this an actual bug? I apologize that this wasn't sent to some official bug tracker, but I couldn't find that info off-hand from the Wikipedia site. Thanks in advance, -- Brian Cassidy (brian.cass...@gmail.com) ___ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l ___ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l