Yes, I added a user agent, and that worked. Thanks!

- Anand

On Tue, Dec 29, 2009 at 4:19 PM, Platonides <[email protected]> wrote:

> Anand Ramanathan wrote:
> > Hi,
> >
> > I have a requirement to inspect a given wikipage's url to determine if
> > its size exceeds 500kb, and to get the page, truncate it to 500kb and
> > pass it to some api.
> >
> > I couldnt find a way to determine the size of a wikipage through the api.
>
> You can get page size via the api using rvprop=size, or by grabbing the
> revision content and looking at its length.
>
>
> > I also tried to call HEAD on the wikipage. I do get results sometimes,
> > but sometimes I get an http 403 forbidden. The same thing happens with
> > http GET on the page. The pages I tried most often for testing were
> > http://en.wikipedia.org/wiki/Barack_Obama
> > and http://en.wikipedia.org/wiki/United_States.
>
> Identify yourself. Give your program a proper user-agent, like "Anand
> 500kb program". If you use the generic user agent of your framework or
> try to make you pass as a browser you will be blocked.
>
> > Can you suggest an effective way to achieve the requirement above?
> >
> > Thanks
> > Anand
>
> _______________________________________________
> Mediawiki-api mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
>
_______________________________________________
Mediawiki-api mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mediawiki-api

Reply via email to