Hi all,

I'm using Scrapy on a site that runs a bulletin board (phpBB), and I would like to start scraping the pages ONLY after authentication has succeeded.

In my code, the authentication is done inside the start_requests method:

    def start_requests(self):
        self.log("start_requests called")

        return [
            Request(
                "http://<mysite>/phpBB3",
                callback=self.parse_welcome,
                priority=100,
            )
        ]

    def parse_welcome(self, response):
        self.log("parse_welcome called")

        request = FormRequest.from_response(
            response,
            formnumber=1,
            formdata={"username": "rightusername", "password": "rightpassword"},
        )

        return request

    rules = (
        Rule(LinkExtractor(), callback='parse_standard', follow=True),
    )

    [cut]


From the output I get, it seems that some pages are scraped without authenticating first.

Am I wrong/missing something? Am I using "priority" in the right way?

Thanks,

Massimo
