Re: DOM related question and problem

2009-12-01 Thread Diez B. Roggisch

bla bla schrieb:

 Nice post on extracting data, simple and too the point :), I use
python for simple html extracting data, but for larger projects like
the web, files, or documents i tried extract data which worked great, they
build quick custom screen scrapers, extracting data, and data parsing
programs


You don't happen to be affiliated with that commercial venture?

Which seems to be shady, to say the least. No real address, dns 
registered by a rather shady provider... better steer clear from this, 
and use lxml.


Diez
--
http://mail.python.org/mailman/listinfo/python-list


Re: DOM related question and problem

2009-12-01 Thread bla bla
 Nice post on extracting data, simple and too the point :), I use
python for simple html extracting data, but for larger projects like
the web, files, or documents i tried extract data which worked great, they
build quick custom screen scrapers, extracting data, and data parsing
programs
-- 
http://mail.python.org/mailman/listinfo/python-list


Re: DOM related question and problem

2009-11-21 Thread elca



Stefan Behnel-3 wrote:
> 
> elca, 18.11.2009 19:04:
>> these day im making python script related with DOM.
>> 
>> problem is these day many website structure is very complicate .
>> [...]
>> what is best method to check  can extract such like following info
>> quickly?
> 
> This should help:
> 
> http://blog.ianbicking.org/2008/12/10/lxml-an-underappreciated-web-scraping-library/
> 
> Stefan
> -- 
> http://mail.python.org/mailman/listinfo/python-list
> 
> 

hello
yes..i know this website already.
but failed to use it lxml solution

-- 
View this message in context: 
http://old.nabble.com/DOM-related-question-and-problem-tp26412730p26455800.html
Sent from the Python - python-list mailing list archive at Nabble.com.

-- 
http://mail.python.org/mailman/listinfo/python-list


Re: DOM related question and problem

2009-11-20 Thread Stefan Behnel
elca, 18.11.2009 19:04:
> these day im making python script related with DOM.
> 
> problem is these day many website structure is very complicate .
> [...]
> what is best method to check  can extract such like following info quickly?

This should help:

http://blog.ianbicking.org/2008/12/10/lxml-an-underappreciated-web-scraping-library/

Stefan
-- 
http://mail.python.org/mailman/listinfo/python-list


Re: DOM related question and problem

2009-11-18 Thread elca



Chris Rebert-6 wrote:
> 
> On Wed, Nov 18, 2009 at 10:04 AM, elca  wrote:
>> Hello,
>> these day im making python script related with DOM.
>>
>> problem is these day many website structure is very complicate .
>>
>> what is best method to check DOM structure and path..
>>
>> i mean...following is some example.
>>
>> what is best method to check  can extract such like following info
>> quickly?
>>
>> before i was spent much time to extract such info .
>>
>> and yes im also new to python and DOM.
>>
>>    IE.Document.Frames(1).Document.forms('comment').value = 'hello'
>>
>> if i use DOM inspector, can i extract such info quickly ? if so would you
>> show me some sample?
>>
>> here is some site . i want to extract some dom info.
>>
>> today i was spent all day long to extract what is dom info. but failed
>>
>> http://www.segye.com/Articles/News/Politics/Article.asp?aid=20091118001261&ctg1=06&ctg2=00&subctg1=06&subctg2=00&cid=010101060
>>
>> at the end of this page,can find some comment input box.
>>
>> i want to know what kind of dom element should have to use, such like
>>
>>    IE.Document.Frames(1).Document.forms('comment').value = 'hello'
>>
>> anyhelp much appreciate thanks
> 
> This sounds suspiciously like a spambot. Why do you want to submit
> comments in an automated fashion exactly?
> 
> Cheers,
> Chris
> --
> http://blog.rebertia.com
> -- 
> http://mail.python.org/mailman/listinfo/python-list
> 
> 
Hello
this is not spambot actually.
it related with my blog scraper..
anyone can help me or advice much appreciate
-- 
View this message in context: 
http://old.nabble.com/DOM-related-question-and-problem-tp26412730p26418556.html
Sent from the Python - python-list mailing list archive at Nabble.com.

-- 
http://mail.python.org/mailman/listinfo/python-list


Re: DOM related question and problem

2009-11-18 Thread Chris Rebert
On Wed, Nov 18, 2009 at 10:04 AM, elca  wrote:
> Hello,
> these day im making python script related with DOM.
>
> problem is these day many website structure is very complicate .
>
> what is best method to check DOM structure and path..
>
> i mean...following is some example.
>
> what is best method to check  can extract such like following info quickly?
>
> before i was spent much time to extract such info .
>
> and yes im also new to python and DOM.
>
>    IE.Document.Frames(1).Document.forms('comment').value = 'hello'
>
> if i use DOM inspector, can i extract such info quickly ? if so would you
> show me some sample?
>
> here is some site . i want to extract some dom info.
>
> today i was spent all day long to extract what is dom info. but failed
>
> http://www.segye.com/Articles/News/Politics/Article.asp?aid=20091118001261&ctg1=06&ctg2=00&subctg1=06&subctg2=00&cid=010101060
>
> at the end of this page,can find some comment input box.
>
> i want to know what kind of dom element should have to use, such like
>
>    IE.Document.Frames(1).Document.forms('comment').value = 'hello'
>
> anyhelp much appreciate thanks

This sounds suspiciously like a spambot. Why do you want to submit
comments in an automated fashion exactly?

Cheers,
Chris
--
http://blog.rebertia.com
-- 
http://mail.python.org/mailman/listinfo/python-list


DOM related question and problem

2009-11-18 Thread elca

Hello,
these day im making python script related with DOM.

problem is these day many website structure is very complicate .

what is best method to check DOM structure and path..

i mean...following is some example.

what is best method to check  can extract such like following info quickly?

before i was spent much time to extract such info .

and yes im also new to python and DOM.

IE.Document.Frames(1).Document.forms('comment').value = 'hello'

if i use DOM inspector, can i extract such info quickly ? if so would you
show me some sample?

here is some site . i want to extract some dom info. 

today i was spent all day long to extract what is dom info. but failed

http://www.segye.com/Articles/News/Politics/Article.asp?aid=20091118001261&ctg1=06&ctg2=00&subctg1=06&subctg2=00&cid=010101060

at the end of this page,can find some comment input box.

i want to know what kind of dom element should have to use, such like 

IE.Document.Frames(1).Document.forms('comment').value = 'hello'

anyhelp much appreciate thanks


-- 
View this message in context: 
http://old.nabble.com/DOM-related-question-and-problem-tp26412730p26412730.html
Sent from the Python - python-list mailing list archive at Nabble.com.

-- 
http://mail.python.org/mailman/listinfo/python-list