Re: DOM related question and problem
bla bla schrieb: Nice post on extracting data, simple and too the point :), I use python for simple html extracting data, but for larger projects like the web, files, or documents i tried extract data which worked great, they build quick custom screen scrapers, extracting data, and data parsing programs You don't happen to be affiliated with that commercial venture? Which seems to be shady, to say the least. No real address, dns registered by a rather shady provider... better steer clear from this, and use lxml. Diez -- http://mail.python.org/mailman/listinfo/python-list
Re: DOM related question and problem
Nice post on extracting data, simple and too the point :), I use python for simple html extracting data, but for larger projects like the web, files, or documents i tried extract data which worked great, they build quick custom screen scrapers, extracting data, and data parsing programs -- http://mail.python.org/mailman/listinfo/python-list
Re: DOM related question and problem
Stefan Behnel-3 wrote: > > elca, 18.11.2009 19:04: >> these day im making python script related with DOM. >> >> problem is these day many website structure is very complicate . >> [...] >> what is best method to check can extract such like following info >> quickly? > > This should help: > > http://blog.ianbicking.org/2008/12/10/lxml-an-underappreciated-web-scraping-library/ > > Stefan > -- > http://mail.python.org/mailman/listinfo/python-list > > hello yes..i know this website already. but failed to use it lxml solution -- View this message in context: http://old.nabble.com/DOM-related-question-and-problem-tp26412730p26455800.html Sent from the Python - python-list mailing list archive at Nabble.com. -- http://mail.python.org/mailman/listinfo/python-list
Re: DOM related question and problem
elca, 18.11.2009 19:04: > these day im making python script related with DOM. > > problem is these day many website structure is very complicate . > [...] > what is best method to check can extract such like following info quickly? This should help: http://blog.ianbicking.org/2008/12/10/lxml-an-underappreciated-web-scraping-library/ Stefan -- http://mail.python.org/mailman/listinfo/python-list
Re: DOM related question and problem
Chris Rebert-6 wrote: > > On Wed, Nov 18, 2009 at 10:04 AM, elca wrote: >> Hello, >> these day im making python script related with DOM. >> >> problem is these day many website structure is very complicate . >> >> what is best method to check DOM structure and path.. >> >> i mean...following is some example. >> >> what is best method to check can extract such like following info >> quickly? >> >> before i was spent much time to extract such info . >> >> and yes im also new to python and DOM. >> >> IE.Document.Frames(1).Document.forms('comment').value = 'hello' >> >> if i use DOM inspector, can i extract such info quickly ? if so would you >> show me some sample? >> >> here is some site . i want to extract some dom info. >> >> today i was spent all day long to extract what is dom info. but failed >> >> http://www.segye.com/Articles/News/Politics/Article.asp?aid=20091118001261&ctg1=06&ctg2=00&subctg1=06&subctg2=00&cid=010101060 >> >> at the end of this page,can find some comment input box. >> >> i want to know what kind of dom element should have to use, such like >> >> IE.Document.Frames(1).Document.forms('comment').value = 'hello' >> >> anyhelp much appreciate thanks > > This sounds suspiciously like a spambot. Why do you want to submit > comments in an automated fashion exactly? > > Cheers, > Chris > -- > http://blog.rebertia.com > -- > http://mail.python.org/mailman/listinfo/python-list > > Hello this is not spambot actually. it related with my blog scraper.. anyone can help me or advice much appreciate -- View this message in context: http://old.nabble.com/DOM-related-question-and-problem-tp26412730p26418556.html Sent from the Python - python-list mailing list archive at Nabble.com. -- http://mail.python.org/mailman/listinfo/python-list
Re: DOM related question and problem
On Wed, Nov 18, 2009 at 10:04 AM, elca wrote: > Hello, > these day im making python script related with DOM. > > problem is these day many website structure is very complicate . > > what is best method to check DOM structure and path.. > > i mean...following is some example. > > what is best method to check can extract such like following info quickly? > > before i was spent much time to extract such info . > > and yes im also new to python and DOM. > > IE.Document.Frames(1).Document.forms('comment').value = 'hello' > > if i use DOM inspector, can i extract such info quickly ? if so would you > show me some sample? > > here is some site . i want to extract some dom info. > > today i was spent all day long to extract what is dom info. but failed > > http://www.segye.com/Articles/News/Politics/Article.asp?aid=20091118001261&ctg1=06&ctg2=00&subctg1=06&subctg2=00&cid=010101060 > > at the end of this page,can find some comment input box. > > i want to know what kind of dom element should have to use, such like > > IE.Document.Frames(1).Document.forms('comment').value = 'hello' > > anyhelp much appreciate thanks This sounds suspiciously like a spambot. Why do you want to submit comments in an automated fashion exactly? Cheers, Chris -- http://blog.rebertia.com -- http://mail.python.org/mailman/listinfo/python-list
DOM related question and problem
Hello, these day im making python script related with DOM. problem is these day many website structure is very complicate . what is best method to check DOM structure and path.. i mean...following is some example. what is best method to check can extract such like following info quickly? before i was spent much time to extract such info . and yes im also new to python and DOM. IE.Document.Frames(1).Document.forms('comment').value = 'hello' if i use DOM inspector, can i extract such info quickly ? if so would you show me some sample? here is some site . i want to extract some dom info. today i was spent all day long to extract what is dom info. but failed http://www.segye.com/Articles/News/Politics/Article.asp?aid=20091118001261&ctg1=06&ctg2=00&subctg1=06&subctg2=00&cid=010101060 at the end of this page,can find some comment input box. i want to know what kind of dom element should have to use, such like IE.Document.Frames(1).Document.forms('comment').value = 'hello' anyhelp much appreciate thanks -- View this message in context: http://old.nabble.com/DOM-related-question-and-problem-tp26412730p26412730.html Sent from the Python - python-list mailing list archive at Nabble.com. -- http://mail.python.org/mailman/listinfo/python-list