Hi Rick,

    I had looked at the XML stuff but have not played with it yet. Most of the 
standard web page stuff is easy to get but don't use the .text of a tag to get 
it because I want to avoid being forced to use a browser; for limitation arise 
based on the browser being used.

    So, I know the standard HTML tags, HTML5 and XML are a little different but 
have standard structures to draw from.

    Most stuff displayed on the screen are sequential and left to right, so I 
get all that information, including the extracting link display text then place 
the link= after that text on the next line, where a number is placed after the 
equals sign, which points to a key inside a dictionary to fetch the link 
itself. I have a hotkey to display that link or save it in your clipboard if 
you want that link to transport or use.

    My browser, which Breaking Ness kind of is, so as to avoid the mechanics of 
a website where they either want to monitor you or spy on you, or track you.

    Granted that some stuff does not work, for some sites require a paid 
subscription and a login. So some don't work, but the majority do using my 
approach, avoiding being actually on the web site and having to use a browser 
that they look for using the object tags to capture your computer info.

    But as I had said, HTML5 has some interesting objects to use and will have 
to study that for my future project. To run apps and such from.

    My program/app gives you either a tree view of stored web pages or you can 
list the web page of interest in a list view and expand that list view or sort 
that list view alphabetically if you are interested in something and want to 
use a quick search to find it.
    All hotkeys are the same for either view technique. The third view is the 
downloaded webpage inside a text box. There I just expose the link as a link=# 
where # is the key inside a dictionary of links.

    You can even use the text box to go to your own link but of course have to 
know the structure of a link or source tag which is either href= or src= where 
the path to the link is inside quotes.
Format: href="LinkPath" or src="LinkPath"

    These are standards but in HTML 5 you have much more you can use which 
allows objects to run stuff...

    As I had mentioned, avoiding browser usage keeps you from many web errors 
and I can extract all data in the order left to right, top to bottom; which is 
the usual norm but not always.

    Give my app a try and let me know what is a pain and what you like. As I 
had said, I do not allow running of scripts for it is downloaded as a text for 
a text box. Running the script means using the hotkey to take you to the 
link/page of interest. Which means loading it into your browser on your 
computer.

    So time to get moving outside to take advantage of the perfect weather 
today.

        Bruce










  Sent: Monday, September 08, 2014 8:10 AM
  Subject: Bruce, RSS Feeds As Option


  Hi Bruce: One method I have used to pick up dynamic news was to use the RSS 
Feeds provided by some sites.

  You can access the page, use xml to process the feeds by displaying the 
standard feed subjects in a listbox and then either linking to the news item or 
downloading it, if necessary - usually the content is provided in the feed so 
you don't have to download anything but can display the content in a listbox as 
the user arrows up and down the news item titles in the listbox.

  It has been some time so I forget the exact technical requirements to do this 
but I have done it a few times and it is allot cleaner than scraping many pages 
and how news is usually provided to application programs and client Websites.

  Just a heads up for future research and expanding the possibilities and 
possibilities are usually good.

  Rick USA




---
This email is free from viruses and malware because avast! Antivirus protection 
is active.
http://www.avast.com

Reply via email to