Hi Rick,
I had looked at the XML stuff but have not played with it yet. Most of the
standard web page stuff is easy to get but don't use the .text of a tag to get
it because I want to avoid being forced to use a browser; for limitation arise
based on the browser being used.
So, I know the standard HTML tags, HTML5 and XML are a little different but
have standard structures to draw from.
Most stuff displayed on the screen are sequential and left to right, so I
get all that information, including the extracting link display text then place
the link= after that text on the next line, where a number is placed after the
equals sign, which points to a key inside a dictionary to fetch the link
itself. I have a hotkey to display that link or save it in your clipboard if
you want that link to transport or use.
My browser, which Breaking Ness kind of is, so as to avoid the mechanics of
a website where they either want to monitor you or spy on you, or track you.
Granted that some stuff does not work, for some sites require a paid
subscription and a login. So some don't work, but the majority do using my
approach, avoiding being actually on the web site and having to use a browser
that they look for using the object tags to capture your computer info.
But as I had said, HTML5 has some interesting objects to use and will have
to study that for my future project. To run apps and such from.
My program/app gives you either a tree view of stored web pages or you can
list the web page of interest in a list view and expand that list view or sort
that list view alphabetically if you are interested in something and want to
use a quick search to find it.
All hotkeys are the same for either view technique. The third view is the
downloaded webpage inside a text box. There I just expose the link as a link=#
where # is the key inside a dictionary of links.
You can even use the text box to go to your own link but of course have to
know the structure of a link or source tag which is either href= or src= where
the path to the link is inside quotes.
Format: href="LinkPath" or src="LinkPath"
These are standards but in HTML 5 you have much more you can use which
allows objects to run stuff...
As I had mentioned, avoiding browser usage keeps you from many web errors
and I can extract all data in the order left to right, top to bottom; which is
the usual norm but not always.
Give my app a try and let me know what is a pain and what you like. As I
had said, I do not allow running of scripts for it is downloaded as a text for
a text box. Running the script means using the hotkey to take you to the
link/page of interest. Which means loading it into your browser on your
computer.
So time to get moving outside to take advantage of the perfect weather
today.
Bruce
Sent: Monday, September 08, 2014 8:10 AM
Subject: Bruce, RSS Feeds As Option
Hi Bruce: One method I have used to pick up dynamic news was to use the RSS
Feeds provided by some sites.
You can access the page, use xml to process the feeds by displaying the
standard feed subjects in a listbox and then either linking to the news item or
downloading it, if necessary - usually the content is provided in the feed so
you don't have to download anything but can display the content in a listbox as
the user arrows up and down the news item titles in the listbox.
It has been some time so I forget the exact technical requirements to do this
but I have done it a few times and it is allot cleaner than scraping many pages
and how news is usually provided to application programs and client Websites.
Just a heads up for future research and expanding the possibilities and
possibilities are usually good.
Rick USA
---
This email is free from viruses and malware because avast! Antivirus protection
is active.
http://www.avast.com