sample:
<!DOCTYPE html
PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml"
xml:lang="pl" lang="pl">
<head>
<meta http-equiv="Content-Type"
content="text/html; charset=iso-8859-2" />
<meta name="Description"
content="description about page" />
<meta name="Keywords"
content="bee, fly,monky" />
<title>page about
insects</title>
</head>
<body>
Insects (from Latin insectum, a calque of Greek ἔντομον [éntomon], "cut into
sections") are a class of invertebrates within the arthropod phylum that have
a chitinous exoskeleton, a three-part body (head, thorax and abdomen),
three pairs of jointed legs, compound eyes and one pair of antennae.
They are among the most diverse groups of animals on the
planet, including more than a million described species and representing
more than half of all known living organisms.[2][3]
The number of extant species is estimated at between six and ten million,[2][4][5]
and potentially represent over 90% of
the differing animal life forms on Earth.[6] Insects may be found in nearly
all environments, although only a small
number of species reside in the oceans, a habitat dominated by another
arthropod group, crustaceans.
</body>
</html>
I need something like:
Insects
(from
Latin
insectum,
a
calque
of
Greek
ἔντομον
[éntomon],
"cut
into
...
although
only
a
small
number
of
species
reside
in
the
oceans,
a
habitat
dominated
by
another
arthropod
group,
crustaceans
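
One way the conversion above could be sketched, shown here in Python with only the standard library (the thread itself names no language or library, so this is an illustration rather than a recommended solution). The names `BodyTextExtractor` and `html_to_words` are hypothetical. The sketch assumes that only text inside `<body>` is wanted, that `<script>` and `<style>` content should be dropped, and that "words" simply means whitespace-separated tokens, so punctuation stays attached as in the listing above.

```python
from html.parser import HTMLParser


class BodyTextExtractor(HTMLParser):
    """Collects text found inside <body>, skipping <script> and <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.in_body = False   # True while between <body> and </body>
        self.skip_depth = 0    # > 0 while inside a skipped element
        self.chunks = []       # raw text fragments, in document order

    def handle_starttag(self, tag, attrs):
        if tag == "body":
            self.in_body = True
        elif tag in self.SKIPPED:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "body":
            self.in_body = False
        elif tag in self.SKIPPED and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.in_body and self.skip_depth == 0:
            self.chunks.append(data)


def html_to_words(html_text):
    """Return the body text of an HTML document as a list of words."""
    parser = BodyTextExtractor()
    parser.feed(html_text)
    parser.close()
    # Assumption: fragments are joined with a space, so a word that is
    # split by an inline tag (e.g. <b>In</b>sects) becomes two tokens.
    return " ".join(parser.chunks).split()
```

Calling `html_to_words(page_source)` on the sample page would give a list starting with `"Insects"`, `"(from"`, `"Latin"`. The page source has to be decoded to a string first; for the sample that would mean decoding the downloaded bytes with the charset the page actually uses.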
On 2014-06-05 07:15, John Myles White wrote:
I'm still lost. Do you want to print out the text for an HTML table to
STDOUT?
-- John
On Jun 4, 2014, at 9:33 AM, Kevin Squire <kevin.squ...@gmail.com> wrote:
I would think "web page" = "HTML document".
On Wednesday, June 4, 2014, John Myles White
<johnmyleswh...@gmail.com> wrote:
I don't really understand what you mean by web page.
-- John
> On Jun 4, 2014, at 9:20 AM, Paul Analyst <paul.anal...@mail.com> wrote:
>
>
> Any help?
> How can I read the content of a web page and convert it to a vector of
> words, e.g. ["word1", "word2", "word3", ..., "wordlast"]?
>
> On 2014-06-04 12:47, paul analyst wrote:
>> How can I read the content of a web page and convert it to a vector of
>> words, e.g. ["word1", "word2", "word3", ..., "wordlast"]?
>>
>> Paul
>