Hey, nice work!

This reminds me of what we did some time back in a firm 
called HelpInHand. We built an application that would 
fetch a web page and turn it into a WML-page (dividing it 
into several pages if necessary). (See 
http://web.archive.org/web/20010721134013/http://helpinhand.com/) 
Mobile phone users then could surf anything.

Now, mobile phones using wml never was a great success, 
and nowadays, mobile phones surf html, so the application 
wasn't all that much used.

I used it a bit myself (along with the "customizer", that 
you may find some links to from the above link), for 
composing web pages with content from different sites (to 
view in an ordinary html broswer on a desktop computer).

Rebol did a great job parsing through web pages and 
determining their structure (or lack of such).

I reused som of that code when making the distorter 
(http://www.oops-as.no/roy/dis), another "funny web 
filter".


And some of that old code is still in use for the RIX (a 
rebol search engine indexing rebol stuff: 
http://www.oops-as.no/rix).

So, as we all seem to agree, rebol is great for web 
mining. Of all sorts.

HY

Dixit Thomas C <[EMAIL PROTECTED]> (Sat, 16 Apr 2005 
15:20:52 +0200):

>
>rebol is in fact quite useful for web mining which is the 
>reason i'm
>using it for a project i'm working on called webminutes
>(http://www.webminutes.org).
>
>parse grammars are in fact quite powerful for that!
>
>you might be interested in looking at how webminutes 
>works.
>unfortunately, the web site is only in french right now. 
>the code
>however is in english.
>
>so what is this webminutes concept? it was originally 
>designed for
>people like me who take public transport, have access to 
>a printer and
>haven't quite found their ideal newspaper. the idea is 
>thus to
>generate automatically a newspaper which you print at 
>work with
>content retrieved from various websites.
>
>you configure it once on the website using a php 
>interface by writing
>rebol code and parsing grammars in the html forms. you 
>then use daily
>the rebol/core scripts to generate your webminutes 
>newspaper.
>
>if there is a demand, i might look into making the 
>website
>multilingual... let me know.
>
>i believe i put some english documentation a while ago on 
>sourceforge
>but it might help but is out-of-date.
>
>        tc
>
>
>
>On 4/15/05, Ladislav Mecir <[EMAIL PROTECTED]> wrote:
>>=20
>> Kevin Wise napsal(a):
>>=20
>> >Not something I really need right now.  Thanks.=3D20
>> >
>> >-----Original Message-----
>> >From: Gregg Irwin 
>>[mailto:[EMAIL PROTECTED]
>> >Sent: Thursday, April 14, 2005 5:30 PM
>> >To: Pawel Gawronski
>> >Subject: [REBOL] Re: web mining
>> >
>> >
>> >Hi Pawe=3DB3,
>> >
>> >PG> However, there is a problem with statistial 
>>analysis needed to =3D
>> >implement
>> >PG> such features as like keyword and phrase extraction 
>>or n-gram and =
>=3D
>> >HMM
>> >models
>> >PG> simply becouse Rebol lacks proper functions and 
>>libraries (eg.
>> >chi-square
>> >PG> tests).
>> >
>> >Not my area, but chi-square is easy enough to do. Would 
>>it be useful
>> >to anyone, or does there need to be more to it than 
>>just the simple
>> >chi-square calculation?=3D20
>> >
>> >-- Gregg                        =3D20
>> >
>> >--=3D20
>> >To unsubscribe from the list, just send an email 
>>to=3D20
>> >lists at rebol.com with unsubscribe as the subject.
>> >
>> >
>> Some statistic functions are available at my site and I 
>>can offer more
>> implemented in Rebol, I was just too lazy to make them 
>>more visible.
>>=20
>> -L
>> --
>> To unsubscribe from the list, just send an email to
>> lists at rebol.com with unsubscribe as the subject.
>>=20
>>
>-- 
>To unsubscribe from the list, just send an email to 
>lists at rebol.com with unsubscribe as the subject.
>

-- 
To unsubscribe from the list, just send an email to 
lists at rebol.com with unsubscribe as the subject.

Reply via email to