rebol is in fact quite useful for web mining which is the reason i'm
using it for a project i'm working on called webminutes
(http://www.webminutes.org).
parse grammars are in fact quite powerful for that!
you might be interested in looking at how webminutes works.
unfortunately, the web site is only in french right now. the code
however is in english.
so what is this webminutes concept? it was originally designed for
people like me who take public transport, have access to a printer and
haven't quite found their ideal newspaper. the idea is thus to
generate automatically a newspaper which you print at work with
content retrieved from various websites.
you configure it once on the website using a php interface by writing
rebol code and parsing grammars in the html forms. you then use daily
the rebol/core scripts to generate your webminutes newspaper.
if there is a demand, i might look into making the website
multilingual... let me know.
i believe i put some english documentation a while ago on sourceforge
but it might help but is out-of-date.
tc
On 4/15/05, Ladislav Mecir <[EMAIL PROTECTED]> wrote:
>=20
> Kevin Wise napsal(a):
>=20
> >Not something I really need right now. Thanks.=3D20
> >
> >-----Original Message-----
> >From: Gregg Irwin [mailto:[EMAIL PROTECTED]
> >Sent: Thursday, April 14, 2005 5:30 PM
> >To: Pawel Gawronski
> >Subject: [REBOL] Re: web mining
> >
> >
> >Hi Pawe=3DB3,
> >
> >PG> However, there is a problem with statistial analysis needed to =3D
> >implement
> >PG> such features as like keyword and phrase extraction or n-gram and =
=3D
> >HMM
> >models
> >PG> simply becouse Rebol lacks proper functions and libraries (eg.
> >chi-square
> >PG> tests).
> >
> >Not my area, but chi-square is easy enough to do. Would it be useful
> >to anyone, or does there need to be more to it than just the simple
> >chi-square calculation?=3D20
> >
> >-- Gregg =3D20
> >
> >--=3D20
> >To unsubscribe from the list, just send an email to=3D20
> >lists at rebol.com with unsubscribe as the subject.
> >
> >
> Some statistic functions are available at my site and I can offer more
> implemented in Rebol, I was just too lazy to make them more visible.
>=20
> -L
> --
> To unsubscribe from the list, just send an email to
> lists at rebol.com with unsubscribe as the subject.
>=20
>
--
To unsubscribe from the list, just send an email to
lists at rebol.com with unsubscribe as the subject.