Hey, nice work! This reminds me of what we did some time back in a firm called HelpInHand. We built an application that would fetch a web page and turn it into a WML-page (dividing it into several pages if necessary). (See http://web.archive.org/web/20010721134013/http://helpinhand.com/) Mobile phone users then could surf anything.
Now, mobile phones using wml never was a great success, and nowadays, mobile phones surf html, so the application wasn't all that much used. I used it a bit myself (along with the "customizer", that you may find some links to from the above link), for composing web pages with content from different sites (to view in an ordinary html broswer on a desktop computer). Rebol did a great job parsing through web pages and determining their structure (or lack of such). I reused som of that code when making the distorter (http://www.oops-as.no/roy/dis), another "funny web filter". And some of that old code is still in use for the RIX (a rebol search engine indexing rebol stuff: http://www.oops-as.no/rix). So, as we all seem to agree, rebol is great for web mining. Of all sorts. HY Dixit Thomas C <[EMAIL PROTECTED]> (Sat, 16 Apr 2005 15:20:52 +0200): > >rebol is in fact quite useful for web mining which is the >reason i'm >using it for a project i'm working on called webminutes >(http://www.webminutes.org). > >parse grammars are in fact quite powerful for that! > >you might be interested in looking at how webminutes >works. >unfortunately, the web site is only in french right now. >the code >however is in english. > >so what is this webminutes concept? it was originally >designed for >people like me who take public transport, have access to >a printer and >haven't quite found their ideal newspaper. the idea is >thus to >generate automatically a newspaper which you print at >work with >content retrieved from various websites. > >you configure it once on the website using a php >interface by writing >rebol code and parsing grammars in the html forms. you >then use daily >the rebol/core scripts to generate your webminutes >newspaper. > >if there is a demand, i might look into making the >website >multilingual... let me know. > >i believe i put some english documentation a while ago on >sourceforge >but it might help but is out-of-date. > > tc > > > >On 4/15/05, Ladislav Mecir <[EMAIL PROTECTED]> wrote: >>=20 >> Kevin Wise napsal(a): >>=20 >> >Not something I really need right now. Thanks.=3D20 >> > >> >-----Original Message----- >> >From: Gregg Irwin >>[mailto:[EMAIL PROTECTED] >> >Sent: Thursday, April 14, 2005 5:30 PM >> >To: Pawel Gawronski >> >Subject: [REBOL] Re: web mining >> > >> > >> >Hi Pawe=3DB3, >> > >> >PG> However, there is a problem with statistial >>analysis needed to =3D >> >implement >> >PG> such features as like keyword and phrase extraction >>or n-gram and = >=3D >> >HMM >> >models >> >PG> simply becouse Rebol lacks proper functions and >>libraries (eg. >> >chi-square >> >PG> tests). >> > >> >Not my area, but chi-square is easy enough to do. Would >>it be useful >> >to anyone, or does there need to be more to it than >>just the simple >> >chi-square calculation?=3D20 >> > >> >-- Gregg =3D20 >> > >> >--=3D20 >> >To unsubscribe from the list, just send an email >>to=3D20 >> >lists at rebol.com with unsubscribe as the subject. >> > >> > >> Some statistic functions are available at my site and I >>can offer more >> implemented in Rebol, I was just too lazy to make them >>more visible. >>=20 >> -L >> -- >> To unsubscribe from the list, just send an email to >> lists at rebol.com with unsubscribe as the subject. >>=20 >> >-- >To unsubscribe from the list, just send an email to >lists at rebol.com with unsubscribe as the subject. > -- To unsubscribe from the list, just send an email to lists at rebol.com with unsubscribe as the subject.
