On Thu, Sep 01, 2005 at 10:30:29PM -0400, Henri Yandell wrote:
> Never got round to adding it to Commons, robots.txt parser:
>
> http://www.osjava.org/norbert/ ->
> http://www.robotstxt.org/wc/norobots-rfc.html
>
> Web-spider:
>
> http://www.osjava.org/scraping-engine/
>
> HTML pseudo-scraper (probably more for Jakarta Silk/Web Components):
>
> http://www.osjava.org/genjava/multiprojects/gj-scrape/ (poor site at
> the moment, it's a substring()/indexOf() parsing system instead of
> trying to be fancy).
>
> Hen
>
Henri,

I think a web spider and robots.txt parser would be a welcome addition
to the project. If you are personally interested in porting these
applications to use HttpClient / Http Components, go ahead and add the
web spider to the project goals and yourself to the list of initial
committers. In my opinion, voting you in as a committer is a matter of
formality.

Oleg

> On 8/31/05, Michael Becke <[EMAIL PROTECTED]> wrote:
> > True, but that is somewhat of an unofficial version. The real one is
> > at
> > <http://svn.apache.org/repos/asf/jakarta/commons/proper/httpclient/trunk/PROPOSAL.html>
> > and it does specifically mention client side HTTP:
> >
> > (1) Scope of the Package
> >
> > The package shall create and maintain a Java library implementing the
> > client side of the HTTP/1.1 protocol, as defined in RFC 2616 and RFC
> > 2617.
> >
> > Mike
> >
> > On 8/31/05, Oleg Kalnichevski <[EMAIL PROTECTED]> wrote:
> > > On Wed, 2005-08-31 at 13:28 -0700, Eric Johnson wrote:
> > > > A good start indeed.
> > > >
> > > > It seems to me that explicitly identifying the RFCs and specs that will
> > > > be addressed would go a long way towards clarifying what it *DOES* and
> > > > *DOES NOT* do.
> > > >
> > >
> > > Good idea, Eric. Actually after having looked at the HttpClient features
> > > page [1] I just realized that even existing project scope does not
> > > inhibit development of server side components. The first line clearly
> > > states "Standards based, pure Java, implementation of HTTP versions 1.0
> > > and 1.1". There's no mentioning of client-side only scope.
> > >
> > > Oleg
> > >
> > > [1] http://jakarta.apache.org/commons/httpclient/features.html
> > >
> > > > -Eric.
> > > >
> > > > Oleg Kalnichevski wrote:
> > > >
> > > > >On Sun, 2005-08-28 at 17:25 -0400, Michael Becke wrote:
> > > > >
> > > > >
> > > > >>Now that we've gotten the ball rolling, let's try to focus a little
> > > > >>more on HttpClient's charter as we move to Jakarta level.
> > > > >>This is the document that will define our goals, name, and what
> > > > >>exactly is or isn't in bounds as far as development goes.
> > > > >>
> > > > >>
> > > > >
> > > > >Folks,
> > > > >
> > > > >I have taken liberty of throwing together a first draft of the new
> > > > >project charter based on the ideas expressed so far
> > > > >
> > > > >http://wiki.apache.org/jakarta-httpclient/NewProjectCharter
> > > > >
> > > > >Please do not be shy and add your ideas, comments, and raise objections
> > > > >if you think something is wrong
> > > > >
> > > > >Oleg
> > > > >
> > > > >---------------------------------------------------------------------
> > > > >To unsubscribe, e-mail: [EMAIL PROTECTED]
> > > > >For additional commands, e-mail: [EMAIL PROTECTED]
