On Wed, 2008-09-03 at 23:33 +0200, Thorsten Scherler wrote: > On Wed, 2008-09-03 at 14:56 +0200, Ryan McKinley wrote: > >... > > With droids how would you make a file system crawler? > > That depends on the use case. > > If you need link extraction (crawler droid) then I would create a file > system plugin (implements Protocol). Then define the new protocol in the > droids-core-context.xml like > <bean name="org.apache.droids.api.Protocol/file" > class="org.apache.droids.protocol.file.File" scope="prototype"/> > > Maybe I can add an implementation tonight.
done http://svn.apache.org/viewvc?rev=691821&view=rev ... > > Also, it seems that Parse.java assumes you are only working with > > text. > > With a textual representation of the incoming stream. However on the > todo list one top priority is to reuse Tika for the parser > implementation. There is an issue open for this LABS-118. > > > Within the DefaultWorker it looks like each uri is opened twice: first > > in getParse() then again in handle( Parse ). Something about that > > feels wrong. > > You mean it is more efficient to open the stream once and reuse it > later, yeah I agree. fix that http://svn.apache.org/viewvc?rev=691826&view=rev Thanks for pointing out. salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
