On Wed, 2008-09-03 at 23:33 +0200, Thorsten Scherler wrote:
> On Wed, 2008-09-03 at 14:56 +0200, Ryan McKinley wrote:
> >...
> > With droids how would you make a file system crawler?
> 
> That depends on the use case. 
> 
> If you need link extraction (crawler droid) then I would create a file
> system plugin (implements Protocol). Then define the new protocol in the
> droids-core-context.xml like 
> <bean name="org.apache.droids.api.Protocol/file"
>     class="org.apache.droids.protocol.file.File" scope="prototype"/>
> 
> Maybe I can add an implementation tonight. 

done
http://svn.apache.org/viewvc?rev=691821&view=rev

...
> > Also, it seems that Parse.java assumes you are only working with  
> > text.  
> 
> With a textual representation of the incoming stream. However on the
> todo list one top priority is to reuse Tika for the parser
> implementation.

There is an issue open for this LABS-118.

> 
> > Within the DefaultWorker it looks like each uri is opened twice: first  
> > in getParse() then again in handle( Parse ).  Something about that  
> > feels wrong.
> 
> You mean it is more efficient to open the stream once and reuse it
> later, yeah I agree.

fix that
http://svn.apache.org/viewvc?rev=691826&view=rev

Thanks for pointing out.

salu2
-- 
Thorsten Scherler                                 thorsten.at.apache.org
Open Source Java                      consulting, training and solutions


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to