On Fri, 2008-01-11 at 15:48 +0200, Jukka Zitting wrote:
> Hi,
> 
> On Dec 16, 2007 7:46 PM, Thorsten Scherler <[EMAIL PROTECTED]> wrote:
> > Otis as well wrote me the other day asking about droids and I can repeat
> > what I told him: any help is welcome.
> 
> Have you looked at Apache Tika?

I can remember that I had a look last year but not really ATM. 

I could not really find any docs at http://incubator.apache.org/tika/
nor did http://svn.apache.org/repos/asf/incubator/tika/trunk/README.txt
told me where I can find a getting started guide.

>  I think it would make a nice
> dependency for droids, and if you like I'd be eager to look at
> integrating Tika with whatever you currently have.
> 

Actually I reckon tika can help defining the
org.apache.droids.parse.ParseData object of droids. Till now this are
just outlinks but since tika can extract many more I reckon we can
implement some methods there which will have a dep on tika. 

I need to finish migrating (still missing filter and handler factories)
but then it will be usable. I will use droids at work so I had some
business time to spend on it. 

In my usecase I will extend the DefaultCrawler and only implement 2-3
methods. I need to crawl a calendar and invoke a custom parsing with the
calendar objects.

I will write a separate mail when I finished the rewrite. I would be
nice if you could have a look at it then.

Thanks for your interest.

salu2 
> BR,
> 
> Jukka Zitting
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
-- 
Thorsten Scherler                                 thorsten.at.apache.org
Open Source Java                      consulting, training and solutions


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to